Valo Health

Data Engineer II, Translational Data Science

Job Posted 4 days ago

Job Description

About Us

 

Valo Health is a human-centric, AI-enabled biotechnology company working to make new drugs for patients faster. The company’s Opal Computational Platform transforms drug discovery and development through a unique combination of real-world data, AI, human translational models and predictive chemistry. 

Our talented team of biologists, chemists and engineers, armed with advanced AI/ML tools, work together to break down traditional R&D silos and accelerate the speed and scale of drug discovery and development. 

Valo is committed to hiring diverse talent, prioritizing growth and development, fostering an inclusive environment, and creating opportunities to bring together a group of different experiences, backgrounds, and voices to work together. We embrace new ways of learning, solve complex problems and welcome diverse perspectives that can help us advance patient-centric innovation. 

Valo is headquartered in Lexington, MA, with additional offices in New York, NY and Tel Aviv, Israel.  To learn more, visit www.valohealth.com.  

 

About the role 

You will be part of the data engineering core in the Translational Data Sciences group, working with data engineers and data scientists building powerful computational tools and answering critical scientific questions about patients, diseases, and drug development. In this role, you will contribute to Valo’s EHR data engineering capabilities, transforming structured and unstructured data from proprietary sources into analysis-ready data products for internal teams. To do so you will work closely with data engineers and data scientists supporting epidemiology, patient clustering, and biomarker identification. 

What You'll Do...

  • Work on a team developing data transformation pipelines and systems to ingest and harmonize data in Valo’s data ecosystems.  
  • Contribute to the data source backend for our epidemiology and patient machine learning. 
  • Build visualization and data extraction tools. 
  • Work using agile development techniques. 
  • Learn to provide high quality data through unit tests, validation tests, code reviews.   
  • Be a dynamic and active team member. 

What You Bring...

  • Bachelor’s + 2 years of experience or recent Master’s degree in computer science, information systems, computational sciences (e.g. bioinformatics, computational biology, epidemiology), or related fields. 
  • Familiarity with various clinical data types (e.g., EHR/EMR, clinical trials, claims).  
  • Familiarity with any medical coding systems is a plus, (e.g. ICD9/10, CPT, ATC, LOINC, SNOMED, UMLS). 
  • Must have experience in Python and SQL (e.g. PostgreSQL, MySQL, etc.). 
  • Familiarity with releases/versioning of datasets for internal users. 
  • Familiarity with data processing workflows, data platforms (e.g., spark, snowflake) , and cloud environments (e.g., AWS, GCP). 
  • Experience with data and software engineering best practices and testing methodologies (data provenance, collaborative development using source control management (git, bitbucket), code versioning, reproducibility, etc.). 

 

Remote Salary Range
$125,000$164,000 USD

Ready for Your Next Step?

To apply for this position, please use the link below. You will be redirected to the official application page on the company's website.