Scientist-Computational Biology, Biomarker Platform Development

Location: San Francisco (on-site post-COVID)

1 Contract with Extension Possibilities

Summary

The Clinical Biomarkers & Diagnostics (CBD) department is looking for a highly motivated Computational Biologist with expertise in clinical bioinformatics, biomarker development, assay development, data ETL and application development. This role operates within a multi-disciplinary setting, working closely with computational biologists, biomarker scientists and data scientists to develop data ingestion, platform analysis and applications for analyzing and visualizing clinical trial datasets, with a focus on clinical biomarkers.

The candidate must be familiar with the terminology and meaning of data entities in the context of clinical trials. Familiarity with diagnostics and the common analytes used in clinical trials is required, as is a strong command of clinical metadata and expected values, and of analytes and their dynamic ranges and units. Experience in the clinical domain with data ETL and application development in Python is a must-have. Expertise in data visualization and familiarity with full-stack development are needed. Excellent communication skills and great attention to detail are required. Experience with clinical trial datasets and statistical analysis is a plus. Experience with Dash, AngularJS, Flask or R Shiny is a plus.

Core Responsibilities

Development of data ETL, QC, data store architecture, schema definitions, ingestion, transformation and QC logic
Full-stack application development & deployment, integration with analysis modules, data visualization

Top 3 Must Have Skill Sets:

Ability to analyze and interpret clinical biomarker data
Expert level knowledge of machine learning and analytical techniques
Experience with or interest in application development (Dash)

Day to Day Responsibilities

Partner with Clinical Data Management, Study Management and Statistical Programming on development and standardization of data ingestion processes
Define acceptance standards for data generated using a number of diagnostic technologies such as flow cytometry, NanoString, NGS diagnostic assays and imaging platforms
Perform residual data QC as needed
Document acceptance standards, socialize them and educate stakeholders
Author code/scripts to perform automatic quality control, formatting and transformation of data to adhere to defined standards
Document values, dynamic ranges and controlled vocabulary, and obtain organization-wide approval for this intellectual property
Ensure that QCed data for active programs is delivered in an analyzable format in a timely manner
Ensure that data is stored in a manner that adheres to internal standards for storage
Make recommendations for process improvements
Support the development of the web application through integration with the backend database and development of front-end user interfaces and visualizations
Support the deployment of the web application
Work with computational biologists, biomarker scientists and data scientists to develop new features and visualizations
Monitor usage of the web application and provide operational support
Optimize the design and performance of the web application
Write unit tests, a user manual, best practices and internal documentation for the web application