Statistical Data Engineer/Scientist at Whole Biome
San Francisco, CA, US

Whole Biome is on a mission to help people improve physical and mental health by creating a new category of products that target the microbiome. We are researching, developing and commercializing a novel class of rationally-designed Live Synbiotics™ (probiotics + prebiotics) that have demonstrated clinical efficacy to treat conditions like metabolic syndrome, inflammation and neurodegeneration. Whole Biome has created proprietary pipelines to build a unique discovery platform that identifies key, novel bacterial strains and the prebiotics that feed them.
We are a highly collaborative team of scientists, engineers, physicians, marketers and salespeople interested in improving human health by using the latest research from diverse fields such as microbiology, molecular biology, high-throughput genomics, distributed computing, pharmaceutical development and nutrition. We believe strongly in individual transparency and strong communication as the most effective and efficient path to team success. If you’re interested in building a new category of products that will help improve the lives of people globally and you love working in a cross-functional, collaborative, inspiring environment, please continue reading!

Whole Biome is seeking a Statistical Data Engineer/Scientist to join the Compute team. The person will contribute to the Whole Biome mission by working with meaningful data to generate impactful evidence and insights on our products that improve people’s health. The ideal candidate will have experience in successfully developing and deploying cloud-based bioinformatics services and prototyping robust statistical models. This position will interact closely with the R&D teams.


  • Prototype, build, and maintain in-house software for high-throughput bioinformatics pipelines and web services
  • Prototype robust, scalable statistical models in analytics languages (Python, R, C/C++)
  • Document, test and release software services in a continuous fashion
  • Continue to develop a comprehensive knowledge of the fundamentals of statistical predictive modeling, machine learning, and data mining
  • Develop a deep understanding of the data we work with and foster learning with colleagues using analytical tools


  • PhD (or Master's) plus more than 4 years of experience in a quantitative data science discipline (e.g., statistics, computer science, engineering, biostatistics, biophysics)
  • Solid object-oriented design and implementation skills (e.g., Python)
  • Experience in Unix/Linux environments using Bash, Make and related Unix tools
  • Familiarity with AWS is a big plus
  • Demonstrated entrepreneurial mindset and self-direction, ability to teach others and willingness to learn new techniques
  • Desire to work in a cross-functional, collaborative environment that values a balanced life
  • Belief in transparency and open communication, and in an individual’s power to improve and enable those around them
  • Nice to have
    - Experience implementing advanced analytics approaches (machine learning, longitudinal data analysis, etc.)
    - Contributor to open source packages, libraries or functions
    - Demonstrated track record of developing and executing patient-level data analyses (e.g., real-world data, surveys, clinical trials, registries, claims or genomic data)

Email us your resume and cover letter to begin the application process. Even if you don't see an opening that fits you, please send us your resume if you think your background and personality would be a great fit!


  • competitive salary and equity packages
  • health, dental, and vision
  • 401k with corporate matching
  • flexible schedules with a focus on work/life balance
  • unlimited vacation policy (we're all adults and professionals, not clockwatchers)
  • commuter benefits

Sweet Perks

  • casual culture (66% of founders usually wearing hoodies)
  • on-site gym
  • work/life balance: we believe it, we encourage it, we live it
  • policies that strongly support maternity/paternity leave
  • collaborative team environment
  • off-site team-building adventures
  • walking distance to Caltrain and Muni
  • walking distance to bars and restaurants in Dogpatch