Experienced Computational Scientist

  • Cross-Disciplinary Expertise
    Over a decade of experience at the intersection of biology, chemistry, and machine learning—spanning metabolomics, proteomics, genomics, and drug discovery.
  • Industry-Backed Innovation
    Led high-impact projects at global companies and startups, including Datacca, Triplebar, Brightseed, Amyris, Hexagon Bio, and Mondelez, building platforms used by R&D and commercial teams.
  • Scientific Rigor & Engineering Precision
    Combines deep scientific knowledge with advanced software and cloud engineering to turn raw biological data into scalable, actionable systems.

Machine Learning in Biosciences

  • Genome Language Models
    Trained and fine-tuned fungal genome language models to optimize protein expression and secretion, accelerating synthetic biology applications.
  • Protein Language Models
    Fine-tuned antibody-specific PLMs to enhance sequence design with demonstrably improved functional outcomes
  • Molecule Fingerprints from Mass Spectrometry
    Developed machine learning models that mapped 20,000+ plant compounds to human health outcomes, driving a 1000x increase in phytonutrient discovery throughput.
  • Fungal Compound Discovery
    Designed and deployed scalable workflows on Google Cloud Vertex AI using GATK, samtools, Dragen-OS, and other bioinformatics tools to streamline genomic analysis at scale.

Omics Infrastructure & Pipelines

🧫 Genomics

  • Scalable DNA Processing
    Designed and deployed scalable workflows on Google Cloud Vertex AI using GATK, samtools, Dragen-OS, and other bioinformatics tools to streamline genomic analysis at scale.

🧪 Proteomics

  • End-to-End Proteomics Platform
    Built a comprehensive proteomics pipeline supporting the entire research workflow—from benchside experimentation to insights-driven decision-making by scientists.

🧉 Metabolomics

  • Scalable DNA Processing
    Developed high-throughput metabolomics pipelines capable of processing terabyte-scale LC-MS/MS datasets for accelerated discovery.

Data Science & Visualization

  • Multi-Omics Platform for R&D
    Built interactive Python Dash and FastHTML dashboards to empower scientists to explore complex datasets and extract actionable insights across multi-omics domains.

Infrastructure & Scalability

  • Cloud Bioinformatics
    Architected and deployed scalable, production-ready infrastructure to support AI/ML-driven genome and antibody design workflows in cloud environments.