I am a Data Scientist and Bioinformatician (PhD level) with over 4 years of experience analyzing large-scale human genomics, transcriptomics and epigenomics data in clinical and translational research. I combine applied statistics, machine learning, and robust data pipelines to extract actionable insights from complex biological data. My work spans NGS data quality control, cross-sectional analyses, and multi-omics integration to support disease risk stratification and health-related outcomes predictions. I enjoy collaborating with clinicians and researchers to translate omics signals into mechanistic hypotheses and practical applications for patient care.

Catarina dos Santos Gomes

I am a Data Scientist and Bioinformatician (PhD level) with over 4 years of experience analyzing large-scale human genomics, transcriptomics and epigenomics data in clinical and translational research. I combine applied statistics, machine learning, and robust data pipelines to extract actionable insights from complex biological data. My work spans NGS data quality control, cross-sectional analyses, and multi-omics integration to support disease risk stratification and health-related outcomes predictions. I enjoy collaborating with clinicians and researchers to translate omics signals into mechanistic hypotheses and practical applications for patient care.

Available to hire

I am a Data Scientist and Bioinformatician (PhD level) with over 4 years of experience analyzing large-scale human genomics, transcriptomics and epigenomics data in clinical and translational research. I combine applied statistics, machine learning, and robust data pipelines to extract actionable insights from complex biological data.

My work spans NGS data quality control, cross-sectional analyses, and multi-omics integration to support disease risk stratification and health-related outcomes predictions. I enjoy collaborating with clinicians and researchers to translate omics signals into mechanistic hypotheses and practical applications for patient care.

See more

Experience Level

Expert
Expert
Expert
Intermediate

Language

Portuguese
Fluent
English
Fluent
Spanish; Castilian
Intermediate
French
Beginner

Work Experience

Bioinformatician – Advanced Analytics & Multi-Omics at Albert Einstein Israelite Hospital
January 1, 2022 - Present
Designed, maintained, and optimized scalable data science pipelines for biobank-scale human genomics and multi-omics datasets. Developed and validated polygenic risk scores for common diseases in diverse populations using genomic and EHR-linked data. Led data quality control and statistical validation to ensure robustness and reproducibility of analytical results. Integrated WGS, RNA-seq, and clinical data to identify candidate molecular mechanisms in unresolved rare disease cases. Applied machine learning approaches to multi-omics integration and phenotype derivation.
PhD Researcher – Bioinformatics & Statistical Genetics at University of São Paulo
January 1, 2021 - December 31, 2025
Conducted large-scale statistical analyses of genotype array data from longitudinal birth cohorts. Built reproducible analytical pipelines to model associations between polygenic risk and neurodevelopmental outcomes. Performed gene–environment interaction analyses integrating genetic, environmental, and clinical variables.
Research Assistant – Bioinformatics at University of São Paulo
January 1, 2019 - December 31, 2020
Processed and analyzed whole-exome sequencing data from autism spectrum disorder cohorts. Applied unsupervised machine learning techniques to characterize phenotypic heterogeneity in large consortium datasets.

Education

PhD in Bioinformatics at University of São Paulo, Brazil
January 1, 2021 - December 31, 2025
Guest Researcher at Max Planck Institute for Human Development
January 1, 2024 - February 1, 2024

Qualifications

Gene-environment interactions in human health and disease
January 1, 2025 - December 31, 2025
Large-language models in bioinformatics
January 1, 2025 - December 31, 2025
Machine learning for genomics
January 1, 2020 - January 1, 2020
International Society for Computational Biology
January 1, 2020 - December 31, 2020

Industry Experience

Healthcare, Life Sciences, Professional Services, Education, Software & Internet

Experience Level

Expert
Expert
Expert
Intermediate

Hire a Data Scientist

We have the best data scientist experts on Twine. Hire a data scientist today.