Hi, I’m Zoey Genesis, a Ph.D.-level machine learning professional focused on building data-centric AI systems that perform reliably in production. I have 3+ years of hands-on experience optimizing models, designing scalable data pipelines, and leading annotation strategies for large NLP projects at AWS, Hugging Face, and OpenAI. I enjoy turning complex data challenges into robust, end-to-end solutions that drive real-world impact. I love collaborating with cross-functional teams to implement data curation workflows, reduce labeling costs, and push the boundaries of transfer learning and few-shot methods. I’m excited to bring my practical, data-centric approach to a forward-thinking tech company and help create innovative, production-ready AI systems.

Zoey Genesis

Hi, I’m Zoey Genesis, a Ph.D.-level machine learning professional focused on building data-centric AI systems that perform reliably in production. I have 3+ years of hands-on experience optimizing models, designing scalable data pipelines, and leading annotation strategies for large NLP projects at AWS, Hugging Face, and OpenAI. I enjoy turning complex data challenges into robust, end-to-end solutions that drive real-world impact. I love collaborating with cross-functional teams to implement data curation workflows, reduce labeling costs, and push the boundaries of transfer learning and few-shot methods. I’m excited to bring my practical, data-centric approach to a forward-thinking tech company and help create innovative, production-ready AI systems.

Available to hire

Hi, I’m Zoey Genesis, a Ph.D.-level machine learning professional focused on building data-centric AI systems that perform reliably in production. I have 3+ years of hands-on experience optimizing models, designing scalable data pipelines, and leading annotation strategies for large NLP projects at AWS, Hugging Face, and OpenAI. I enjoy turning complex data challenges into robust, end-to-end solutions that drive real-world impact.

I love collaborating with cross-functional teams to implement data curation workflows, reduce labeling costs, and push the boundaries of transfer learning and few-shot methods. I’m excited to bring my practical, data-centric approach to a forward-thinking tech company and help create innovative, production-ready AI systems.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent
French
Intermediate

Work Experience

Lead Machine Learning Engineer at Amazon Web Services (AWS)
January 1, 2023 - Present
Led design and deployment of data annotation tools for large-scale NLP datasets; developed machine learning pipelines for model evaluation and fine-tuning; spearheaded annotation process improvements, reducing error rates by 30%; guided cross-functional teams to ensure smooth integration of annotation workflows into production.
Senior Research Engineer at Hugging Face
January 1, 2021 - January 1, 2023
Designed cutting-edge data preprocessing tools for training deep learning models; developed novel techniques for improving the quality and efficiency of data labeling in NLP; collaborated with research teams to optimize annotation schema and validation protocols; published multiple papers on data-centric techniques.
Research Intern (Ph.D.) at OpenAI
June 1, 2021 - August 31, 2021
Contributed to research on training models with human-in-the-loop feedback; designed a data curation system that minimized labeling cost while maintaining quality; assisted in refining and validating large-scale language models.
Data Science Intern at Kaggle (Google)
January 1, 2019 - January 1, 2020
Analyzed large datasets and helped develop tools to streamline data labeling workflows; collaborated with data scientists to build data pipelines for classification tasks; enhanced annotation automation through custom scripting and model-assisted tagging.

Education

PhD in Artificial Intelligence (AI & Data Engineering) at Stanford University
January 1, 2017 - January 1, 2022
Master of Science in Computer Science at Stanford University
January 1, 2019 - January 1, 2019
Bachelor of Science in Computer Science at University of Texas at Austin
January 1, 2013 - January 1, 2017

Qualifications

AWS Certified Machine Learning Specialty
January 1, 2023 - January 12, 2026
Google AI Residency Finalist
January 1, 2022 - January 12, 2026

Industry Experience

Software & Internet, Professional Services, Media & Entertainment