Hi, I’m Zachary Butler—a PhD‑level AI/CS professional with over 11 years of experience in AI model training, data labeling, and machine learning pipelines. I’ve led large-scale labeling projects for computer vision, NLP, and geospatial AI, and I’ve built robust workflows using LabelStudio, CVAT, Prodigy, and related tools to improve data quality and speed up iteration. I’m passionate about ethical AI and collaboration across product, engineering, and research teams. I enjoy turning complex data challenges into scalable, reliable labeling systems and contributing to open-source AI tooling so others can build better models faster.

Zachary Butler

Hi, I’m Zachary Butler—a PhD‑level AI/CS professional with over 11 years of experience in AI model training, data labeling, and machine learning pipelines. I’ve led large-scale labeling projects for computer vision, NLP, and geospatial AI, and I’ve built robust workflows using LabelStudio, CVAT, Prodigy, and related tools to improve data quality and speed up iteration. I’m passionate about ethical AI and collaboration across product, engineering, and research teams. I enjoy turning complex data challenges into scalable, reliable labeling systems and contributing to open-source AI tooling so others can build better models faster.

Available to hire

Hi, I’m Zachary Butler—a PhD‑level AI/CS professional with over 11 years of experience in AI model training, data labeling, and machine learning pipelines. I’ve led large-scale labeling projects for computer vision, NLP, and geospatial AI, and I’ve built robust workflows using LabelStudio, CVAT, Prodigy, and related tools to improve data quality and speed up iteration.

I’m passionate about ethical AI and collaboration across product, engineering, and research teams. I enjoy turning complex data challenges into scalable, reliable labeling systems and contributing to open-source AI tooling so others can build better models faster.

See more

Language

English
Fluent
Spanish; Castilian
Advanced

Work Experience

Data Analyst & AI Assistant (Full-Time) at AI Systems Lab
June 1, 2023 - Present
Supported AI research teams with data preprocessing, model evaluation, and automation of labeling workflows. Built Python pipelines for structured and unstructured data analysis using Pandas, NumPy, and TensorFlow. Developed Power BI dashboards to visualize key performance metrics and ensure transparency of data insights across departments. Collaborated with machine learning engineers to document data governance standards and improve data quality.
Freelance AI & Data Annotation Specialist at Freelance – Remote
June 1, 2022 - Present
Delivered freelance projects for startups and research clients in computer vision, NLP, and data automation domains. Labeled and curated over 200,000 data samples using LabelStudio, CVAT, and Prodigy for various machine learning models. Developed Python scripts for automating data cleaning and preprocessing, reducing manual effort by 30%. Coordinated with cross-functional teams to meet project timelines and ensure consistent data accuracy and documentation.
Junior Machine Learning Assistant (Intern) at University of Michigan AI Research Center
May 31, 2023 - October 10, 2025
Assisted research teams with data management, visualization, and model performance tracking for AI prototypes. Prepared and cleaned large datasets for experimentation, improving accuracy and consistency of project outcomes.
Data Analyst & AI Assistant (Full-Time) at AI Systems Lab
June 1, 2023 - Present
Supported AI research teams with data preprocessing, model evaluation, and automation of labeling workflows. Built and maintained Python pipelines for structured and unstructured data analysis using Pandas, NumPy, and TensorFlow. Developed Power BI dashboards to visualize key performance metrics and ensure transparency of data insights across departments. Collaborated with ML engineers to document data governance standards and improve data quality.
Freelance AI & Data Annotation Specialist at Remote
June 1, 2022 - Present
Delivered freelance projects for startups and research clients in computer vision, NLP, and data automation domains. Labeled and curated over 200,000 data samples using LabelStudio, CVAT, and Prodigy. Developed Python scripts for automating data cleaning and preprocessing, reducing manual effort by 30%. Coordinated with cross-functional teams to meet project timelines and ensure data accuracy and documentation.
Junior Machine Learning Assistant (Intern) at University of Michigan AI Research Center
May 31, 2023 - October 10, 2025
Assisted research teams with data management, visualization, and model performance tracking for AI prototypes. Prepared and cleaned large datasets for experimentation, improving accuracy and consistency of project outcomes.
AI Data Labelling Lead at Google DeepMind
June 1, 2020 - Present
Led data labeling for object detection and classification in geospatial AI applications. Handled text generation and translation labeling for large language models across English, Spanish, and Mandarin. Integrated Prodigy and supervision tooling for complex data types including 3D point clouds and audio transcripts to streamline labeling workflows.
Graduate Research Assistant at University of Michigan
May 1, 2014 - October 10, 2025
Supported AI research and large-scale data annotation projects; designed and executed labeling pipelines for computer vision and NLP tasks; contributed to AI ethics initiatives; collaborated on satellite imagery labeling and analysis workflows; oversaw data quality and labeling guidelines.

Education

Master of Science in Computer Science at University of Michigan
January 11, 2030 - May 1, 2026
Bachelor of Science in Computer Engineering at Missouri University of Science and Technology
January 11, 2030 - May 1, 2022
Master of Science in Computer Science at University of Michigan, Ann Arbor
January 11, 2030 - May 1, 2026
Bachelor of Science in Computer Engineering at Missouri University of Science and Technology
January 11, 2030 - May 1, 2022
PhD in Computer Science at University of Michigan, Ann Arbor, MI
January 11, 2030 - May 1, 2014
MS in Computer Science at University of Michigan, Ann Arbor, MI
January 11, 2030 - May 1, 2011
B.S. in Computer Engineering at Missouri University of Science and Technology, Rolla, MO
January 11, 2030 - May 1, 2009

Qualifications

Google Data Analytics Professional Certificate
January 1, 2023 - October 10, 2025
Python for Machine Learning (Coursera)
January 1, 2022 - October 10, 2025
Google Data Analytics Professional Certificate
January 1, 2023 - October 10, 2025
Python for Machine Learning
January 1, 2022 - October 10, 2025
AWS Certified Machine Learning Specialist
January 11, 2030 - October 10, 2025
Google Professional Data Engineer
January 11, 2030 - October 10, 2025
Certified Scrum Master
January 11, 2030 - October 10, 2025

Industry Experience

Computers & Electronics, Software & Internet, Education, Professional Services, Media & Entertainment