Available to hire
Hi, I’m Zachary Butler—a PhD‑level AI/CS professional with over 11 years of experience in AI model training, data labeling, and machine learning pipelines. I’ve led large-scale labeling projects for computer vision, NLP, and geospatial AI, and I’ve built robust workflows using LabelStudio, CVAT, Prodigy, and related tools to improve data quality and speed up iteration.
I’m passionate about ethical AI and collaboration across product, engineering, and research teams. I enjoy turning complex data challenges into scalable, reliable labeling systems and contributing to open-source AI tooling so others can build better models faster.
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Language
English
Fluent
Spanish; Castilian
Advanced
Work Experience
Data Analyst & AI Assistant (Full-Time) at AI Systems Lab
June 1, 2023 - PresentSupported AI research teams with data preprocessing, model evaluation, and automation of labeling workflows. Built Python pipelines for structured and unstructured data analysis using Pandas, NumPy, and TensorFlow. Developed Power BI dashboards to visualize key performance metrics and ensure transparency of data insights across departments. Collaborated with machine learning engineers to document data governance standards and improve data quality.
Freelance AI & Data Annotation Specialist at Freelance – Remote
June 1, 2022 - PresentDelivered freelance projects for startups and research clients in computer vision, NLP, and data automation domains. Labeled and curated over 200,000 data samples using LabelStudio, CVAT, and Prodigy for various machine learning models. Developed Python scripts for automating data cleaning and preprocessing, reducing manual effort by 30%. Coordinated with cross-functional teams to meet project timelines and ensure consistent data accuracy and documentation.
Junior Machine Learning Assistant (Intern) at University of Michigan AI Research Center
May 31, 2023 - October 10, 2025Assisted research teams with data management, visualization, and model performance tracking for AI prototypes. Prepared and cleaned large datasets for experimentation, improving accuracy and consistency of project outcomes.
Data Analyst & AI Assistant (Full-Time) at AI Systems Lab
June 1, 2023 - PresentSupported AI research teams with data preprocessing, model evaluation, and automation of labeling workflows. Built and maintained Python pipelines for structured and unstructured data analysis using Pandas, NumPy, and TensorFlow. Developed Power BI dashboards to visualize key performance metrics and ensure transparency of data insights across departments. Collaborated with ML engineers to document data governance standards and improve data quality.
Freelance AI & Data Annotation Specialist at Remote
June 1, 2022 - PresentDelivered freelance projects for startups and research clients in computer vision, NLP, and data automation domains. Labeled and curated over 200,000 data samples using LabelStudio, CVAT, and Prodigy. Developed Python scripts for automating data cleaning and preprocessing, reducing manual effort by 30%. Coordinated with cross-functional teams to meet project timelines and ensure data accuracy and documentation.
Junior Machine Learning Assistant (Intern) at University of Michigan AI Research Center
May 31, 2023 - October 10, 2025Assisted research teams with data management, visualization, and model performance tracking for AI prototypes. Prepared and cleaned large datasets for experimentation, improving accuracy and consistency of project outcomes.
AI Data Labelling Lead at Google DeepMind
June 1, 2020 - PresentLed data labeling for object detection and classification in geospatial AI applications. Handled text generation and translation labeling for large language models across English, Spanish, and Mandarin. Integrated Prodigy and supervision tooling for complex data types including 3D point clouds and audio transcripts to streamline labeling workflows.
Graduate Research Assistant at University of Michigan
May 1, 2014 - October 10, 2025Supported AI research and large-scale data annotation projects; designed and executed labeling pipelines for computer vision and NLP tasks; contributed to AI ethics initiatives; collaborated on satellite imagery labeling and analysis workflows; oversaw data quality and labeling guidelines.
Education
Master of Science in Computer Science at University of Michigan
January 11, 2030 - May 1, 2026Bachelor of Science in Computer Engineering at Missouri University of Science and Technology
January 11, 2030 - May 1, 2022Master of Science in Computer Science at University of Michigan, Ann Arbor
January 11, 2030 - May 1, 2026Bachelor of Science in Computer Engineering at Missouri University of Science and Technology
January 11, 2030 - May 1, 2022PhD in Computer Science at University of Michigan, Ann Arbor, MI
January 11, 2030 - May 1, 2014MS in Computer Science at University of Michigan, Ann Arbor, MI
January 11, 2030 - May 1, 2011B.S. in Computer Engineering at Missouri University of Science and Technology, Rolla, MO
January 11, 2030 - May 1, 2009Qualifications
Google Data Analytics Professional Certificate
January 1, 2023 - October 10, 2025Python for Machine Learning (Coursera)
January 1, 2022 - October 10, 2025Google Data Analytics Professional Certificate
January 1, 2023 - October 10, 2025Python for Machine Learning
January 1, 2022 - October 10, 2025AWS Certified Machine Learning Specialist
January 11, 2030 - October 10, 2025Google Professional Data Engineer
January 11, 2030 - October 10, 2025Certified Scrum Master
January 11, 2030 - October 10, 2025Industry Experience
Computers & Electronics, Software & Internet, Education, Professional Services, Media & Entertainment
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Chesterfield today.