I am a Principal GenAI Engineer, Data Scientist, and Data Engineer with 10+ years of experience designing, building, and operating production-scale AI systems. I specialize in agentic architectures, real-time inference, multimodal pipelines, and data platforms for large-scale consumer and enterprise environments.\n\nI collaborate closely with product and business stakeholders to translate ambitious AI goals into resilient, governed systems, and I enjoy turning experimental GenAI workflows into production-ready solutions that deliver measurable value and ROI.

Perry Yee

I am a Principal GenAI Engineer, Data Scientist, and Data Engineer with 10+ years of experience designing, building, and operating production-scale AI systems. I specialize in agentic architectures, real-time inference, multimodal pipelines, and data platforms for large-scale consumer and enterprise environments.\n\nI collaborate closely with product and business stakeholders to translate ambitious AI goals into resilient, governed systems, and I enjoy turning experimental GenAI workflows into production-ready solutions that deliver measurable value and ROI.

Available to hire

I am a Principal GenAI Engineer, Data Scientist, and Data Engineer with 10+ years of experience designing, building, and operating production-scale AI systems. I specialize in agentic architectures, real-time inference, multimodal pipelines, and data platforms for large-scale consumer and enterprise environments.\n\nI collaborate closely with product and business stakeholders to translate ambitious AI goals into resilient, governed systems, and I enjoy turning experimental GenAI workflows into production-ready solutions that deliver measurable value and ROI.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

Principal Machine Learning Engineer (Gen AI / LLM Systems) at DoorDash
September 1, 2022 - Present
Architected a real-time, agentic offer orchestration platform with specialized retrieval, pricing/optimization, and policy-enforcement agents to personalize promotions across multiple touchpoints. The system uses RAG over FAISS and Delta Lake with low-latency Databricks Model Serving, designed for production reliability and scalability. Led governance, data-quality services for SKU feeds, and a multimodal extraction pipeline combining vision, parsing, and compliance agents.
Senior Data Scientist at Compass
April 1, 2018 - September 1, 2022
Designed ML systems leveraging large-scale WiFi data to forecast passenger demand, delivering ~20% gains in route optimization. Built an AI-driven traffic optimization platform combining IoT sensors and Google Maps APIs to analyze congestion and reduce delays by ~30% with real-time signal adjustments. Implemented predictive maintenance from IoT telemetry and an AI-powered sentiment analysis pipeline for near real-time insights and improved customer engagement.
Senior Data Scientist at Cisco
October 1, 2014 - November 1, 2017
Developed ML models on AWS SageMaker to calibrate IoT device temperatures in greenhouse environments, improving prediction accuracy by ~20% across 8,000 devices. Designed AI-driven inventory forecasting and time-series-based maintenance planning, plus computer-vision quality control, leading to reduced downtime and improved operational reliability. Built customer lifetime value and recommendation models.
Data Scientist at Fora.TV
December 1, 2013 - October 1, 2014
Built scalable recommendation systems with Apache Spark and Hive, including item-to-item collaborative filtering and FP-Growth on HDFS. Developed NLP-based email classification on Azure to automate ticket routing and forecasted call volumes to optimize staffing. Implemented predictive models for HR recruitment and churn analysis, and contributed to ML-based threat detection.

Education

Bachelor's Degree at University of California, Berkeley
January 1, 2009 - January 1, 2013

Qualifications

AWS Certified Data Scientist
January 11, 2030 - February 25, 2026
Deep Learning Specialization (Andrew Ng)
January 11, 2030 - February 25, 2026
Azure Certified Data Scientist
January 11, 2030 - February 25, 2026
GCP Certified Data Scientist
January 11, 2030 - February 25, 2026
IBM Data Science And AI Certificate Level 1
January 11, 2030 - February 25, 2026
IBM Data Science And AI Certificate Level 2
January 11, 2030 - February 25, 2026
IBM Data Science And AI Certificate Level 3
January 11, 2030 - February 25, 2026
MLOps Certified Data Scientist
January 11, 2030 - February 25, 2026

Industry Experience

Software & Internet, Retail, Transportation & Logistics, Media & Entertainment, Professional Services