Jerry Yang

Available to hire

I am Jerry Yang, a Backend & AI Engineer with 8+ years of full-stack and AI/ML engineering experience building scalable, cloud-native software and implementing state-of-the-art LLM, GenAI, and NLP systems.
- Documented experience working on cross-functional teams and building microservices, CI/CD workflows, and real-time data systems using Python, Java, Kubernetes, and Terraform.
- Broad experience integrating RAG architectures, LangChain, Hugging Face, and GPT-3.5/4, and optimizing the end-user experience.
- Interested in building end-to-end AI platforms and corporate applications using Python frameworks, deep learning, and robust DevOps practices.

Language

English: Fluent
Javanese: Advanced
Afar: Intermediate

Work Experience

Backend & AI Engineer at Microsoft
January 1, 2022 - Present
Designed and implemented the Dreaming Worker pipeline, which analyzes uploaded user content (audio, PDFs, Markdown, DOCX) and generates structured Moments and Dreams via GPT-4-class LLMs. Built a multi-tenant content ingestion system with SeaweedFS (S3-compatible), NATS JetStream, and PostgreSQL/TiDB to manage large-scale asynchronous processing. Developed file upload and moment creation APIs using FastAPI (Python), integrating LLM agents for automated insight extraction. Implemented CLI orchestration commands to run per-tenant or batch analysis, including model selection, lookback windows, and email digest generation. Created automated email reports that summarize generated moments in templated HTML and are sent through an SMTP server. Developed RAG-style pipelines for chunking, embedding, and retrieval of multi-format content (PDF, DOCX, Markdown, audio). Developed agentic AI systems that interface with the OpenAI and Anthropic (Claude) APIs to process user content and generate structured insights. Integrated O
Backend & AI Engineer at Silicon Labs
July 1, 2020 - January 1, 2022
Led cross-functional teams leveraging ML, GenAI, and NLP for financial data analysis and AI-powered investment insights. Implemented backtesting and a generative AI chat assistant for user education and informed decision-making. Led the development of a legal chatbot powered by LLMs, enhanced with agentic AI technology, and deployed on GCP using Kubeflow and BigQuery. Demonstrated expertise in cloud-based NLP solutions. Created interactive Tableau dashboards to communicate financial data. Applied recent generative AI research and tooling (LangChain, ChatGPT, Ollama.ai). Scaled distributed backend systems supporting 20,000 daily active users using FastAPI and Node.js, improving request latency and throughput via API refactoring, advanced caching strategies, and database query optimization (MySQL, RocksDB). Built and maintained enterprise-grade microservices in Python and Django for low-latency, high-availability systems. Built and maintained responsive dashboards and client po
Machine Learning Engineer at Samsung Electronics
June 1, 2019 - August 1, 2019
Supported an officer with data pipeline creation and ML engineering. Worked on machine learning, NLP, AWS, and Elasticsearch/OpenSearch, and helped build the ETL for the Global Hydrogen Index with Elasticsearch. Utilized BERT for NLP-based deep learning in voice recognition and integrated agentic AI with graph attention networks, transformers, and LSTM architectures. Streamlined data analytics and workflow automation using Alteryx. Designed and implemented complex workflows integrating data from various sources, transforming and cleaning data, and generating actionable insights. Participated in the design, development, and deployment of scalable backend services and distributed systems supporting high-traffic e-commerce and financial platforms. Built and maintained RESTful APIs and microservices using Python for millions of daily users. Combined DNNs with SVM, KNN, and tree models using grid search, increasing performance by 15%. Architected and optimized cloud infrastructure on AWS wit
Full Stack Engineer at AMD
January 1, 2019 - April 1, 2019
Implemented machine learning in a distributed containerized fashion using TensorFlow, Keras, Azure Docker containers, and scikit-learn within an Agile methodology. Built generative adversarial models for anomaly detection and utilized R for statistical analysis and data visualization. Designed and implemented complex workflows integrating data from various sources, transforming and cleaning data, and generating actionable insights. Built and maintained RESTful APIs and microservices using Python and Node.js, integrating with SQL Server and Azure-based services. Implemented CI/CD pipelines with GitHub Actions and AWS CodePipeline. Integrated AI/ML features using OpenAI APIs and Hugging Face models. Monitored and improved application reliability using Prometheus, Grafana, and CloudWatch.
Backend Engineer at Hewlett Packard Enterprise
June 1, 2018 - August 1, 2018
Built and maintained scalable backend services in Java and Python, integrating with VMware’s virtualization stack and APIs (vSphere, ESXi, vCenter). Developed and enhanced RESTful and gRPC APIs to support automation, orchestration, and third-party integrations. Wrote scripts in Python and Bash to automate data collection, parsing, and storage. Supported integration of backend data pipelines with MySQL and NoSQL databases for large-scale data indexing. Optimized scripts to automate ETL processes. Collaborated with senior engineers to improve system reliability and scalability for production workloads.

Education

Bachelor of Science (B.S.) in Computer Engineering at The University of Texas at Austin
January 11, 2030 - January 1, 2018

Qualifications

AWS Certified Machine Learning - Specialty
January 11, 2030 - February 17, 2026
TensorFlow Developer Certificate
January 11, 2030 - February 17, 2026
AWS Certified Machine Learning - Specialty
January 11, 2030 - March 5, 2026
TensorFlow Developer Certificate
January 11, 2030 - March 5, 2026

Industry Experience

Software & Internet, Media & Entertainment, Professional Services