Available to hire
Hi, I’m Jerry Yang, a Backend & AI/ML Engineer based in Austin, TX. I’ve spent 7+ years building scalable, cloud-native software, microservices, and AI-powered workflows. I love turning messy data into structured insights, designing multi-tenant ingestion pipelines, and integrating LLMs and agentic AI into real-world applications.
I enjoy collaborating with cross-functional teams to ship end-to-end AI platforms using Python, Kubernetes, and modern DevOps practices. From data processing and retrieval to semantic search and automated reporting, I’m focused on delivering robust experiences for end users while keeping systems reliable and secure.
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Language
English
Fluent
Javanese
Advanced
Afar
Intermediate
Work Experience
Backend & AI Engineer at Microsoft
January 1, 2022 - PresentDesigned and implemented the Dreaming Worker pipeline to analyze uploaded user content (audio, PDFs, Markdown, DOCX) and generate structured Moments and Dreams via GPT-4-class LLMs. Built a multi-tenant content ingestion system with SeaweedFS (S3-compatible), NATS JetStream, and PostgreSQL/TiDB to manage large-scale asynchronous processing. Developed file upload and moment creation APIs using FastAPI (Python) integrating LLM agents for automated insight extraction. Implemented CLI orchestration commands to run per-tenant or batch analysis, including model selection, lookback windows, and email digest generation. Created automated email reporting via templated HTML summaries of generated moments using SMTP server. Developed RAG-style pipelines for chunking, embedding, and retrieval of multi-format content (PDF, DOCX, Markdown, audio). Developed agentic AI systems that interface with OpenAI, Anthropic, and Claude APIs to process user content and generate structured insights. Integrated O
Backend & AI Engineer at Silicon Labs
July 1, 2020 - January 1, 2022Led cross-functional teams leveraging ML, GenAI, and NLP for financial data analysis and AI-powered investment insights. Implemented backtesting and a generative AI chat assistant for user education and informed decision-making. Led the development of a legal chatbot powered by LLMs, enhanced with agentic AI technology, and deployed on GCP using Kubeflow and BigQuery. Demonstrated expertise in cloud-based NLP solutions. Created interactive dashboards for financial data communication using Tableau. Implemented cutting-edge research in generative AI (LangChain, ChatGPT, Ollama.ai). Scaled distributed backend systems supporting 20,000 daily active users using FastAPI and Node.js, improving request latency and throughput via API refactoring, advanced caching strategies, and database query optimization (MySQL, RocksDB). Built and maintained enterprise-grade microservices in Python and Django for low-latency, high-availability systems. Built and maintained responsive dashboards and client po
Machine Learning Engineer at Samsung Electronics
June 1, 2019 - August 1, 2019Provided services for an officer in creating data pipelines and ML engineering. Worked on machine learning, NLP, AWS, Elasticsearch/OpenSearch and helped create ETL for Global Hydrogen Index with Elasticsearch. Utilized BERT for NLP-based deep learning in voice recognition and integrated agentic AI with graph attention networks, transformers, and LSTM architectures. Streamlined data analytics and workflow automation using Alteryx. Designed and implemented complex workflows integrating data from various sources, transforming and cleaning data, and generating actionable insights. Participated in the design, development, and deployment of scalable backend services and distributed systems supporting high-traffic e-commerce and financial platforms. Built and maintained RESTful APIs and microservices using Python for millions of daily users. Combined DNNs with SVM, KNN, and tree models using grid search, increasing performance by 15%. Architected and optimized cloud infrastructure on AWS wit
Full Stack Engineer at AMD
January 1, 2019 - April 1, 2019Implemented machine learning in a distributed containerized fashion using TensorFlow, Keras, Azure Docker containers, and scikit-learn within an Agile methodology. Built generative adversarial models for anomaly detection and utilized R for statistical analysis and data visualization. Designed and implemented complex workflows integrating data from various sources, transforming and cleaning data, and generating actionable insights. Built and maintained RESTful APIs and microservices using Python and Node.js, integrating with SQL Server and Azure-based services. Implemented CI/CD pipelines with GitHub Actions and AWS CodePipeline. Integrated AI/ML features using OpenAI APIs and Hugging Face models. Monitored and improved application reliability using Prometheus, Grafana, and CloudWatch.
Backend Engineer at Hewlett Packard Enterprise
June 1, 2018 - August 1, 2018Built and maintained scalable backend services in Java and Python, integrating with VMware’s virtualization stack and APIs (vSphere, ESXi, vCenter). Developed and enhanced RESTful and gRPC APIs to support automation, orchestration, and third-party integrations. Wrote scripts in Python and Bash to automate data collection, parsing, and storage. Supported integration of backend data pipelines with MySQL and NoSQL databases for large-scale data indexing. Optimized scripts to automate ETL processes. Collaborated with senior engineers to improve system reliability and scalability for production workloads.
Backend & AI Developer at Microsoft
January 1, 2022 - March 1, 2026Designed and implemented the Dreaming Worker pipeline to analyze uploaded user content (audio, PDFs, Markdown, DOCX) and generate structured Insights ('Moments' and 'Dreams') using LLM-powered workflows. Built a multi-tenant content ingestion system with SeaweedFS, NATS JetStream, and PostgreSQL/TiDB to manage asynchronous AI processing across tenant-specific workflows. Developed file upload and moment-creation APIs with FastAPI, integrating AI workflow components for automated insight extraction. Implemented CLI orchestration for per-tenant and batch analysis, including model selection, lookback windows, and email digest generation. Created automated HTML email summaries and retrieval pipelines for chunking, embeddings, and multi-format content search. Built LLM-powered and agentic workflow components interfacing with OpenAI, Claude, and Gemini APIs for processing, context retrieval, and structured generation. Integrated embeddings and vector indexes to support semantic search. Implem
Backend & AI Developer at Silicon Labs
July 1, 2020 - January 1, 2022Collaborated with cross-functional teams to support financial analysis, NLP-assisted decision-support, and internal knowledge workflows. Implemented backtesting workflows and NLP-based assistant features for user education and research. Contributed to a legal/financial document Q&A system with NLP, search, retrieval, summarization, and structured responses. Designed backend pipelines for document ingestion, indexing, search, and Q&A across legal/financial sources. Built API integrations for user-facing applications with document processing, retrieval, and analytics. Created dashboards for financial data reporting and scaled backend systems to support 20,000 daily active users via FastAPI and Node.js, improving latency and throughput through refactoring and caching. Deployed data and ML workflows on Azure and Google Cloud (Kubeflow, BigQuery). Mentored junior engineers and contributed to best practices for scalable distributed systems, including a crypto arbitrage bot for real-time deci
Software Developer at Samsung Electronics
June 1, 2019 - August 31, 2019Supported ETL pipelines and ML engineering for research analytics and document indexing. Worked on ML/NLP tasks using BERT, transformers, graph attention networks, and LSTM; streamlined data analytics and workflow automation with Alteryx. Designed workflows for multi-source data ingestion, transformation, and indexing, and built RESTful backend services for internal data access and automation. Contributed to cloud deployments and front-end prototyping for search and analytics interfaces.
Software Developer at AMD
January 1, 2019 - April 30, 2019Supported ML workflows in distributed containerized environments using TensorFlow, Keras, and scikit-learn. Built generative models for anomaly detection and used R for statistical analysis and visualization. Automated data processing with Alteryx, and developed ML/automation pipelines that reduced manual tasks. Delivered backend services and RESTful APIs, and contributed to cloud infrastructure using AWS.
Software Developer at Hewlett Packard Enterprise
June 1, 2018 - August 31, 2018Supported scalable backend services in Java and Python, integrating with VMware virtualization APIs. Developed and enhanced RESTful and gRPC APIs for automation and orchestration. Wrote scripts to automate ETL and data collection, and supported data pipelines with SQL/NoSQL databases. Optimized distributed system reliability, scalability, and maintainability, collaborating across teams in an Agile environment.
Education
Bachelor of Science (B.S.) in Computer Engineering at The University of Texas at Austin
January 11, 2030 - January 1, 2018Bachelor of Science (B.S.), Computer Engineering at The University of Texas at Austin
January 11, 2030 - January 1, 2018Bachelor of Science (B.S.), Computer Engineering at The University of Texas at Austin
January 11, 2030 - January 1, 2020Bachelor of Science (B.S.), Computer Engineering at The University of Texas at Austin
January 1, 2016 - January 1, 2020Qualifications
AWS Certified Machine Learning - Specialty
January 11, 2030 - February 17, 2026TensorFlow Developer Certificate
January 11, 2030 - February 17, 2026AWS Certified Machine Learning - Specialty
January 11, 2030 - March 5, 2026TensorFlow Developer Certificate
January 11, 2030 - March 5, 2026AWS Certified Machine Learning - Specialty
January 11, 2030 - May 27, 2026TensorFlow Developer Certificate
January 11, 2030 - May 27, 2026Industry Experience
Software & Internet, Media & Entertainment, Professional Services, Computers & Electronics
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Hire a AI Engineer
We have the best ai engineer experts on Twine. Hire a ai engineer in Austin today.