Thresha Voddiboina

Available to hire

Hi, I’m Thresha Voddiboina, a Senior ML Engineer and MLOps Specialist with over 8 years of experience designing and building production-scale AI systems. I specialize in Generative AI, Agentic AI, and enterprise machine learning infrastructure, focusing on creating safe, scalable, and efficient AI platforms that serve billions of predictions daily. I enjoy architecting end-to-end ML pipelines, fine-tuning large language models, and developing autonomous AI agents that drive impactful business outcomes.

I am passionate about combining cutting-edge AI research with practical deployment strategies, working extensively across multi-cloud environments like AWS, Azure, and GCP. Creating AI systems that respect safety and ethics principles is central to my work, and I thrive on collaborating with diverse teams to bring innovative AI solutions into production environments that improve healthcare, retail, and industrial processes.

Language

English (Fluent)

Work Experience

Senior ML Engineer & MLOps Specialist at CVS Health
December 1, 2024 - Present
Architected an enterprise-scale conversational AI platform serving over 50 million annual patient interactions using GPT-4, Claude-3, and fine-tuned medical LLMs with sub-100ms response time and 99.7% accuracy. Designed comprehensive MLOps infrastructure supporting 40+ production models with Azure Databricks, MLflow, and Kubernetes, enabling automated CI/CD with zero downtime. Implemented advanced RAG systems combining medical knowledge graphs and vector databases for real-time clinical decision support, improving diagnostic accuracy by 35%. Led development of multi-agent AI systems for pharmacy operations. Established end-to-end LLMOps pipelines for healthcare-specific models using LoRA/QLoRA and distributed training. Applied Constitutional AI and RLHF for model safety, achieving a 40% improvement in harmful content detection. Developed distributed inference infrastructure handling 100K+ concurrent requests with auto-scaling and load balancing. Automated model retraining pipelines triggere
Senior ML Engineer & MLOps Lead at 7-Eleven
November 30, 2024 - July 24, 2025
Architected an end-to-end customer segmentation and personalization AI platform processing 200M+ daily transactions from 77K+ stores using fine-tuned Llama 2, GPT-3.5, and advanced ML models. Built a production MLOps pipeline with Azure Databricks, MLflow, and GitHub Actions supporting 15+ models, reducing manual operations by 70%. Developed multi-agent AI systems for demand forecasting, pricing optimization, and inventory management, achieving a 30% efficiency gain. Implemented real-time inference APIs on AKS with an Istio service mesh serving 500K+ predictions per minute at 99.99% availability and sub-50ms latency. Established a continuous fine-tuning LLMOps framework and automated prompt optimization, improving model performance by 25%. Created A/B testing for AI content and built conversational AI interfaces handling 2M+ monthly interactions with 92% satisfaction. Developed scalable feature engineering frameworks and automated model retraining triggered by drift detection. Built self-healing infrastruc
Senior ML Engineer & Research Lead at Caterpillar Inc.
January 31, 2022 - July 24, 2025
Led architecture of large-scale predictive maintenance AI platform processing over 100TB daily IoT sensor data with TFX, transformer-based time series, and physics-informed neural networks. Deployed distributed ML infrastructure on AWS EKS and GKE managing 50+ models with automated retraining, hyperparameter tuning, and A/B testing, reducing equipment failures by 45%. Implemented federated learning across global sites ensuring data privacy and compliance. Created autonomous diagnostic agents reducing expert intervention by 70%. Developed MLOps pipeline using Kubeflow, MLflow, and Kubernetes operators. Built streaming inference infrastructure processing 10M+ events per second with sub-200ms latency. Established monitoring achieving 99.95% uptime SLA. Designed hybrid cloud Kubernetes architecture balancing cost and performance. Pioneered multi-modal AI combining textual, sensor, and visual data. Enhanced domain-specific language models with 94% accuracy in technical documentation. Execut
ML Engineer & Platform Developer at Accenture
June 30, 2019 - July 24, 2025
Architected secure multi-tenant ML platform serving 200+ models across multiple clients with Terraform, Jenkins, and Google Cloud Build incorporating DevSecOps. Delivered client AI solutions including sentiment analysis, document classification, predictive analytics, and early transformer NLP models improving performance by 30%. Built AutoML pipelines with neural architecture search reducing development time by 60%. Established model governance frameworks for ethics and compliance. Developed computer vision systems for quality control using CNNs, Vision Transformers, and real-time detection. Delivered multimodal AI applications combining text, image, and structured data. Created automated data labeling tools enhancing data prep efficiency. Built scalable model serving infrastructures utilizing Docker and Kubernetes. Developed Python and Go automation tools reducing environment provisioning from 2 weeks to 4 hours. Implemented infrastructure as code and automated canary deployments redu
Junior ML Engineer & Data Scientist at KPMG
February 28, 2018 - July 24, 2025
Developed and deployed 15+ ML models for predictive analytics, classification, and regression using scikit-learn and early TensorFlow. Created data preprocessing and feature engineering pipelines for structured and unstructured data. Built automated model training and evaluation frameworks supporting cross-validation and performance optimization across domains. Delivered statistical analysis and reporting tools for stakeholder insights. Initiated NLP projects using TF-IDF, word2vec, and LSTM for text classification, sentiment analysis, and document processing. Developed document clustering and topic modeling with LDA and NMF for business intelligence. Implemented information extraction using NER and rule-based methods. Established text preprocessing pipelines for multiple languages. Set foundational monitoring infrastructure with Prometheus and Grafana. Created Git-based MLflow workflows for model versioning and experiment tracking. Developed CI/CD pipelines improving deployment consis

Education

Master of Science at Stevens Institute of Technology
January 1, 2021 - May 31, 2023

Qualifications

DeepLearning.AI Generative AI Specialization
January 1, 2024 - December 31, 2024
OpenAI GPT-4 Developer Certification
January 1, 2024 - December 31, 2024
LangChain Certified AI Engineer
January 1, 2024 - December 31, 2024
Anthropic Claude AI Safety Certification
January 1, 2024 - December 31, 2024
Hugging Face Transformers Expert
January 1, 2023 - December 31, 2023
AWS Certified Machine Learning - Specialty
January 1, 2023 - December 31, 2023
Microsoft Certified: Azure AI Engineer Associate
January 1, 2023 - December 31, 2023
Google Cloud Professional Machine Learning Engineer
January 1, 2023 - December 31, 2023
Databricks Certified Professional Data Engineer
January 1, 2023 - December 31, 2023
Certified Information Systems Security Professional (CISSP)
January 1, 2023 - December 31, 2023
AWS Certified Solutions Architect - Professional
January 1, 2022 - December 31, 2022
Microsoft Certified: Azure Solutions Architect Expert
January 1, 2022 - December 31, 2022
HashiCorp Certified: Terraform Associate
January 1, 2022 - December 31, 2022
Certified Kubernetes Application Developer (CKAD)
January 1, 2022 - December 31, 2022
Certified Kubernetes Administrator (CKA)
January 1, 2021 - December 31, 2021
Jenkins Certified Engineer
January 1, 2021 - December 31, 2021
AWS Certified Security - Specialty
January 1, 2022 - December 31, 2022

Industry Experience

Healthcare, Retail, Manufacturing, Professional Services, Software & Internet