Vedavathi Thumula

Versatile GenAI, ML, and Agentic AI Engineer with experience designing intelligent autonomous systems powered by large language models, deep learning, and retrieval-augmented pipelines. Skilled in building end-to-end generative AI solutions, fine-tuning LLMs, and creating agentic workflows that enable multi-step reasoning, tool use, and dynamic decision-making using frameworks like LangChain, AutoGen, and CrewAI. Adept at integrating RAG, vector search, and cloud-scale ML infrastructure to deliver scalable, production-grade AI applications. Passionate about developing safe, reliable, and high-impact AI systems that automate complex processes, enhance productivity, and drive business transformation.

Available to hire

Language

English
Fluent

Work Experience

Gen AI Engineer at Walmart
May 1, 2025 - Present
Developed and deployed Generative AI applications using LLMs, Transformers, Diffusion Models, and GANs for text, image, and multimodal tasks. Built autonomous agentic workflows with LangChain and LangGraph enabling multi-step reasoning, tool-calling, memory management, and stateful task orchestration for complex automation. Designed modular agent pipelines that integrate retrieval, planning, and action execution using DAG-based control flows to improve reliability, debuggability, and end-to-end performance. Implemented end-to-end GenAI pipelines with Python, PyTorch, TensorFlow, and Hugging Face for training, fine-tuning, and inference. Built RAG-based systems with FAISS and custom embeddings to improve grounding and factuality. Fine-tuned LLMs (GPT-like, LLaMA, Mistral) using LoRA/QLoRA/PEFT on domain-specific data. Optimized inference APIs with FastAPI, Docker, Kubernetes, and Azure for scalable, low-latency workloads. Engineered domain-specific prompts and evaluation frameworks; est
AI/ML Engineer at Fifth Third Bank
February 1, 2024 - April 1, 2025
Designed, developed, and deployed machine learning models for prediction, classification, and optimization using Python, PyTorch, TensorFlow, and scikit-learn. Built scalable data pipelines for preprocessing, feature engineering, and ETL using Pandas, NumPy, SQL, Spark, and automated workflows with Airflow/Prefect. Engineered end-to-end ML systems including model training, hyperparameter tuning, validation, and monitoring with MLflow and Weights & Biases. Developed and optimized deep learning architectures (CNNs, RNNs, Transformers) for NLP, computer vision, and tabular workflows, improving model accuracy and robustness. Built production-grade APIs for ML/AI services using FastAPI and Flask, containerized with Docker, and deployed on AWS/GCP/Azure. Implemented model optimization techniques including quantization, ONNX, distillation, and GPU acceleration with CUDA for faster inference. Integrated ML models with backend systems, data warehouses, and real-time applications through REST APIs, gRPC, and message queues. Performed large-scale experimentation and A/B testing to validate model improvements and ensure measurable business impact. Developed automated model retraining
ML Engineer / Data Scientist at Virtusa, India
July 1, 2022 - December 1, 2023
Designed and deployed machine learning models for prediction, classification, and recommendation tasks using Python, scikit-learn, TensorFlow, and PyTorch. Built robust data processing and feature engineering pipelines using Pandas, NumPy, SQL, and large-scale distributed workflows with Apache Spark. Developed end-to-end ML workflows including training, hyperparameter tuning (GridSearchCV, RandomizedSearchCV), validation, and model performance tracking. Implemented deep learning architectures (CNNs, LSTMs, Transformers) for NLP and computer vision tasks with TensorFlow/Keras and PyTorch. Built and deployed ML microservices using Flask, FastAPI, and Docker, and orchestrated deployments across AWS/GCP/Azure. Used TensorBoard, MLflow, and custom logging frameworks for experiment tracking, versioning, and reproducibility. Created automated ETL and batch processing pipelines using Airflow and Prefect. Performed model optimization via pruning, quantization, and ONNX export for efficient inference

Education

Master's in Computer Science at University of Central Missouri
January 1, 2024 - May 1, 2025
Bachelor of Science in Computer Science at University of Central Missouri
January 1, 2024 - May 1, 2025

Industry Experience

Software & Internet, Financial Services, Professional Services, Media & Entertainment, Retail