Hello! I’m an AI and ML engineer with 6 years of experience designing, developing, and deploying LLM-powered applications, NLP pipelines, and data-driven solutions across healthcare, finance, and enterprise automation. I specialize in Retrieval-Augmented Generation (RAG) systems, semantic search, and context-aware outputs using GPT-3/4, BERT, LLaMA2, LangChain, and Hugging Face Transformers. I enjoy turning business goals into production-ready, scalable AI solutions that are secure, explainable, and aligned with responsible AI practices. My work spans cloud-native AI engineering (AWS SageMaker, Vertex AI, Azure), MLOps/LLMOps, and data engineering with Python, SQL, Airflow, and Informatica. I design low-latency, high-throughput ML infrastructure using Docker, Kubernetes, and Terraform, enabling automated deployments and reliable production performance. I value cross-functional collaboration and data-driven decision-making to deliver measurable impact and continuous optimization.

Venkata Sai Naga Phanindra Kumar Annabat

Hello! I’m an AI and ML engineer with 6 years of experience designing, developing, and deploying LLM-powered applications, NLP pipelines, and data-driven solutions across healthcare, finance, and enterprise automation. I specialize in Retrieval-Augmented Generation (RAG) systems, semantic search, and context-aware outputs using GPT-3/4, BERT, LLaMA2, LangChain, and Hugging Face Transformers. I enjoy turning business goals into production-ready, scalable AI solutions that are secure, explainable, and aligned with responsible AI practices. My work spans cloud-native AI engineering (AWS SageMaker, Vertex AI, Azure), MLOps/LLMOps, and data engineering with Python, SQL, Airflow, and Informatica. I design low-latency, high-throughput ML infrastructure using Docker, Kubernetes, and Terraform, enabling automated deployments and reliable production performance. I value cross-functional collaboration and data-driven decision-making to deliver measurable impact and continuous optimization.

Available to hire

Hello! I’m an AI and ML engineer with 6 years of experience designing, developing, and deploying LLM-powered applications, NLP pipelines, and data-driven solutions across healthcare, finance, and enterprise automation. I specialize in Retrieval-Augmented Generation (RAG) systems, semantic search, and context-aware outputs using GPT-3/4, BERT, LLaMA2, LangChain, and Hugging Face Transformers. I enjoy turning business goals into production-ready, scalable AI solutions that are secure, explainable, and aligned with responsible AI practices.

My work spans cloud-native AI engineering (AWS SageMaker, Vertex AI, Azure), MLOps/LLMOps, and data engineering with Python, SQL, Airflow, and Informatica. I design low-latency, high-throughput ML infrastructure using Docker, Kubernetes, and Terraform, enabling automated deployments and reliable production performance. I value cross-functional collaboration and data-driven decision-making to deliver measurable impact and continuous optimization.

See more

Work Experience

Gen AI Developer at ABM Industries
December 1, 2023 - Present
Designed and deployed advanced Generative AI solutions using GPT-3, GPT-4, and BERT for content creation, summarization, and conversational AI across various applications. Developed AI-driven applications with LangChain, orchestrating workflows involving large language models for question answering, document processing, and automation. Engineered Retrieval-Augmented Generation systems integrating LLMs with vector databases such as FAISS and Pinecone for accurate information retrieval and output generation. Optimized prompts for domain-specific OpenAI API tasks and deployed LLMs on AWS SageMaker and Google Vertex AI ensuring scalability and performance. Built real-time intelligent chatbots and maintained CI/CD pipelines using Jenkins, Docker, and Kubernetes. Developed computer vision models for object detection. Ensured responsible AI practices focusing on explainability, fairness, and compliance.
Programmer (AI Engineer) Analyst at Cognizant Technology Solutions
July 31, 2023 - August 26, 2025
Designed and deployed RAG systems using LangChain, FAISS, and ChromaDB. Integrated LLMs like GPT-4 and LLaMA2 for use cases including fraud detection, code extraction, and document summarization. Managed AI infrastructure on AWS leveraging SageMaker, EC2, Lambda, S3, Glue Catalog, and Athena. Developed NLP pipelines using BERT, T5, SBERT, spaCy, and NLTK, enabling clinical coding recommendations and chatbot responses. Maintained CI/CD pipelines with AWS CodePipeline, CodeBuild, Docker, Kubernetes, and Terraform for automated ML workflows. Engineered ETL pipelines with Informatica and Apache Airflow from MS SQL Server and healthcare data. Developed deep learning and computer vision models using TensorFlow and PyTorch, customized SageMaker pipelines for model tuning.
Machine Learning Data Associate at Amazon
December 31, 2022 - August 26, 2025
Developed and deployed advanced AI solutions using LLMs and RAG architectures with vector databases like FAISS and Pinecone. Built conversational AI applications with LangChain for personalized real-time interactions. Applied RLHF and traditional machine learning techniques for behavioral modeling and segmentation. Developed automated data pipelines using Airflow, Python, NumPy, and Pandas; implemented OpenCV computer vision pipelines. Maintained ML models using TensorFlow, PyTorch, spaCy, and Hugging Face Transformers in production with Docker, Kubernetes, and AWS SageMaker. Created scalable CI/CD pipelines using Jenkins, Git, and cloud-native tools supporting continuous deployment of AI solutions.
Data Scientist / ML Engineer at Optum
June 30, 2022 - August 26, 2025
Developed and fine-tuned deep learning models for classification and prediction. Designed end-to-end ML solutions using AWS SageMaker, EC2, Lambda, and S3 for training and deployment. Applied NLP techniques such as TF-IDF, Word2Vec, and text preprocessing to extract insights from unstructured data. Built CI/CD pipelines tailored for ML workflows with AWS CodePipeline and CodeBuild ensuring automated retraining and deployment. Engineered large-scale ETL workflows using Informatica and MS SQL Server. Created dashboards and visualizations using Tableau and Power BI to communicate findings. Collaborated with cross-functional teams ensuring delivery of impactful machine learning initiatives.
Gen AI Developer at ABM Industries
December 1, 2023 - Present
Designed and deployed advanced Generative AI solutions using GPT-3, GPT-4, and BERT for content creation, summarization, and conversational AI. Built AI-driven applications using LangChain to orchestrate multi-step workflows with large language models for question answering, document processing, and automation. Developed Retrieval-Augmented Generation systems combining LLMs with vector databases FAISS and Pinecone to ensure factually accurate outputs. Engineered prompt optimization for domain-specific tasks. Deployed scalable LLMs on AWS SageMaker and Google Vertex AI enabling high availability and performance. Integrated generative AI with external APIs and enterprise systems across healthcare, e-commerce, and finance. Built real-time intelligent chatbots and maintained CI/CD pipelines using Jenkins, Docker, and Kubernetes. Developed computer vision algorithms for object detection with OpenCV, Keras, and TensorFlow. Promoted responsible AI practices focusing on explainability, fairnes
Programmer (AI Engineer) Analyst at Cognizant Technology Solutions
July 31, 2023 - August 26, 2025
Designed and deployed Retrieval-Augmented Generation (RAG) systems using LangChain, FAISS, and ChromaDB with GPT-4 and LLaMA2 to support applications like fraud detection and document summarization. Architected AI infrastructure on AWS utilizing SageMaker, EC2, Lambda, and S3 with Glue Catalog and Athena for real-time metadata querying. Developed NLP pipelines using BERT, T5, SBERT, spaCy, and NLTK for clinical code recommendations and chatbot functionalities. Built robust CI/CD pipelines with AWS CodePipeline, CodeBuild, Docker, Kubernetes, and Terraform automating retraining and deployment. Engineered ETL pipelines using Informatica and Apache Airflow to prepare structured datasets. Fine-tuned deep learning and computer vision models with TensorFlow and PyTorch for dental image analysis and document verification. Customized SageMaker pipelines for hyperparameter tuning and model improvement.
Machine Learning Data Associate at Amazon
December 1, 2022 - August 26, 2025
Designed and deployed AI solutions leveraging LLMs (GPT, BERT), Retrieval-Augmented Generation architectures, and vector databases like FAISS and Pinecone for scalable, context-aware language tasks. Developed conversational AI applications with LangChain supporting multi-step reasoning workflows for personalized interactions. Applied reinforcement and supervised learning techniques including K-Means, SVM, and Random Forest using Scikit-learn and deep learning frameworks. Automated large-scale data pipelines using Airflow, Python, NumPy, and Pandas. Implemented computer vision pipelines with OpenCV for document processing. Deployed and maintained deep learning and NLP models with TensorFlow, PyTorch, spaCy, and Hugging Face Transformers integrating into production via Docker, Kubernetes, and SageMaker. Built scalable CI/CD pipelines for ML workflows with Jenkins, Git, and AWS services.
Data Scientist / ML Engineer at Optum
June 30, 2022 - August 26, 2025
Developed and fine-tuned deep learning and neural network models for classification and prediction using backpropagation, hyperparameter tuning, and regularization. Designed full machine learning solutions leveraging AWS services such as SageMaker, EC2, Lambda, and S3 for training and deployment. Applied NLP techniques including TF-IDF, Word2Vec, Bag-of-Words, stemming, and lemmatization to derive insights from unstructured text. Built and managed ML-specific CI/CD pipelines with AWS CodePipeline and CodeBuild to automate testing, retraining, and deployment. Engineered ETL workflows using Informatica integrating MS SQL Server data. Created interactive dashboards and visualizations with Tableau, Power BI, and Python to communicate insights. Collaborated closely with cross-functional teams to ensure alignment of AI initiatives with business goals, model explainability, and measurable impact.
Gen AI Developer at ABM Industries
December 1, 2023 - Present
Designed and deployed advanced Generative AI solutions using GPT-3, GPT-4, and BERT for content creation, summarization, and conversational AI applications. Built and integrated AI-driven applications with LangChain, orchestrated multi-step workflows for question answering, document processing, and automation. Developed Retrieval-Augmented Generation (RAG) systems with vector databases like FAISS and Pinecone to produce factually accurate outputs. Engineered prompt optimizations for domain-specific tasks and deployed LLMs on AWS SageMaker and Google Vertex AI, ensuring scalability and performance. Integrated generative AI with external APIs and enterprise systems across healthcare, e-commerce, and finance sectors. Built real-time chatbots offering context-aware conversational experiences. Maintained CI/CD pipelines using Jenkins, Docker, and Kubernetes for model deployment and monitoring. Developed computer vision algorithms for object detection extending AI capabilities beyond text. E
Programmer (AI Engineer) Analyst at Cognizant Technology Solutions
July 31, 2023 - August 26, 2025
Designed and deployed Retrieval-Augmented Generation (RAG) systems using LangChain, FAISS, and ChromaDB integrating LLMs such as GPT-4 and LLaMA2 for use cases like claim fraud detection, code extraction, and document summarization with context-aware accurate outputs. Architected AI infrastructure on AWS using SageMaker, EC2, Lambda, and S3; integrated Glue Catalog and Athena for metadata querying in ML data lakes. Developed NLP pipelines with BERT, T5, SBERT, spaCy, NLTK to convert unstructured text into actionable insights supporting clinical and chatbot applications. Built tailored CI/CD pipelines for AI workflows with AWS CodePipeline, Docker, Kubernetes, and Terraform to enable automated retraining and deployment. Engineered ETL pipelines with Informatica and Apache Airflow handling data from MS SQL Server and healthcare repositories for modeling and analytics. Developed and fine-tuned deep learning and computer vision models for dental image analysis, document verification, and N
Machine Learning Data Associate at Amazon
December 1, 2022 - August 26, 2025
Designed and deployed AI solutions using LLMs (GPT, BERT) and RAG architectures with vector databases (FAISS, Pinecone) for scalable context-aware language tasks. Developed conversational AI applications with LangChain for real-time personalized interaction and automation. Applied reinforcement learning from human feedback and supervised/unsupervised algorithms using Scikit-learn and deep learning frameworks for behavioral modeling and segmentation. Built automated data pipelines with Airflow, Python, NumPy, Pandas; implemented computer vision document processing with OpenCV. Deployed and maintained models via TensorFlow, PyTorch, spaCy, Hugging Face Transformers within production environments using Docker, Kubernetes, and AWS SageMaker. Created scalable CI/CD pipelines using Jenkins, Git, and cloud tools incorporating versioning, testing, and monitoring to ensure robust AI deployments.
Data Scientist / ML Engineer at Optum
June 30, 2022 - August 26, 2025
Developed and fine-tuned deep learning and neural network models for classification and prediction tasks using tuning and regularization to improve performance. Designed and implemented machine learning solutions leveraging AWS services such as SageMaker, EC2, Lambda, and S3. Applied NLP techniques including TF-IDF, Word2Vec, Bag-of-Words, stemming, and lemmatization to extract insights from unstructured text data. Built and managed CI/CD pipelines tailored for ML workflows with AWS CodePipeline and CodeBuild ensuring model automation. Engineered ETL workflows with Informatica integrating structured and semi-structured data from MS SQL Server for analytics. Developed interactive dashboards and visualizations with Tableau, Power BI, and custom Python tools to communicate KPIs and data insights. Collaborated cross-functionally to align ML initiatives with business objectives, focusing on model explainability and impact.
Gen AI Developer at ABM Industries
December 1, 2023 - Present
Designed and deployed enterprise-grade Generative AI solutions using GPT-3, GPT-4, and BERT to enhance content creation, summarization, and conversational AI across healthcare, e-commerce, and finance domains. Built and integrated Retrieval-Augmented Generation (RAG) systems with FAISS and Pinecone to deliver context-aware, factually accurate responses, reducing manual query resolution time by 27%. Engineered and optimized prompt strategies for the OpenAI API, enabling domain-specific automation and personalized content generation. Deployed large-scale LLMs on AWS SageMaker and Google Vertex AI, ensuring high availability, scalability, and reliable production performance. Implemented serverless and container-based deployments (AWS Lambda, Docker, Kubernetes) and CI/CD pipelines with Jenkins. Extended AI capabilities into computer vision with OpenCV, TensorFlow, and Keras. Applied SHAP-based explainability and fairness checks to deployed systems.
Programmer (AI Engineer) Analyst at Cognizant Technology Solutions
July 1, 2023 - September 25, 2025
Designed and deployed Retrieval-Augmented Generation (RAG) systems using LangChain, FAISS, and ChromaDB, integrating LLMs (GPT-4, LLaMA2) to support claim fraud detection, medical code extraction, and intelligent document summarization. Architected and managed AI infrastructure on AWS (SageMaker, EC2, Lambda, S3, Glue Catalog, Athena) to enable scalable model training, inference, and metadata querying in real-time ML data lakes. Enhanced claim fraud detection efficiency by 22% through optimized RAG workflows and metadata retrieval pipelines. Built NLP pipelines using BERT, T5, SBERT, SpaCy, and NLTK for clinical code recommendations, evidence retrieval, RTE, and chatbot responses. Maintained CI/CD pipelines (CodePipeline, CodeBuild, Docker, Kubernetes, Terraform) and engineered ETL workflows with Informatica and Apache Airflow. Fine-tuned DL/CV models (TensorFlow, PyTorch) for healthcare tasks and collaborated in Agile teams to deliver end-to-end AI solutions.
Machine Learning Data Associate at Amazon
December 1, 2022 - September 25, 2025
Designed and deployed advanced AI solutions leveraging LLMs (GPT, BERT) and RAG architectures with FAISS and Pinecone for scalable, context-aware language understanding and generation tasks. Built and integrated conversational AI applications with LangChain, orchestrating multi-step reasoning workflows for real-time, personalized interaction. Reduced manual data labeling time by 18% through automated annotation pipelines and intelligent data preprocessing scripts. Applied RLHF and ML techniques for behavioral modeling and segmentation, improving accuracy. Developed automated data pipelines with Apache Airflow and Python, and built CV pipelines with OpenCV for document processing. Deployed NLP and CV models in production using Docker, Kubernetes, and AWS SageMaker; established CI/CD pipelines with Jenkins and cloud-native tooling.
Data Scientist / ML Engineer at Optum
June 1, 2022 - September 25, 2025
Developed and fine-tuned deep learning and neural network models for classification and predictive analytics, achieving improved forecast accuracy. Implemented end-to-end ML solutions on AWS (SageMaker, EC2, Lambda, S3) for scalable training and deployment. Increased patient risk prediction accuracy by 15% via feature engineering and hyperparameter optimization. Applied NLP techniques to extract insights from unstructured text for analytics and decision-making. Built CI/CD pipelines for ML workflows and engineered large-scale ETL processes with Informatica. Created dashboards with Tableau and Power BI to communicate insights to stakeholders and collaborated across teams to ensure model explainability and business impact.

Education

Master of Science (M.S.): Management Information Systems at University Of Illinois, Springfield, Illinois
August 1, 2023 - May 31, 2025
Bachelor of Technology: Aeronautical Engineering at Jawaharlal Nehru Technological University, Hyderabad, India
August 1, 2015 - May 31, 2019
Master of Science (M.S.): Management Information Systems at University Of Illinois, Springfield, Illinois
August 1, 2023 - May 1, 2025
Bachelor of Technology: Aeronautical Engineering at Jawaharlal Nehru Technological University, Hyderabad, India
August 1, 2015 - May 1, 2019
Master of Science (M.S.): Management Information Systems at University Of Illinois, Springfield, Illinois
August 1, 2023 - May 1, 2025
Bachelor of Technology: Aeronautical Engineering at Jawaharlal Nehru Technological University, Hyderabad, India
August 1, 2015 - May 1, 2019
Master of Science in Management Information Systems at University Of Illinois, Springfield
August 1, 2023 - May 1, 2025
Bachelor of Technology in Aeronautical Engineering at Jawaharlal Nehru Technological University, Hyderabad
August 1, 2015 - May 1, 2019

Qualifications

PwC Switzerland - Power BI Job Simulation, Forage
March 1, 2025 - August 26, 2025
ARIS Business Process Analysis Platform
October 1, 2024 - August 26, 2025
SQL Essential Training, National Association of State Boards of Accountancy (NASBA)
April 1, 2024 - August 26, 2025
Udemy - Certified Machine Learning Engineer
January 11, 2030 - August 26, 2025
Power BI Job Simulation, PwC Switzerland - Forage
March 1, 2025 - August 26, 2025
ARIS Business Process Analysis Platform
October 1, 2024 - August 26, 2025
SQL Essential Training, National Association of State Boards of Accountancy (NASBA)
April 1, 2024 - August 26, 2025
Certified Machine Learning Engineer, Udemy
January 11, 2030 - August 26, 2025
PwC Switzerland - Power BI Job Simulation, Forage
March 1, 2025 - August 26, 2025
ARIS Business Process Analysis Platform
October 1, 2024 - August 26, 2025
SQL Essential Training, National Association of State Boards of Accountancy (NASBA)
April 1, 2024 - August 26, 2025
Udemy - Certified Machine Learning Engineer
January 11, 2030 - August 26, 2025
AWS Educate Machine Learning Foundations
July 1, 2025 - September 25, 2025
Introducing Generative AI with AWS
July 1, 2025 - September 25, 2025
ARIS Business Process Analysis Platform
October 1, 2024 - September 25, 2025
SQL Essential Training
April 1, 2024 - September 25, 2025
Certified Machine Learning Engineer
January 11, 2030 - September 25, 2025

Industry Experience

Healthcare, Financial Services, Retail, Software & Internet, Other, Professional Services, Education