I am Siva Medikonda, an AI Engineer with nearly 5 years of experience delivering AI/ML solutions across the aviation, insurance, and enterprise automation domains. I design and deploy LLM-powered applications, RAG pipelines, and NLP-driven microservices using GPT-4, BERT, and MiniLM. I build end-to-end AI systems, from data ingestion and preprocessing to model fine-tuning, semantic search, and real-time inference, using Hugging Face, LangChain, and PyTorch. I collaborate cross-functionally to translate business goals into scalable, production-grade AI solutions, champion responsible AI practices, and drive enterprise adoption through measurable KPIs. I have a strong cloud-native background on AWS (Lambda, SageMaker, EKS) and have led the orchestration of GenAI workflows, automated ML lifecycles, and the integration of context-aware AI agents for data-rich environments.

SIVA MEDIKONDA

Available to hire

Language

English
Fluent

Work Experience

Gen AI Developer at Athena Health
December 1, 2023 - Present
Built and deployed AI microservices for document understanding, anomaly detection, and semantic search using LLMs like GPT-4, BERT, and MiniLM. Architected end-to-end NLP pipelines integrating LangChain agents, Hugging Face models, and real-time data feeds with RAG workflows. Developed scalable vector search platforms combining FAISS, Pinecone, and metadata filters to improve knowledge retrieval accuracy. Designed serverless ML systems on AWS with automated CI/CD workflows and monitoring dashboards. Fine-tuned foundation models on domain-specific data for summarization, question answering, and entity recognition tasks. Collaborated across data, DevOps, and product teams to deploy AI features in customer-facing platforms with low-latency inference.
AI/ML Scientist at Bajaj Finance
July 1, 2023 - August 26, 2025
Led development of contextual recommendation systems and multi-agent chatbots using LangChain, vector databases, and prompt engineering. Built NLP workflows for summarization, entailment, and classification using models like BERT, T5, and SBERT. Orchestrated ML infrastructure on AWS and GCP using Terraform, Lambda, Vertex AI, and Dockerized training environments. Implemented robust CI/CD pipelines for model deployment, version control, and rollback support. Created ETL pipelines with Airflow and Pandas to process structured and unstructured datasets supporting training pipelines. Designed deep learning models for document parsing and image classification using OpenCV, PyTorch, and Keras.
Data Analyst / Data Scientist at Divi's Laboratories
April 1, 2022 - August 26, 2025
Developed BI dashboards and DAX-optimized data models accelerating decision-making across business units. Built ML models such as SVM, Random Forest, and XGBoost for behavioral segmentation and predictive analytics using Scikit-learn and PySpark MLlib. Developed data pipelines using SSIS and Azure Data Factory. Automated reports via Power Automate and Report Builder. Deployed ML solutions using Amazon SageMaker and automated delivery through AWS CodePipeline and CodeBuild.
Gen AI Developer at Athena Health
December 1, 2023 - Present
Designed and deployed AI solutions leveraging LLMs (GPT-3.5, GPT-4, GPT-4o, BERT, MiniLM) and Hugging Face Transformers to enable real-time summarization, semantic search, and intelligent document understanding for enterprise-grade applications. Built multi-model NLP pipelines to convert unstructured sensor logs and installation documents into structured summaries. Implemented Retrieval-Augmented Generation (RAG) with real-time data feeds for sub-second AI-driven anomaly detection, technical support, and compliance tracking. Integrated context-aware AI agents with LangChain for domain-specific data reasoning, and fine-tuned models for summarization, QA, and document understanding. Developed semantic search systems with vector embeddings and metadata filtering; built scalable AI microservices containerized and orchestrated with Kubernetes and serverless workflows. Deployed AI workloads on AWS (Lambda, SageMaker, EKS) and automated the full ML lifecycle with production-ready GenAI pipelines.
Data (AI/ML) Scientist at Bajaj Finance
July 1, 2023 - October 15, 2025
Designed and deployed Retrieval-Augmented Generation (RAG) systems by integrating LLMs with vector databases (FAISS, Pinecone, ChromaDB) via LangChain to deliver context-aware outputs in information-rich environments. Built scalable ML infrastructure on AWS and GCP (SageMaker, Vertex AI) for training and inference, with CI/CD automation. Developed NLP pipelines for summarization, entailment, classification, and QA using transformers (BERT, T5, SBERT) and NLP libraries. Orchestrated multi-agent reasoning and chat applications with external tools, memory, and APIs to support real-time, multi-turn conversational workflows; implemented prompt strategies (zero-shot, few-shot, chain-of-thought). Automated model deployment, versioning, testing, and rollback; engineered ETL with Apache Airflow for diverse data sources; built CV models for image tasks; applied classical ML techniques with explainability practices.
Data Analyst/Scientist at Divi's Laboratories
April 1, 2022 - October 15, 2025
Designed and developed interactive Power BI dashboards with drill-through reports; optimized data models with DAX for performance. Integrated data from SQL Server and Azure Data Lake into unified datasets; developed ETL pipelines using SSIS and Azure Data Factory. Performed data cleaning, profiling, and feature engineering with Python (Pandas, NumPy). Built ML models (SVM, Random Forest, XGBoost) for predictive insights and deployed models in production using SageMaker with CI/CD automation. Automated report scheduling and alerts with Power Automate; created paginated reports to accelerate distribution.
Gen AI Developer at Athena Health
December 1, 2023 - Present
Designed and deployed advanced AI solutions leveraging LLMs (GPT-3.5, GPT-4, GPT-4o, BERT, MiniLM) and Hugging Face Transformers to enable real-time summarization, semantic search, and intelligent document understanding for enterprise-grade applications. Architected multi-model NLP pipelines to transform unstructured sensor logs and installation documents into structured summaries, enabling improved decision-making. Implemented Retrieval-Augmented Generation (RAG) with real-time data feeds for sub-second responses in anomaly detection, technical support, and compliance tracking. Integrated transformer models with LangChain for context-aware AI agents with dynamic reasoning over domain datasets. Fine-tuned foundation models for summarization, QA, and document understanding tasks. Built semantic search systems with vector embeddings and metadata filtering for fast retrieval from large knowledge bases. Constructed data ingestion and preprocessing pipelines for time-series data and documents.
Data (AI/ML) Scientist at Bajaj Finance
July 1, 2023 - October 15, 2025
Designed and deployed Retrieval-Augmented Generation (RAG) systems by integrating LLMs with vector databases (FAISS, Pinecone, ChromaDB) using LangChain for context-aware outputs. Built scalable ML infrastructure on AWS and GCP (SageMaker, Vertex AI, EC2) with automation via Lambda, Cloud Functions, and Terraform. Developed NLP pipelines for summarization, entailment, classification, and QA using BERT, T5, and SBERT with spaCy and Hugging Face Transformers. Orchestrated multi-agent reasoning and chat apps with LangChain, enabling real-time, multi-turn conversational workflows. Implemented zero-shot, few-shot, and chain-of-thought prompting to optimize outputs. Created CI/CD pipelines using Jenkins, Docker, Kubernetes, and GitHub Actions. Automated ETL pipelines with Airflow, Python, Pandas, and NumPy. Trained DL/CV models for image classification, detection, and document parsing (TensorFlow, PyTorch, OpenCV, Keras). Applied classical ML (Random Forest, SVM, K-Means, XGBoost) with model validation.
Data Analyst/Scientist at Divi's Laboratories
April 1, 2022 - October 15, 2025
Designed and developed interactive Power BI dashboards with drill-through reports and custom visuals. Optimized data models using DAX to improve report performance. Integrated data from SQL Server and Azure Data Lake to build unified datasets. Developed ETL pipelines with SSIS and Azure Data Factory. Performed data cleaning, profiling, and feature engineering using Python (Pandas, NumPy). Built ML models (SVM, Random Forest, XGBoost) to predict customer behavior and support decision-making. Deployed models in production with SageMaker and automated CI/CD via CodePipeline/CodeBuild. Automated report scheduling and alerts via Power Automate and Power BI Service.

Education

Master of Science (M.S.) in Computer Science at New England College, Henniker, NH
August 1, 2023 - May 1, 2025
Bachelor of Technology in Electronics and Communication Engineering at Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, India

Qualifications

Infosys Certified Python Programmer
IBM Certified Data Analysis with Python
IBM Databases and SQL for Data Science with Python
HackerRank SQL (Basic)
Python for Finance: Portfolio Statistical Data Analysis
Data Visualization with Python
IoT Wireless and Cloud Computing (Emerging Technologies)

Industry Experience

Healthcare, Financial Services, Software & Internet, Professional Services, Other