Hi, I’m Hani Kancharla. I am a Platform/MLOps Engineer with 5+ years of experience in productionizing AI/ML systems, including 2+ years focused on Generative AI and LLMs. I enjoy building reliable, scalable AI platforms that empower teams to move quickly. I’ve led the integration and orchestration of LLM inference APIs in production, managed Kubernetes clusters, Helm charts, and ArgoCD pipelines for high availability. I’m proficient in Python and cloud platforms (Azure, AWS, GCP), and I focus on observability (Prometheus, Grafana) to continuously improve CI/CD and platform operations.

Hani Kancharla

Hi, I’m Hani Kancharla. I am a Platform/MLOps Engineer with 5+ years of experience in productionizing AI/ML systems, including 2+ years focused on Generative AI and LLMs. I enjoy building reliable, scalable AI platforms that empower teams to move quickly. I’ve led the integration and orchestration of LLM inference APIs in production, managed Kubernetes clusters, Helm charts, and ArgoCD pipelines for high availability. I’m proficient in Python and cloud platforms (Azure, AWS, GCP), and I focus on observability (Prometheus, Grafana) to continuously improve CI/CD and platform operations.

Available to hire

Hi, I’m Hani Kancharla. I am a Platform/MLOps Engineer with 5+ years of experience in productionizing AI/ML systems, including 2+ years focused on Generative AI and LLMs. I enjoy building reliable, scalable AI platforms that empower teams to move quickly.

I’ve led the integration and orchestration of LLM inference APIs in production, managed Kubernetes clusters, Helm charts, and ArgoCD pipelines for high availability. I’m proficient in Python and cloud platforms (Azure, AWS, GCP), and I focus on observability (Prometheus, Grafana) to continuously improve CI/CD and platform operations.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

Gen AI Engineer at UPS
May 1, 2023 - November 6, 2025
Designed and developed deep learning models for automating complex logistics and security threat analysis. Engineered and optimized neural networks for real-time inference, focusing on low-latency requirements. Collaborated with cross-functional teams to integrate AI solutions into existing platforms, ensuring scalability and performance.
AI/ML Engineer at Cigna
August 1, 2021 - August 1, 2021
Anomaly Detection & Model Development: Architected and trained LSTM-based sequence models to identify anomalous patterns in user behavior and process execution trees, reducing false alerts by 30%. Event Correlation & CEP: Implemented event correlation logic using Apache Beam pipelines to process high-volume data streams and detect complex multi-stage security incidents. Model Explainability & MLOps: Integrated SHAP for model interpretability, providing clear insights to stakeholders. Established MLflow for end-to-end model lifecycle management, from experimentation to production. Performance & Validation: Validated model performance against industry benchmarks, achieving a sustained recall of 88% and maintaining a false positive rate below 1% for critical detection use cases.
Associate Engineer at Reliance Jio
May 1, 2018 - May 1, 2018
Developed and maintained data processing pipelines for a large-scale telecommunications network. Gained foundational experience in Python programming and data analysis, supporting senior data scientists in model training and data preparation tasks.
Gen AI Engineer at UPS
May 1, 2023 - November 26, 2025
Designed and implemented scalable pipelines for training and deploying large language models (LLMs), enhancing platform intelligence and user interaction capabilities. Developed reusable CI/CD workflows using GitHub Actions and MLflow for automated model versioning and promotion across development, staging, and production environments. Built high-performance model serving infrastructure using vLLM and Kubernetes, achieving millisecond-level latency for real-time inference. Partnered with data science teams to operationalize AI agents and RAG systems, leveraging vector databases and LangChain for advanced information retrieval, applicable to educational content discovery. Established model governance and approval workflows, ensuring security, compliance, and auditability for production AI systems.
AI/ML Engineer at Cigna
July 1, 2018 - August 1, 2021
Built end-to-end ML pipelines for healthcare analytics using Kubeflow and Docker. Designed structured data intake flows for patient diagnostics and predictive insights. Integrated AI models with existing healthcare platforms for seamless data flow and reporting.
Associate Engineer at Reliance Jio
July 1, 2016 - May 1, 2018
Supported development of AI-driven customer interaction systems for telecom solutions. Assisted in designing conversational flows and data processing pipelines for user behavior analytics.
GenAI Engineer at UPS
May 1, 2023 - Present
Developed enterprise-grade conversational AI using GPT-4o and LLaMA-2 via LangChain, deployed via FastAPI on Azure AKS. Built Retrieval-Augmented Generation (RAG) pipelines with FAISS for context-aware document intelligence. Engineered LangChain-based autonomous agents for task automation, reasoning, and dynamic knowledge retrieval. Automated CI/CD pipelines for AI models using GitHub Actions, improving deployment velocity. Created interactive Streamlit dashboards for real-time LLM performance tracking and user interaction analytics.
Associate Software Engineer at Reliance Jio
July 1, 2016 - May 1, 2018
Developed real-time image classification models using TensorFlow/Keras for IoT and telecom applications. Built and deployed automated ML pipelines on GCP for scalable model training and versioned deployment. Integrated real-time dashboards with Flask and Plotly for operational monitoring and visualization.

Education

Master of Science in Information Science at Lamar University
October 1, 2021 - December 1, 2023
Bachelor of Science in Computer Science at Vignan Institute of Technology and Science
January 11, 2030 - January 1, 2016
Master of Science in Information Science at Lamar University
October 1, 2021 - December 1, 2023
Bachelor of Science in Computer Science at Vignan Institute of Technology and Science
January 11, 2030 - January 1, 2016
Master of Science in Information Science at Lamar University
October 1, 2021 - December 1, 2023
Bachelor of Science in Computer Science at Vignan Institute of Technology and Science, India
January 1, 2016 - November 26, 2025
Master of Science in Information Science at Lamar University, Beaumont, TX
October 1, 2021 - December 1, 2023
Bachelor of Science in Computer Science at Vignan Institute of Technology and Science, India
January 11, 2030 - January 1, 2016
Master of Science in Information Science at Lamar University, TX
January 11, 2030 - January 27, 2026
Bachelor of Science in Computer Science at Vignan Institute of Technology and Science, India
January 11, 2030 - January 27, 2026
Master of Science in Information Science at Lamar University, TX
January 11, 2030 - February 17, 2026
Bachelor of Science in Computer Science at Vignan Institute of Technology and Science, India
January 11, 2030 - February 17, 2026

Qualifications

AWS Certified Cloud Practitioner (AZ-900 Equivalent)
January 11, 2030 - December 31, 2025
Microsoft Certified: Azure AI Engineer Associate
January 11, 2030 - January 27, 2026
Hugging Face Transformers Certified Developer
January 11, 2030 - January 27, 2026
TensorFlow Developer Certificate
January 11, 2030 - January 27, 2026
Microsoft Certified - Azure AI Engineer Associate
January 11, 2030 - February 17, 2026
Hugging Face Transformers Certified Developer
January 11, 2030 - February 17, 2026
TensorFlow Developer Certificate
January 11, 2030 - February 17, 2026

Industry Experience

Software & Internet, Healthcare, Telecommunications, Transportation & Logistics, Education, Professional Services, Media & Entertainment, Other