Hi, I'm Lahari Gurram, an AI Engineer specializing in designing and deploying production-grade machine learning and generative AI solutions using Python, PyTorch, TensorFlow, and LangChain. I enjoy turning complex data into practical, scalable AI applications that empower teams and stakeholders. I have built retrieval-augmented generation pipelines, fine-tuned LLMs (GPT-4, Claude, Gemini), and developed multi-agent workflows to automate processes while improving response quality. I prioritize observability, secure PII-compliant deployments, and close collaboration with cross-functional teams to deliver scalable GenAI solutions on Azure and cloud platforms.

Lahari Gurram

Hi, I'm Lahari Gurram, an AI Engineer specializing in designing and deploying production-grade machine learning and generative AI solutions using Python, PyTorch, TensorFlow, and LangChain. I enjoy turning complex data into practical, scalable AI applications that empower teams and stakeholders. I have built retrieval-augmented generation pipelines, fine-tuned LLMs (GPT-4, Claude, Gemini), and developed multi-agent workflows to automate processes while improving response quality. I prioritize observability, secure PII-compliant deployments, and close collaboration with cross-functional teams to deliver scalable GenAI solutions on Azure and cloud platforms.

Available to hire

Hi, I’m Lahari Gurram, an AI Engineer specializing in designing and deploying production-grade machine learning and generative AI solutions using Python, PyTorch, TensorFlow, and LangChain. I enjoy turning complex data into practical, scalable AI applications that empower teams and stakeholders.
I have built retrieval-augmented generation pipelines, fine-tuned LLMs (GPT-4, Claude, Gemini), and developed multi-agent workflows to automate processes while improving response quality. I prioritize observability, secure PII-compliant deployments, and close collaboration with cross-functional teams to deliver scalable GenAI solutions on Azure and cloud platforms.

See more

Language

English
Fluent

Work Experience

Gen AI Engineer at Kadence
August 1, 2024 - Present
Designed and deployed enterprise-grade Generative AI solutions using LLMs (GPT-4, Claude, Gemini), automating workflows and improving operational efficiency. Developed Retrieval-Augmented Generation (RAG) pipelines with LlamaIndex and vector databases (Pinecone, Weaviate, Milvus, Chroma), improving response relevance by 20% and reducing hallucinations. Engineered prompt strategies, function-calling schemas, and tool integrations, increasing LLM response accuracy across production use cases. Integrated GenAI services with backend systems using Python, Node.js, and REST APIs, scaling to high daily request volumes with stable uptime. Implemented LLM evaluation, observability, and monitoring using LangSmith, Weights & Biases, and Prometheus/Grafana, reducing production regressions by 18%. Fine-tuned and optimized LLMs, embedding models, and rerankers using LoRA/QLoRA/PEFT, reducing inference latency while preserving model quality. Deployed and managed self-hosted open-source LLMs and ML pi
AI Engineer – AI/ML Platform at Grid Dynamics
May 1, 2022 - June 1, 2023
Developed and deployed ML models for classification, regression, and clustering using scikit-learn, XGBoost, and LightGBM; improved prediction accuracy for enterprise datasets by 20%. Performed feature engineering, data cleaning, and preprocessing; evaluated models using precision, recall, F1-score, and ROC-AUC. Built NLP solutions including sentiment analysis, named entity recognition (NER), and text classification using pre-trained embeddings (Word2Vec, GloVe, FastText) and Hugging Face Transformers. Fine-tuned BERT and smaller transformer models for domain-specific text tasks using PyTorch and Hugging Face; improved NLP model relevance. Designed and maintained ML data pipelines using Pandas and SQL for structured and semi-structured datasets; streamlined preprocessing for training and inference by 25%. Assisted in model deployment in dev/test environments with Docker and cloud ML services. Collaborated with cross-functional teams to integrate ML/NLP models into applications, ensurin

Education

Master of Science in Computer Information Systems at Saint Louis University
August 1, 2023 - May 1, 2025
Bachelor of Technology in Computer Science & Engineering (CSE) at Vignan's Lara Institute of Technology & Science
August 1, 2019 - May 1, 2023
Master of Science in Computer Information Systems at Saint Louis University
August 1, 2023 - May 1, 2025
Bachelor of Technology in Computer Science & Engineering (CSE) at Vignan's Lara Institute of Technology & Science
August 1, 2019 - May 1, 2023

Qualifications

Build Real World AI Applications with Gemini and Imagen – Google
January 11, 2030 - March 30, 2026
Develop GenAI Apps with Gemini and Streamlit – Google
January 11, 2030 - March 30, 2026
Prompt Design in Vertex AI – Google
January 11, 2030 - March 30, 2026
Python Basic Training – Andhra Pradesh State Skill Development Corporation (APSSDC)
January 11, 2030 - March 30, 2026
Certified Project Management Associate – Excelerate
January 11, 2030 - March 30, 2026
Web Development Training – Internshala
January 11, 2030 - March 30, 2026
Build Real World AI Applications with Gemini and Imagen – Google
January 11, 2030 - March 30, 2026
Develop GenAI Apps with Gemini and Streamlit – Google
January 11, 2030 - March 30, 2026
Prompt Design in Vertex AI – Google
January 11, 2030 - March 30, 2026
Python Basic Training – Andhra Pradesh State Skill Development Corporation (APSSDC)
January 11, 2030 - March 30, 2026
Certified Project Management Associate – Excelerate
January 11, 2030 - March 30, 2026
Web Development Training – Internshala
January 11, 2030 - March 30, 2026

Industry Experience

Software & Internet, Computers & Electronics, Professional Services