I’m Jayesh Locharla, an AI/ML Engineer with around 4 years of experience delivering enterprise-grade AI solutions across finance, retail, and healthcare. I build practical systems using Python, TensorFlow, PyTorch, XGBoost, and state-of-the-art NLP and generative AI tools to extract actionable insights and automate complex workflows. I thrive in cross-functional teams and enjoy turning data into tangible business impact through scalable ML pipelines, semantic search, and robust deployment practices. I’ve delivered measurable outcomes such as reducing analyst turnaround from multi-day to same-day delivery and achieving significant cost savings through prompt engineering, model fine-tuning, and production-ready deployments on AWS with MLflow, Docker, and Kubernetes. My hands-on experience spans LLMs (GPT-family, LLaMA, BERT), RAG pipelines, and end-to-end ML lifecycle management, from research to production monitoring.

Jayesh Locharla

I’m Jayesh Locharla, an AI/ML Engineer with around 4 years of experience delivering enterprise-grade AI solutions across finance, retail, and healthcare. I build practical systems using Python, TensorFlow, PyTorch, XGBoost, and state-of-the-art NLP and generative AI tools to extract actionable insights and automate complex workflows. I thrive in cross-functional teams and enjoy turning data into tangible business impact through scalable ML pipelines, semantic search, and robust deployment practices. I’ve delivered measurable outcomes such as reducing analyst turnaround from multi-day to same-day delivery and achieving significant cost savings through prompt engineering, model fine-tuning, and production-ready deployments on AWS with MLflow, Docker, and Kubernetes. My hands-on experience spans LLMs (GPT-family, LLaMA, BERT), RAG pipelines, and end-to-end ML lifecycle management, from research to production monitoring.

Available to hire

I’m Jayesh Locharla, an AI/ML Engineer with around 4 years of experience delivering enterprise-grade AI solutions across finance, retail, and healthcare. I build practical systems using Python, TensorFlow, PyTorch, XGBoost, and state-of-the-art NLP and generative AI tools to extract actionable insights and automate complex workflows. I thrive in cross-functional teams and enjoy turning data into tangible business impact through scalable ML pipelines, semantic search, and robust deployment practices.

I’ve delivered measurable outcomes such as reducing analyst turnaround from multi-day to same-day delivery and achieving significant cost savings through prompt engineering, model fine-tuning, and production-ready deployments on AWS with MLflow, Docker, and Kubernetes. My hands-on experience spans LLMs (GPT-family, LLaMA, BERT), RAG pipelines, and end-to-end ML lifecycle management, from research to production monitoring.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert

Work Experience

AI/ML Engineer at Walmart
September 1, 2024 - Present
Designed and deployed Generative AI solutions using LLMs (LLaMA, GPT-family) to automate document understanding for merchandising, compliance, and supplier operations, accelerating analyst processing from 4–5 days to same-day. Programmed RAG pipelines with Pinecone for semantic search across enterprise knowledge sources, reducing information retrieval latency by 45% and boosting AI tool adoption. Built LangChain-based orchestration workflows integrating LLMs with internal SQL systems, APIs, and governed data assets to standardize AI-assisted workflows. Trained PyTorch-based recommendation and ranking models for product discovery, achieving real-time performance with reduced inference latency by 20 ms per request. Established prompt engineering standards and reusable prompt libraries, cutting rework and manual review while delivering about $70,000 in annual savings. Implemented prompt tuning/LLM fine-tuning (LoRA/PEFT/adapters) for domain-specific merchandising and compliance tasks, i
Machine Learning Scientist at Goldman Sachs
January 1, 2021 - July 1, 2023
Improved credit risk assessment by 18% by building TensorFlow and XGBoost models on 9M historical loans and transactions, enabling risk teams to prioritize high-risk accounts and reduce manual review cycles. Reduced fraud investigation latency by 45 minutes per batch by implementing Isolation Forest and autoencoder-based anomaly detection on 1,500+ suspicious transactions monthly. Increased portfolio forecasting accuracy by 22% using LSTM and ARIMA models for market exposure and trading volume prediction, supporting more precise hedging strategies for trading desks. Automated compliance document analysis, cutting review time from 3 hours to 50 minutes per report, by deploying NLP pipelines with Hugging Face Transformers and spaCy to extract actionable insights from audit logs, emails, and trade notes. Applied model interpretability frameworks with SHAP, LIME, and PCA, highlighting top predictors across risk models and ensuring audit-ready transparency for regulatory reporting. Rolled o

Education

Master of Science in Computer Science at Florida State University
January 11, 2030 - February 26, 2026
Bachelor of Technology in Computer Science (Specialization in Big Data Analytics) at SRM Institute of Science & Technology, Chennai
January 11, 2030 - February 26, 2026

Qualifications

Add your qualifications or awards here.

Industry Experience

Financial Services, Retail, Healthcare