I'm a senior machine learning engineer with 14+ years of experience designing and deploying data-driven solutions across healthcare, finance, and retail. I specialize in end-to-end ML lifecycles, from data collection and feature engineering to model training, evaluation, deployment, and monitoring. I enjoy building scalable AI systems that deliver tangible business outcomes and are compliant with healthcare and data privacy standards. My strengths include building GenAI and LLM-powered applications, designing robust recommender systems, advanced retrieval and vector search, MLOps, and creating production-grade AI pipelines on AWS, Azure, and GCP. I collaborate across cross-functional teams to translate business goals into reliable technical solutions, continuously learn from user behavior, and mentor engineers to raise the team's capabilities.

Santosh Karki

I'm a senior machine learning engineer with 14+ years of experience designing and deploying data-driven solutions across healthcare, finance, and retail. I specialize in end-to-end ML lifecycles, from data collection and feature engineering to model training, evaluation, deployment, and monitoring. I enjoy building scalable AI systems that deliver tangible business outcomes and are compliant with healthcare and data privacy standards. My strengths include building GenAI and LLM-powered applications, designing robust recommender systems, advanced retrieval and vector search, MLOps, and creating production-grade AI pipelines on AWS, Azure, and GCP. I collaborate across cross-functional teams to translate business goals into reliable technical solutions, continuously learn from user behavior, and mentor engineers to raise the team's capabilities.

Available to hire

I’m a senior machine learning engineer with 14+ years of experience designing and deploying data-driven solutions across healthcare, finance, and retail. I specialize in end-to-end ML lifecycles, from data collection and feature engineering to model training, evaluation, deployment, and monitoring. I enjoy building scalable AI systems that deliver tangible business outcomes and are compliant with healthcare and data privacy standards.

My strengths include building GenAI and LLM-powered applications, designing robust recommender systems, advanced retrieval and vector search, MLOps, and creating production-grade AI pipelines on AWS, Azure, and GCP. I collaborate across cross-functional teams to translate business goals into reliable technical solutions, continuously learn from user behavior, and mentor engineers to raise the team’s capabilities.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Gen AI/Machine Learning Engineer at McKesson
September 1, 2024 - Present
Designed GenAI solutions that automated daily healthcare operations such as eligibility checks, coverage discovery, preregistration and claim preparation using GenAI-driven workflows, reducing manual processing effort by 30–40% across operational teams. Built AI assistants that generated personalized payment options and patient communication templates to improve financial transparency and user experience. Optimized prompts for GPT, Claude, Gemini and Mistral models to improve the accuracy and reliability of automated reporting, payer rule interpretation and denial guidance and reduced incorrect responses by 25%.
Machine Learning Engineer at Wells Fargo
October 1, 2021 - August 1, 2024
Participated in the full ML lifecycle, including data collection, online model training, evaluation, deployment, and monitoring. Built real-time user profiling pipelines processing billions of clicks daily with sub-second latency. Implemented online learning workflows with Bayesian Personalized Ranking (BPR) for collaborative filtering and dynamic preference adaptation. Developed online embedding updates for users and items, improving personalization and reducing cold-start issues.
Machine Learning Engineer at Intuit
March 1, 2018 - September 1, 2021
Designed and deployed multi-stage recommendation pipelines, built ANN-based vector retrieval systems with Faiss, Redis, and Elasticsearch achieving low-latency retrieval at large scale. Built deep learning ranking models leveraging user behavior sequences and contextual signals; implemented MCP tooling and agent orchestration with LangChain/LlamaIndex for robust enterprise workflows.
Data Scientist at The Home Depot
April 1, 2015 - February 1, 2018
Developed data-driven pricing, inventory and customer engagement models; built recommender systems; performed market basket analysis; created ETL pipelines; delivered dashboards to stakeholders; collaborated with cross-functional teams to drive measurable ROI.
Data Analyst at Medtronic
July 1, 2012 - March 1, 2015
Created and analyzed business requirements for data solutions; developed data models, ETL mappings, data dictionaries, and governance artifacts; supported data integration for BI and analytics.

Education

Bachelor Degree in Computer Science at Rochester Institute of Technology, New York
January 11, 2030 - January 1, 2012
Bachelor Degree in Computer Science at Rochester Institute of Technology, New York
January 1, 2012 - February 17, 2026

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Financial Services, Software & Internet, Retail, Professional Services, Other