I am an AI/ML engineer with over 12 years of experience building scalable machine learning systems, specializing in NLP, predictive modeling, and applications within healthcare and insurance domains. I excel at deploying large language models, automating complex data pipelines, and delivering models that drive critical business decisions, from early diagnosis to operational risk. I am passionate about bridging research and engineering to translate cutting-edge research into practical, production-ready solutions. I enjoy solving high-impact problems at the intersection of data, language, and decision-making, and mentoring junior engineers to foster knowledge sharing and a high-performing team culture.

Prasanna Pandey

I am an AI/ML engineer with over 12 years of experience building scalable machine learning systems, specializing in NLP, predictive modeling, and applications within healthcare and insurance domains. I excel at deploying large language models, automating complex data pipelines, and delivering models that drive critical business decisions, from early diagnosis to operational risk. I am passionate about bridging research and engineering to translate cutting-edge research into practical, production-ready solutions. I enjoy solving high-impact problems at the intersection of data, language, and decision-making, and mentoring junior engineers to foster knowledge sharing and a high-performing team culture.

Available to hire

I am an AI/ML engineer with over 12 years of experience building scalable machine learning systems, specializing in NLP, predictive modeling, and applications within healthcare and insurance domains. I excel at deploying large language models, automating complex data pipelines, and delivering models that drive critical business decisions, from early diagnosis to operational risk.

I am passionate about bridging research and engineering to translate cutting-edge research into practical, production-ready solutions. I enjoy solving high-impact problems at the intersection of data, language, and decision-making, and mentoring junior engineers to foster knowledge sharing and a high-performing team culture.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Work Experience

Engineering Manager at Cedar Gate Technologies
February 29, 2024 - August 28, 2025
Led end-to-end development and recalibration of over 20 ML models, improving predictive accuracy by up to 15% and scaling delivery across 40+ client environments via robust versioning and reusable pipeline design. Architected and implemented LLM-driven schema automation pipelines, reducing manual mapping by 60%, accelerating onboarding, and enhancing data consistency. Delivered a high-impact diagnostic prediction model identifying 80% of undiagnosed diabetes cases a year in advance, gaining national recognition and supporting early clinical interventions. Oversaw migration to Spark 3.3 on AWS EMR including comprehensive compatibility testing and refactoring to ensure zero downtime deployment and 25% reduction in training latency. Integrated ChatGPT into enterprise analytics workflows enabling privacy-aware prompt engineering and de-identification workflows. Mentored junior engineers, institutionalized ML knowledge-sharing, and led training on time series forecasting, embeddings, and RA
Principal Software Engineer at Cedar Gate Technologies
January 1, 2022 - August 28, 2025
Founded and led the company's machine learning function, transforming R&D initiatives into production-grade predictive models that drove health care risk stratification, cost containment, and early interventions. Developed and deployed predictive models for chronic disease detection and ER visit forecasting, leveraging healthcare claims data to support early interventions and reduce avoidable admissions. Built real-time ML inference pipelines on PySpark and AWS EMR, reducing latency by 70% and optimizing infrastructure costs for large-scale healthcare data applications. Engineered secure, distributed ETL workflows using S3, LZ4 compression, and AWS-KMS encryption, improving throughput by 20% and ensuring HIPAA-compliant data handling. Applied neural networks and representation learning (embeddings, EFA/CFA) to extract hidden signals from clinical datasets, enhancing model performance and interpretability. Led internal research initiatives exploring time series modeling, NLP, and cluste
Senior Software Engineer at Deerwalk Services Pvt. Ltd
January 1, 2018 - August 28, 2025
Designed and automated a disaster recovery framework using Ansible and AWS KMS encryption, reducing recovery time by 40% and ensuring secure multi-region data resilience for production systems. Led the migration of ETL infrastructure from Hadoop1 to Hadoop2 on AWS EMR, improving parallelism, increasing throughput, and reducing cloud costs across large-scale data pipelines. Developed and optimized high-volume data ingestion pipelines in Python for heterogeneous healthcare datasets, improving daily processing throughput and accelerating downstream analytics workflows. Diagnosed and stabilized Elasticsearch cluster performance through index lifecycle policies and capacity planning, increasing system stability by 35% and reducing unplanned downtime. Redesigned and unified batch data processing workflows into a centralized dashboard UI, improving operational visibility and reducing manual triage effort by over 50%. Collaborated cross-functionally with DevOps, Security, and Cloud IT teams to
Software Engineer at Deerwalk Services Pvt. Ltd
November 30, 2015 - August 28, 2025
Designed and deployed a clinical NLP pipeline using OpenNLP, UIMA, and YTEX to extract ICD/CPT codes from unstructured medical text, laying the foundation for scalable healthcare analytics and semantic modeling. Built distributed ETL pipelines on AWS EMR using Hadoop and Cascading, standardizing raw healthcare datasets and ensuring regulatory-compliant integration with analytics platforms. Developed RESTful APIs for structured data extraction from free-text clinical documents, accelerating diagnosis and procedure tagging while improving downstream model readiness. Evaluated and benchmarked early-stage big data storage technologies (Hive, Redshift, CrateDB), contributing to the architectural roadmap for data lake adoption and scalable retrieval systems. Engineered high-volume data ingestion pipelines from heterogeneous sources, ensuring consistency, integrity, and performance across distributed environments. Collaborated on foundational architecture decisions supporting cloud adoption a

Education

Master of Computer Applications (MCA) at Sikkim Manipal University of Health, Medical and Technological Sciences, India
January 11, 2030 - August 28, 2025
Bachelors of Computer Applications (BCA) at Makhala Chaturvedi National University of Journalism and Communication, India
January 11, 2030 - August 28, 2025
Liberal Arts at Reed College, USA
January 11, 2030 - August 28, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Financial Services, Software & Internet