I am a result-oriented data scientist with 9 years of experience, currently driving production-grade AI and analytics solutions across Metro Operations, Media, and Retail. I specialize in end-to-end model development, from data gathering and cleansing to deployment and lifecycle management, with a strong focus on Generative AI, LLMs, and cloud-native architectures on AWS. I pride myself on building scalable, cost-efficient systems and fostering cross-functional collaboration. Beyond delivering robust data science work, I have led teams, built training programs, and implemented observability for AI systems. My work includes designing and deploying multi-agent AI assistants, ML pipelines with Airflow, real-time dashboards, and cost optimization strategies that reduced inference costs and improved deployment frequency. I enjoy solving complex problems and turning data into actionable business impact.

Akshay Nimbalkar

I am a result-oriented data scientist with 9 years of experience, currently driving production-grade AI and analytics solutions across Metro Operations, Media, and Retail. I specialize in end-to-end model development, from data gathering and cleansing to deployment and lifecycle management, with a strong focus on Generative AI, LLMs, and cloud-native architectures on AWS. I pride myself on building scalable, cost-efficient systems and fostering cross-functional collaboration. Beyond delivering robust data science work, I have led teams, built training programs, and implemented observability for AI systems. My work includes designing and deploying multi-agent AI assistants, ML pipelines with Airflow, real-time dashboards, and cost optimization strategies that reduced inference costs and improved deployment frequency. I enjoy solving complex problems and turning data into actionable business impact.

Available to hire

I am a result-oriented data scientist with 9 years of experience, currently driving production-grade AI and analytics solutions across Metro Operations, Media, and Retail. I specialize in end-to-end model development, from data gathering and cleansing to deployment and lifecycle management, with a strong focus on Generative AI, LLMs, and cloud-native architectures on AWS. I pride myself on building scalable, cost-efficient systems and fostering cross-functional collaboration.

Beyond delivering robust data science work, I have led teams, built training programs, and implemented observability for AI systems. My work includes designing and deploying multi-agent AI assistants, ML pipelines with Airflow, real-time dashboards, and cost optimization strategies that reduced inference costs and improved deployment frequency. I enjoy solving complex problems and turning data into actionable business impact.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Data Scientist at Exelixi AI
April 1, 2025 - Present
Designed and deployed an enterprise-grade, multi-agent AI Assistant with a central intent-router handing off to three specialized sub-agents for HR, IT, and Compliance. Developed a full-stack application with React JS front-end, Python FastAPI microservices, and Supabase PostgreSQL for secure role-based data persistence. Managed cloud-native production on Contabo VPS using Docker-compose with automated CI/CD and zero downtime rolling updates. Integrated Model Context Protocol to enrich prompts and reduce hallucinations, with LLM powered by OpenAI GPT-4.1 optimized through dynamic retrieval-augmented generation and caching. Implemented end-to-end observability using Langfuse feeding a Grafana dashboard for latency, token usage, cost, and quality metrics.
Data Scientist – Performance Data Analytics at Keolis - MHI Dubai Metro (RTA), Dubai
April 1, 2025 - August 27, 2025
Designed and deployed ML production pipelines using Airflow automating workflows for data ingestion, feature engineering, training, and real-time inference. Integrated Generative AI in Metro Operations, working cross-functionally. Led development of an LLM-based chatbot 'OCC-Mate' using RAG and OpenAI to streamline OCC operations, reducing error rates by 20%. Trained a team of 8 on Power BI dashboard development and optimization. Implemented performance monitoring dashboards linking Power BI to AWS RDS and NoSQL data for real-time TSA/TSP metric tracking. Preprocessed structured and unstructured data to discover trends and developed predictive models. Reduced model inference costs by 25% and improved deployment frequency by 40% through containerized microservices and cloud cost optimization.
Data Analyst at Majid Al Futtaim Headquarters, Deira, Dubai
October 31, 2022 - August 27, 2025
Formulated product strategies by defining and refining KPIs for performance tracking in cloud environments. Co-developed ETL pipelines with Data Engineers using Airflow to automate workflows and deliver real-time dashboards. Ensured data integrity and consistency by establishing data management best practices. Developed dynamic Power BI dashboards integrating SQL and NoSQL data sources to support strategic decision-making. Acted as data quality gatekeeper, maintaining system reliability and scalability.
Data Scientist at Mondia Media Group, Dubai Media City, Dubai
February 1, 2022 - August 27, 2025
Built a Billing Optimization Model using ML regression and AB testing, increasing successful payments by 13% and adding EGP 850K revenue for Vodafone Egypt within three months. Managed full ML lifecycle including data collection, feature engineering, training, deployment, and continuous performance and cost monitoring.
Data Scientist at Market Xcel Data Matrix | Predictivu, New Delhi
October 31, 2020 - August 27, 2025
Developed Market Media Mix Modeling using regression and simulation that improved advertising ROI with a 19% sales increase at same ad budgets. Applied advanced statistical techniques to optimize marketing spend.
Software Developer at Sapours Technologies, Pune, Maharashtra
October 31, 2016 - September 28, 2018
Developed Optical Character Recognition (OCR) product for Volkswagen client using Java Tesseract API library, delivering enhanced product capabilities.
Senior Data Scientist at Exelixi AI
April 1, 2025 - November 15, 2025
Designed and deployed an enterprise-grade multi-agent AI assistant with a central intent-router and three specialized sub-agents. Built full-stack: React frontend, Python FastAPI microservices, and Supabase for secure, role-based data persistence. Led an AI-driven Document Analysis and Processing system for Bank of Oman (OCR, NLP, and classification) within a conversational interface for real-time interaction, extraction, and validation. Cloud-native deployment on Contabo VPS with Docker Compose, automated CI/CD, and zero-downtime rolling updates. Integrated Model Context Protocol (MCP) to enrich prompts and structured function calls; leveraged GPT-4.1 with dynamic RAG strategies to reduce costs. Implemented end-to-end observability with Langfuse and a Grafana dashboard tracking latency, token usage, cost, and quality scores.
Data Scientist – Performance Data Analytics at Keolis - MHI Dubai Metro (RTA)
April 1, 2025 - April 1, 2025
Designed and deployed ML pipelines in production using Airflow, automating data ingestion, feature engineering, model training, and real-time inference. Integrated Generative AI solutions to improve operational efficiency in Metro Operations; collaborated with cross-functional teams (Operations, IT, Data). Focused on data extraction, cleansing, and analysis from multiple sources to generate insightful business intelligence reports.
Data Scientist at Majid Al Futtaim Headquarters
October 1, 2022 - October 1, 2022
Formulated product strategies by establishing and refining core metric KPIs for continuous performance tracking in a cloud-based environment. Engineered ETL data pipelines with Airflow to ensure robust data flow automation, delivering timely insights for real-time dashboards. Ensured data integrity and consistency by establishing best practices for data management and reliability. Developed Power BI dashboards and implemented performance optimization techniques across SQL/NoSQL data sources.
Data Scientist at Mondia Media Group
February 1, 2022 - February 1, 2022
Built Billing Optimization Model using ML Regression and AB Testing, achieving a 13% increase in successful payments and revenue uplift for Vodafone EG within three months. Oversaw the full ML lifecycle: data collection, feature engineering, model training, deployment, and ongoing monitoring for performance and cost.
Data Scientist at Market Xcel Data Matrix / Predictivu
October 1, 2020 - October 1, 2020
Developed Market Media Mix Modelling using Regression and Simulation techniques, enabling more effective allocation of advertising budgets. Resulted in a 19% increase in sales for the same advertising budget.
Software Developer at Sapours Technologies
June 1, 2018 - June 1, 2018
Developed an OCR product for Volkswagen using Java Tesseract API library.

Education

Masters: Data Science at Aegis School of Data Science
January 1, 2019 - December 31, 2019
Bachelor of Engineering: Computer Science at Pune University
January 1, 2015 - December 31, 2015
Master of Data Science at Aegis School of Data Science – Mumbai
January 11, 2030 - January 1, 2019
Bachelor of Engineering in Computer Science at Pune University – Pune
January 11, 2030 - January 1, 2015

Qualifications

Add your qualifications or awards here.

Industry Experience

Transportation & Logistics, Media & Entertainment, Retail, Software & Internet, Professional Services, Financial Services