Aakar Shah

Available to hire

Hi, I’m Aakar Shah, a passionate Data Scientist with a Master’s degree in Data Science from the University of Glasgow. I love building AI-driven tools that make an impact, especially in global development and healthcare. With a strong background in machine learning, natural language processing, and cloud technologies, I enjoy collaborating with diverse teams to deliver scalable and efficient AI solutions.

Throughout my career, I’ve had the opportunity to work on projects funded by global organizations like the Gates Foundation and FCDO. I’m excited about continuing to innovate and contribute to meaningful projects that solve real-world problems using data and AI.

Work Experience

Data Scientist at CAB International
October 1, 2024 - Present
Deliver AI-driven tools for international development and global food policymaking, leading projects funded by FCDO, the Gates Foundation, and other global stakeholders. Developed advanced classifiers that cut systematic review processing time by 45%, collaborated with global experts to tailor NLP pipelines, engineered scalable extraction tools with a 25% accuracy improvement, and designed interactive dashboards that boosted expert engagement by 30%. Architected and deployed the Juno Chatbot, which processes complex queries with 92% accuracy; integrated DeepSeek R1 and a fine-tuned Mistral 7B model, improving response time by 60% and increasing user satisfaction by 35%; built end-to-end data pipelines with 99.9% uptime and real-time dashboards; and coordinated global cross-functional teams for system scalability and reliability.
Data Scientist at Creative Colab AI
October 31, 2024 - July 21, 2025
Led the AI Assurance initiative, analyzing data from diverse AI companies to develop a model for evaluating adherence to the EU AI Act. Engineered an evaluation system combining NLP, sentiment analysis, and text analysis, hosted on AWS for scalability. Designed a user-friendly frontend for company URL input and report generation, and provided actionable insights for AI certification optimization. Led development of a Splitflap machine with an AI backend that uses a Hugging Face LLM for personalized storytelling and interaction. Semantic search and sentiment analysis match story characteristics to user personalities and let the machine emulate a user's personality to deepen the connection.
Machine Learning/NLP Consultant at OncoFlow
February 28, 2025 - July 21, 2025
Developed an automated treatment planning system combining Retrieval-Augmented Generation (RAG) with a fine-tuned LLaMA-3B model on health data to produce personalized treatment plans. Engineered precise data extraction from clinical reports to surface relevant metrics. Built a multi-layered evaluation system that checks recommendations against medical guidelines, significantly reducing manual review time. Collaborated with medical professionals and developers to align the AI with clinical protocols, improving patient management processes.
Machine Learning Engineer at Avnio
September 30, 2022 - July 21, 2025
Integrated NLP/NLG features into a Salesforce-native product using Hugging Face and Elasticsearch, enhancing functionality and customer engagement. Led a team deploying an alternative to GPT-3 via a Flask API and web servers. Reduced the error rate by 80% by fine-tuning semantic search models, improving knowledge base querying. Performed A/B testing and statistical modeling on large volumes of customer data. Developed an XLNet-based model achieving 95% accuracy in question detection in RFPs. Refined a BERT-based model to detect semantic duplicates with 80% accuracy. Delivered a semantic content search system using Haystack for an intuitive search experience.

Education

Master of Science at University of Glasgow
September 1, 2022 - September 30, 2023
Bachelor of Technology at Pandit Deendayal Petroleum University
August 1, 2016 - August 31, 2020

Qualifications

Kaggle Challenge DeepLearning
Hackathon (IIC, PDPU)
Crash Course on Python (Coursera)
Managing Machine Learning Projects with Google Cloud (Coursera)
Stock Analysis: Create a Buy Signal Filter using R and the Quantmod Package (Coursera)

Industry Experience

Government, Non-Profit Organization, Healthcare, Life Sciences, Software & Internet