Vaibhav Upadhyay

Available to hire

Hi, I’m Vaibhav Upadhyay, a passionate Data Scientist and AI Engineer with a strong background in Machine Learning, NLP, and Data Science. I’ve worked on a variety of projects ranging from developing healthcare conversational AI solutions to fine-tuning language models and building web crawlers. I enjoy tackling complex problems and turning data into actionable insights to drive strategic decisions.

With a Master’s in Data Science from the University of Glasgow and diverse experience across industries including healthcare and technology, I bring a solid foundation in data engineering, cloud deployments, and advanced AI techniques. I’m always eager to learn and collaborate on innovative projects that push the boundaries of what’s possible with AI and data.

Language

Javanese: Advanced
Bashkir: Intermediate

Work Experience

Machine Learning/NLP Engineer at Talking Medicines
May 1, 2024 - Present
Led the development of Drug-GPT, an NLP product that generated £125k in revenue within 3 months. Managed its design and implementation, using state-of-the-art NLP techniques to improve chatbot response accuracy over large unstructured healthcare datasets. Established scalable infrastructure on Azure services for data curation, analysis pipelines, and chatbot deployment. Developed a custom Retrieval-Augmented Generation (RAG) system using the LangGraph framework and the Kuzu graph database. Built a synthetic data generation pipeline powered by Llama 3.2 for training proprietary classification and Named Entity Recognition (NER) models. Delivered client projects including the development and fine-tuning of Healthcare Professional classifiers, clinical NER models, and sentiment analysis and opinion mining models. Created interactive Tableau dashboards for healthcare data visualization, and led project management, including SOW formulation and stakeholder communication.
Data Scientist (Part Time) at Tiquu
May 31, 2024 - August 1, 2025
Developed a label prediction model for a scientific Q&A text corpus using semantic text analysis with transformer models such as SBERT. Designed and implemented an AWS-based web crawler that retrieves scientific journal documents on a schedule from sources including Elsevier and Science Nature.
Data Scientist at Infosys Ltd. (Client Apple Inc.)
August 31, 2022 - August 1, 2025
Developed and fine-tuned NLP models using RoBERTa to automate user issue resolution, reducing operational costs by 10%. Created a Slack bot that uses a fine-tuned GPT model integrated with the Bolt framework for issue summarization and sentiment-based ticket resolution, deployed on AWS Lambda and used by 2,000+ employees. Prepared data, engineered features, and trained regression and classification models (XGBoost, SVR) for fraud detection and sales prediction on large customer datasets. Conducted A/B testing, hypothesis testing, and statistical modeling on hourly batches of approximately 10 GB of data. Designed and maintained ETL pipelines using PL/SQL for data warehousing and business intelligence. Developed Python and SQL pipelines for real-time sales data aggregation and reporting to support business decisions during critical product launch events.
Data Analyst at Infosys Ltd. (Client Apple Inc.)
December 31, 2019 - August 1, 2025
Created data cleansing and aggregation scripts for sales forecasting using Python and SQL, ensuring timely report data extraction. Maintained and enhanced monitoring scripts on cloud servers, providing system health alerts using Python and Bash. Automated quarterly and annual cloud server activities including Daylight Saving Time switch, failovers, GDPR compliance, and database health monitoring. Collaborated with DB admins to implement user requirements into high-performance data applications supporting real-time dashboards in Tableau and Business Objects.
System Engineer Trainee (Internship) at Infosys Ltd.
June 30, 2017 - August 1, 2025
Developed a dynamic web game implementing the Model View Controller architecture using the Spring framework in Java.

Education

Master of Science at University of Glasgow
September 1, 2022 - September 30, 2023
Bachelor of Technology at Galgotias University
September 1, 2013 - June 30, 2017

Qualifications

Machine Learning (Coursera)
Large Language Models with Semantic Search (deeplearning.ai)
Langchain for LLM development (deeplearning.ai)
Finetuning LLMs (deeplearning.ai)
Web development Framework (Microsoft)

Industry Experience

Healthcare, Software & Internet, Professional Services, Financial Services