I am an experienced Data Scientist with over 3 years of expertise in machine learning, deep learning, and data analysis. I specialize in developing and deploying models using Python, SQL, and cloud technologies, with a focus on NLP, large language models, generative AI, and MLOps. I excel in building end-to-end AI solutions and collaborating with cross-functional teams to drive data-driven strategies that enhance business outcomes. Throughout my career, I have built predictive models, chatbots, and ETL pipelines, achieving significant improvements in customer retention, fraud detection, and healthcare cost reduction. With strong skills in frameworks like Scikit-learn, TensorFlow, PyTorch, and cloud platforms such as AWS and Azure, I am passionate about using AI to solve complex problems and deliver measurable impact.

Gaurav Salvi

I am an experienced Data Scientist with over 3 years of expertise in machine learning, deep learning, and data analysis. I specialize in developing and deploying models using Python, SQL, and cloud technologies, with a focus on NLP, large language models, generative AI, and MLOps. I excel in building end-to-end AI solutions and collaborating with cross-functional teams to drive data-driven strategies that enhance business outcomes. Throughout my career, I have built predictive models, chatbots, and ETL pipelines, achieving significant improvements in customer retention, fraud detection, and healthcare cost reduction. With strong skills in frameworks like Scikit-learn, TensorFlow, PyTorch, and cloud platforms such as AWS and Azure, I am passionate about using AI to solve complex problems and deliver measurable impact.

Available to hire

I am an experienced Data Scientist with over 3 years of expertise in machine learning, deep learning, and data analysis. I specialize in developing and deploying models using Python, SQL, and cloud technologies, with a focus on NLP, large language models, generative AI, and MLOps. I excel in building end-to-end AI solutions and collaborating with cross-functional teams to drive data-driven strategies that enhance business outcomes.

Throughout my career, I have built predictive models, chatbots, and ETL pipelines, achieving significant improvements in customer retention, fraud detection, and healthcare cost reduction. With strong skills in frameworks like Scikit-learn, TensorFlow, PyTorch, and cloud platforms such as AWS and Azure, I am passionate about using AI to solve complex problems and deliver measurable impact.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more

Language

English
Advanced
Hindi
Advanced

Work Experience

Data Scientist at PNC – Texas, USA
January 1, 2025 - Present
Established and deployed a predictive churn model using logistic regression and feature engineering on healthcare claims data, reducing churn by 20% and improving retention and satisfaction. Developed a fraud detection system with RNN using TensorFlow, achieving 98% accuracy, integrated into a real-time scoring pipeline. Created a LangChain-powered chatbot with OpenAI GPT to automate healthcare insurance support, reducing response time by 50% and increasing NPS. Implemented predictive modeling for hospital readmissions using Random Forest and Logistic Regression, achieving 70% accuracy and reducing healthcare costs by 50%. Designed and deployed an Azure-based ETL pipeline using Azure ML and Hugging Face Transformers (BERT) to analyze patient feedback, extracting sentiment and key themes, boosting satisfaction by 30%.
Data Scientist at TatvaSoft – India
June 30, 2023 - August 26, 2025
Architected an NLP-based recommendation system using NLTK and Scikit-learn, leveraging product metadata and reviews to drive personalized suggestions, boosting click-through rate by 20% and increasing sales by 30%. Integrated Artificial Neural Networks, CNN, and RNN into a hybrid architecture for complex data analysis tasks, improving overall model performance by 30%. Created customer segmentation dashboards in Tableau integrating clustering models (K-Means) and behavioral data, enabling a 20% lift in targeted acquisition campaigns. Trained and tuned XGBoost models to identify promotion-sensitive products, shaping marketing strategy resulting in a 10% spike in peak-season sales and 8% overall profit growth. Employed comprehensive data science and machine learning libraries including Pandas, NumPy, Seaborn, and PyTorch to refine models and workflows, boosting development speed by 25%. Streamlined ML deployment via AWS CodePipeline, reducing deployment time by 50% and improving CI/CD eff
Data Scientist at PNC
January 1, 2025 - Present
Led the development and deployment of predictive models including a logistic regression churn model on healthcare claims data achieving a 20% reduction in churn and improved retention. Developed an RNN-based fraud detection system integrated with real-time scoring pipelines with 98% accuracy. Built a LangChain-powered chatbot using OpenAI GPT to automate healthcare insurance support, reducing response times by 50% and increasing Net Promoter Score. Implemented Random Forest and Logistic Regression models for hospital readmissions with 70% accuracy, cutting healthcare costs by 50%. Designed and deployed an Azure-based ETL pipeline with Azure ML and Hugging Face transformers to analyze patient feedback, extracting sentiment and themes that boosted satisfaction by 30%.
Data Scientist at TatvaSoft
June 30, 2023 - August 26, 2025
Architected an NLP-based recommendation system using NLTK and Scikit-learn leveraging product metadata and reviews, which boosted click-through rate by 20% and sales by 30%. Improved model performance by 30% integrating ANN, CNN, and RNN into hybrid architecture for complex data tasks. Created customer segmentation dashboards in Tableau integrating K-Means clustering and behavioral data to identify high-LTV cohorts, resulting in a 20% lift in targeted acquisition campaigns. Trained and tuned XGBoost models to identify promotion-sensitive products shaping marketing strategy and yielding a 10% spike in peak-season sales and 8% profit growth. Streamlined ML deployment using AWS CodePipeline reducing deployment time by 50% and improving CI/CD efficiency.
Data Scientist at PNC – Texas, USA
January 1, 2025 - Present
Established and deployed a predictive churn model using logistic regression and feature engineering on healthcare claims data, reducing churn by 20%. Developed a fraud detection system using RNN in TensorFlow with 98% accuracy integrated into real-time scoring. Created a LangChain-powered chatbot using OpenAI GPT that reduced healthcare insurance support response time by 50% and increased NPS. Implemented predictive modeling for hospital readmissions using Random Forest and Logistic Regression achieving 70% accuracy and reducing costs by 50%. Designed and deployed an Azure-based ETL pipeline using Azure ML and Hugging Face transformers for patient feedback sentiment analysis, boosting satisfaction by 30%.
Data Scientist at TatvaSoft – India
June 30, 2023 - August 26, 2025
Architected an NLP-based recommendation system using NLTK and Scikit-learn that improved click-through rate by 20% and sales by 30%. Integrated ANN, CNN, and RNN into a hybrid model, boosting overall model performance by 30%. Created customer segmentation dashboards in Tableau using K-Means clustering, enabling a 20% lift in targeted acquisition campaigns. Trained and tuned XGBoost models identifying promotion-sensitive products, driving a 10% spike in peak-season sales and 8% profit growth. Employed a comprehensive suite of data science libraries to refine models and workflows, increasing development speed by 25%. Streamlined ML deployment using AWS CodePipeline, reducing deployment time by 50% and improving CI/CD efficiency.

Education

Master of Science in Information System at California State University – Long Beach, CA
January 11, 2030 - May 1, 2025
Bachelor of Science in Electronic and Telecommunication at University of Mumbai – Mumbai, India
January 11, 2030 - May 1, 2023
Master of Science in Information System at California State University – Long Beach
August 1, 2023 - May 1, 2025
Bachelor of Science in Electronic and Telecommunication at University of Mumbai – Mumbai, India
August 1, 2019 - May 1, 2023
Master of Science in Information System at California State University – Long Beach, CA
January 11, 2030 - May 1, 2025
Bachelor of Science in Electronic and Telecommunication at University of Mumbai – Mumbai, India
January 11, 2030 - May 1, 2023

Qualifications

AWS Certified Solutions Architect – Associate
January 11, 2030 - August 26, 2025
AWS Cloud Practitioner
January 11, 2030 - August 26, 2025
Google Data Analytics
January 11, 2030 - August 26, 2025
Stanford ML Specialization (Coursera)
January 11, 2030 - August 26, 2025
AWS Certified Solutions Architect – Associate
January 11, 2030 - August 26, 2025
AWS Cloud Practitioner
January 11, 2030 - August 26, 2025
Google Data Analytics
January 11, 2030 - August 26, 2025
Stanford ML Specialization (Coursera)
January 11, 2030 - August 26, 2025
AWS Certified Solutions Architect – Associate
January 11, 2030 - August 26, 2025
AWS Cloud Practitioner
January 11, 2030 - August 26, 2025
Google Data Analytics
January 11, 2030 - August 26, 2025
Stanford ML Specialization (Coursera)
January 11, 2030 - August 26, 2025

Industry Experience

Healthcare, Financial Services, Software & Internet, Professional Services, Education