Available to hire
I am Mohitdeep Singh, a Senior AI/ML Engineer with 10+ years of end-to-end experience across GenAI, NLP, computer vision, and predictive modeling. I’ve shipped production-grade ML in e-commerce, OTT streaming, AdTech, and music-tech, aligning solutions with product strategy and market needs.
I excel at ML engineering and data infrastructure—building robust ETL pipelines, training scalable models on Spark and Databricks, and deploying at scale on AWS/GCP/Azure with Kubernetes, Docker, and CI/CD. I thrive in cross-functional teams, mentoring others and translating product goals into impactful ML solutions.
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Language
Afar
Advanced
Javanese
Advanced
Work Experience
Senior AI/ML Engineer at Coupang
February 1, 2024 - PresentDesigned and deployed real-time recommender systems for Coupang’s marketplace, driving an 18% CTR uplift. Built deep learning-based product ranking models using NLP on reviews, search queries, and metadata. Engineered fraud-detection pipelines with anomaly detection, reducing fraudulent transactions by 30%. Deployed ML workflows on AWS SageMaker, Databricks, and orchestrated via Airflow for production stability. Designed feature engineering pipelines processing terabytes of clickstream data daily. Partnered with product managers to align personalization models with e-commerce growth metrics. Integrated recommender engines into microservices and APIs to ensure a low-latency user experience.
Senior ML Engineer at Roku Inc
February 1, 2024 - October 2, 2025Built audience segmentation models for Roku’s advertising platform, boosting ad targeting ROI by ~20%. Designed OTT viewership prediction models for user retention and personalization. Scaled ML pipelines using GCP BigQuery, Spark, and Kafka to process billions of streaming events. Implemented ad inventory optimization models, increasing fill rates while reducing costs. Collaborated with engineering teams to deploy ML services via Kubernetes and Docker. Created dashboards in Tableau and Power BI for campaign performance and ML monitoring. Applied NLP to content metadata to enhance search relevance. Developed real-time bidding algorithms for programmatic ad systems. Contributed to cross-functional AI roadmaps aligning with expansion into OTT advertising.
Senior Machine Learning Developer at Pinterest
May 1, 2022 - October 2, 2025Designed and deployed recommendation systems powering personalized home feeds, increasing user engagement by 22%. Built visual search models using deep learning (CNNs, Transformers). Developed ad ranking and targeting models improving ad conversions. Implemented A/B testing pipelines to evaluate recommender performance. Scaled ML workflows on GCP AI Platform and Databricks processing billions of daily pins and user actions. Collaborated with product teams to integrate Gen AI for content recommendations and automated tag generation. Created fraud/spam detection models to maintain feed integrity. Mentored junior ML engineer and contributed to internal ML playbooks for recommender systems.
Senior Data Scientist at Pandora
October 1, 2020 - October 2, 2025Developed music recommendation algorithms using collaborative filtering and matrix factorization powering personalization for millions of users. Applied NLP on song lyrics, metadata, and user reviews to improve playlist relevance. Built predictive churn models reducing churn by 15% through targeted retention campaigns. Designed and deployed A/B testing frameworks to measure impact of ML-driven personalization features. Partnered with product managers and engineers to deliver data-driven personalization inside the Pandora app. Created ETL workflows with Spark and Hadoop to process billions of listening sessions in real time. Authored internal reports on user behavior patterns influencing product roadmap decisions.
Data Scientist at Radio
December 1, 2015 - October 2, 2025Built auto-recommendation pipelines leveraging collaborative filtering and user-listening behavior. Applied deep learning (CNNs/RNNs) to analyze audio waveforms for genre and mood classification. Designed real-time personalization systems for radio streaming, enhancing session length. Partnered with UX teams to deliver smart playlists powered by ML insights. Integrated cloud-based ML workflows using AWS SageMaker and Kubernetes for production deployment.
Research Intern at Intel Labs
May 1, 2014 - October 2, 2025Researched deep learning architectures for computer vision and speech recognition. Contributed to internal patent filings on AI optimization methods. Applied transfer learning to accelerate training and reduce compute cost. Presented findings in lab-wide seminars and technical reports. Implemented scalable data preprocessing pipelines for large unstructured datasets.
Senior AI/ML Engineer at Coupang
February 1, 2024 - PresentBuilt a multi-objective recommender system for Coupang’s homepage using hybrid filtering and intent modeling; led to 18% CTR boost. Fine-tuned open-source LLMs (LLaMA-2, Mistral) using product catalog data for a customer support chatbot in English. Created route optimization models for last-mile delivery using dynamic programming and travel-time forecasting. Designed anomaly detection on logistics scan data using deep autoencoders, reducing delivery errors by 22%. Deployed real-time inference APIs on AWS EKS with TorchServe, monitoring via Prometheus + Grafana.
Senior ML Engineer at Roku Inc
February 1, 2024 - October 22, 2025Developed predictive models for user retention using XGBoost on daily viewing data, improving churn prediction by 30%. Built NLP-based metadata enhancement pipelines (spaCy + custom tagging rules) to improve ad targeting on TV content. Led development of lookalike audience models for Roku’s DSP using deep autoencoders and K-means clustering. Created automated ML pipeline using GCP Vertex AI for training & deployment of ad ranking models. Partnered with product to design content-based recommendations for Roku Channel using VGG16-based vision features.
Senior Machine Learning Developer at Pinterest
May 1, 2022 - October 22, 2025Developed visual search models using ResNet50 and feature hashing to match pins with similar visual content. Improved home feed personalization using attention-based ranking models, leading to 22% increase in session duration. Built caption recommendation models using GPT-2 via Hugging Face 2020 release, fine-tuned on pin metadata. Created automated pipelines in Databricks to process 2B+ daily events for feature engineering. Developed spam and NSFW detection models using vision transformers and content embeddings.
Senior Data Scientist at Pandora
October 1, 2020 - October 22, 2025Built collaborative filtering and matrix factorization models (ALS) for radio station personalization. Applied LSTM-based sequence modeling to predict next-song preference based on listening history. Designed genre classification using audio signal processing (Librosa) + early CNN models trained on spectrograms. Created churn models using survival analysis and logistic regression with listener demographics & behavior. Wrote production-grade Spark jobs to process 10B+ song plays/month on Hadoop clusters.
Data Scientist at Radio
December 1, 2015 - October 22, 2025Built real-time listener segmentation using K-means clustering on session-level data. Applied CNNs (via Caffe and early TensorFlow) on audio spectrograms for genre classification. Developed rule-based NLP models to tag live shows using TF-IDF + spaCy. Created dashboards to monitor listening trends and peak-time traffic patterns.
Research Intern at Intel Labs
May 1, 2014 - October 22, 2025Researched transfer learning with CNNs for document classification and facial recognition tasks. Built a speech classification model using HMM-DNN hybrid architecture. Implemented a custom data pipeline to train models on internal video datasets using OpenCV and Theano. Co-authored technical reports and filed internal patent applications on model compression.
Senior AI/ML Engineer at Coupang
February 1, 2024 - November 6, 2025Built a multi-objective recommender system for Coupang’s homepage using hybrid filtering and intent modeling; led to 18% CTR boost. Fine-tuned open-source LLMs (LLaMA-2, Mistral) using product catalog data for a customer support chatbot in English. Created route optimization models for last-mile delivery using dynamic programming and travel-time forecasting. Designed anomaly detection on logistics scan data using deep autoencoders, reducing delivery errors by 22%. Deployed real-time inference APIs on AWS EKS with TorchServe, monitoring via Prometheus + Grafana.
Senior ML Engineer at Roku Inc
February 1, 2024 - February 1, 2024Developed predictive models for user retention using XGBoost on daily viewing data, improving churn prediction by 30%. Built NLP-based metadata enhancement pipelines (spaCy + custom tagging rules) to improve ad targeting on TV content. Led development of lookalike audience models for Roku’s DSP using deep autoencoders and K-means clustering. Created automated ML pipeline using GCP Vertex AI for training & deployment of ad ranking models. Partnered with product to design content-based recommendations for Roku Channel using VGG16-based vision features.
Senior Machine Learning Developer at Pinterest
May 1, 2022 - May 1, 2022Developed visual search models using ResNet50 and feature hashing to match pins with similar visual content. Improved home feed personalization using attention-based ranking models, leading to 22% increase in session duration. Built caption recommendation models using GPT-2 (via Hugging Face 2020 release) fine-tuned on pin metadata. Created automated pipelines in Databricks to process 2B+ daily events for feature engineering. Developed spam and NSFW detection models using vision transformers and content embeddings.
Senior Data Scientist at Pandora
October 1, 2020 - October 1, 2020Built collaborative filtering and matrix factorization models (ALS) for radio station personalization. Applied LSTM-based sequence modeling to predict next-song preference based on listening history. Designed genre classification using audio signal processing (Librosa) + early CNN models trained on spectrograms. Created churn models using survival analysis and logistic regression with listener demographics & behavior. Wrote production-grade Spark jobs to process 10B+ song plays/month on Hadoop clusters.
Data Scientist at Radio
December 1, 2015 - December 1, 2015Built real-time listener segmentation using K-means clustering on session-level data. Applied CNNs (via Caffe and early TensorFlow) on audio spectrograms for genre classification. Developed rule-based NLP models to tag live shows using TF-IDF + spaCy. Created dashboards to monitor listening trends and peak-time traffic patterns.
Research Intern at Intel Labs
May 1, 2014 - May 1, 2014Researched transfer learning with CNNs for document classification and facial recognition tasks. Built a speech classification model using HMM-DNN hybrid architecture. Implemented a custom data pipeline to train models on internal video datasets using OpenCV and Theano. Co-authored technical reports and filed internal patent applications on model compression.
Education
Master’s in Robotics and Machine Learning at Carnegie Mellon University
January 1, 2009 - January 1, 2011Master’s in Computer Science at Georgia Institute of Technology
January 1, 2007 - January 1, 2009Bachelor’s in Computer Science at Georgia Institute of Technology
January 1, 2003 - January 1, 2007Master of Science - Robotics & Machine Learning at Carnegie Mellon University
January 1, 2009 - January 1, 2011Master of Science - Computer Science at Georgia Institute of Technology
January 1, 2007 - January 1, 2009Bachelor of Science - Computer Science at Georgia Institute of Technology
January 1, 2003 - January 1, 2007Master of Science - Robotics & Machine Learning at Carnegie Mellon University
January 1, 2009 - January 1, 2011Master of Science - Computer Science at Georgia Institute of Technology
January 1, 2007 - January 1, 2009Bachelor of Science - Computer Science at Georgia Institute of Technology
January 1, 2003 - January 1, 2007Qualifications
Industry Experience
Software & Internet, Media & Entertainment, Retail, Professional Services
Skills
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in San Jose today.