Hi, I'm Pradeep Kumar, a Data Scientist and AI/ML Engineer with over 8 years of experience in building end-to-end data-driven solutions. I specialize in machine learning, deep learning, NLP, computer vision, and large language models, and I enjoy working closely with cross-functional teams to create impactful AI applications aligned with business goals. I'm passionate about developing scalable, explainable, and secure AI/ML systems that solve real-world problems. I have hands-on experience deploying production-grade models using cloud platforms and MLOps tools, and I’m particularly excited about generative AI technologies including prompt engineering and retrieval-augmented generation. Outside of work, I love diving into new AI research and exploring ways to make AI-driven tools more accessible and effective for enterprises.

Pradeep Kumar

Hi, I'm Pradeep Kumar, a Data Scientist and AI/ML Engineer with over 8 years of experience in building end-to-end data-driven solutions. I specialize in machine learning, deep learning, NLP, computer vision, and large language models, and I enjoy working closely with cross-functional teams to create impactful AI applications aligned with business goals. I'm passionate about developing scalable, explainable, and secure AI/ML systems that solve real-world problems. I have hands-on experience deploying production-grade models using cloud platforms and MLOps tools, and I’m particularly excited about generative AI technologies including prompt engineering and retrieval-augmented generation. Outside of work, I love diving into new AI research and exploring ways to make AI-driven tools more accessible and effective for enterprises.

Available to hire

Hi, I’m Pradeep Kumar, a Data Scientist and AI/ML Engineer with over 8 years of experience in building end-to-end data-driven solutions. I specialize in machine learning, deep learning, NLP, computer vision, and large language models, and I enjoy working closely with cross-functional teams to create impactful AI applications aligned with business goals. I’m passionate about developing scalable, explainable, and secure AI/ML systems that solve real-world problems.

I have hands-on experience deploying production-grade models using cloud platforms and MLOps tools, and I’m particularly excited about generative AI technologies including prompt engineering and retrieval-augmented generation. Outside of work, I love diving into new AI research and exploring ways to make AI-driven tools more accessible and effective for enterprises.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more

Language

Afar
Advanced
Bashkir
Advanced
English
Fluent

Work Experience

Data Science and AI/ML at CVS Health
January 1, 2024 - Present
Built and maintained a real-time recommendation engine improving user engagement by 30%. Developed and fine-tuned deep neural networks for image and text classification. Deployed a YOLOv5-based computer vision pipeline with 98% precision for defect detection. Conducted extensive feature engineering and hyperparameter tuning, enhancing model F1-score by 15%. Analyzed diverse datasets leading to 20% cost reduction and created dashboards for real-time KPI and ML model monitoring. Collaborated with product managers, engineers, and stakeholders for aligning ML solutions with business objectives. Managed ML infrastructure using tools like Prometheus, Grafana, DVC, MLflow, and deployed containerized models on AWS Lambda. Developed pricing optimization models increasing average revenue per user by 10%. Delivered insights to executives influencing roadmap and budget. Ensured compliance with GDPR and HIPAA, executed fairness audits, and built self-supervised learning frameworks. Utilized STL, EM
AI/ML Engineer at MasterCard
December 31, 2023 - July 31, 2025
Implemented advanced hyperparameter tuning with Optuna and Ray Tune. Added explainability layers (SHAP, LIME) to black-box models. Conducted real-time model monitoring to detect drift and retrain models. Presented model insights and ROI forecasts to executives. Built anomaly detection systems for fraud prevention using Isolation Forests and Autoencoders. Designed time series forecasting models using ARIMA, Prophet, and TFT. Developed reinforcement learning agents for logistics optimization. Built NLP and computer vision deep learning models using TensorFlow and PyTorch. Fine-tuned Transformer models achieving high entity recognition accuracy. Created custom recommendation and search engines with FAISS and ElasticSearch embeddings. Automated ML workflows with MLflow, DVC, Airflow, and GitHub Actions. Established monitoring with Prometheus and Evidently AI. Scaled AI services on AWS, GCP, and Azure. Orchestrated Kubernetes-based Kubeflow pipelines and crafted robust large-scale data pipe
Data Science at DTCC
September 30, 2021 - July 31, 2025
Built NLP pipelines for topic modeling and classification using spaCy and Hugging Face. Conducted sentiment analysis influencing product feature prioritization. Performed advanced statistical analyses including ANOVA and regression. Managed ETL processes and standardized data formatting. Applied ensemble learning methods and predictive sales modeling techniques. Conducted exploratory data analysis and feature engineering for improved model performance. Extracted big data from diverse sources and utilized Hadoop HDFS. Created dashboards with Tableau and Power BI for KPI tracking. Collaborated across teams to deliver detailed analytical reports supporting business decisions.
Data Science at Sonata Software
April 30, 2019 - July 31, 2025
Designed and evaluated A/B and multivariate experiments to optimize marketing and UX. Applied causal inference to estimate business impact. Addressed class imbalance with SMOTE. Built classification models including Logistic Regression, Random Forest, SVM, and XGBoost. Conducted K-Fold cross-validation and feature engineering (PCA, normalization, encoding). Created end-to-end analytics and automation systems with R, Tableau, and Power BI. Analyzed customer churn and sales patterns using time series modeling. Built ETL pipelines and optimized data collection from multiple sources. Performed dimensionality reduction and exploratory data analysis to uncover patterns and trends.
Data Science and AI/ML at CVS Health
January 1, 2024 - Present
Built and maintained a real-time recommendation engine increasing user engagement by 30%. Trained and fine-tuned CNNs and RNNs for image and text classification using PyTorch and TensorFlow. Deployed a computer vision pipeline with YOLOv5 achieving 98% precision in real-time defect detection. Performed advanced feature engineering and tuning with GridSearchCV and Optuna, improving model F1-score by 15%. Conducted multi-source data analysis that contributed to a 20% operational cost reduction. Developed dynamic dashboards for real-time KPI and model monitoring. Collaborated with cross-functional teams to ensure ML solutions aligned with business goals. Built Retrieval-Augmented Generation (RAG) systems using FAISS/Weaviate and Transformers for knowledge retrieval. Automated data pipelines into cloud data warehouses. Established robust model monitoring with Prometheus and Grafana. Managed model lifecycle with DVC and MLflow and deployed containerized models on AWS Lambda handling 100K+ r
AI/ML Engineer at MasterCard
December 31, 2023 - July 31, 2025
Implemented hyperparameter tuning with Optuna and Ray Tune to optimize models efficiently. Added explainability layers (SHAP, LIME) for regulatory compliance and enhanced transparency. Conducted real-time model drift monitoring and automated retraining pipelines. Delivered insights and ROI forecasts influencing AI strategy for executives. Built anomaly detection systems using Isolation Forests and Autoencoders for fraud prevention. Designed time series demand forecasting models including ARIMA, Prophet, and Temporal Fusion Transformers. Developed reinforcement learning agents for logistics optimization. Built and fine-tuned deep learning NLP and vision models including BERT, RoBERTa, YOLOv5, and EfficientNet. Created custom search and recommendation systems using vector retrieval with FAISS and ElasticSearch. Automated ML workflows with MLflow, DVC, Airflow, and GitHub Actions. Deployed scalable AI services on AWS, GCP, and Azure, orchestrating Kubernetes clusters with Kubeflow. Develo
Data Science at DTCC
September 30, 2021 - July 31, 2025
Built NLP pipelines with spaCy, NLTK, and Hugging Face Transformers for topic modeling and document classification. Applied sentiment analysis on product reviews and social media data. Conducted statistical analysis including ANOVA, regression, and hypothesis testing using Python and Excel. Performed ETL and data preparation from diverse sources, transforming data into uniform formats. Applied ensemble learning techniques and selected optimal models through performance metrics. Explored sales trends and seasonality through visualization. Developed sales forecasting models with Linear Regression and ARIMA families. Ensured data quality and optimized collection procedures. Worked with various data formats and integrated data into visualization and ETL platforms. Built dashboards for marketing and operations teams to track KPIs. Collaborated with cross-functional teams to provide insightful analytical reports supporting business decisions.
Data Science at Sonata Software
April 30, 2019 - July 31, 2025
Designed and evaluated A/B and multivariate experiments for feature and marketing impact. Applied causal inference techniques to estimate business impact from observational data. Implemented and optimized classification models including Logistic Regression, Decision Trees, Random Forest, KNN, XGBoost, and SVM. Conducted K-fold cross-validation to avoid overfitting. Performed feature engineering such as PCA and feature normalization. Developed end-to-end data analytics and automation systems integrating R, Tableau, and Power BI visualizations. Analyzed time series data for customer behavior trends. Conducted comprehensive data mining, cleaning, and visualization. Managed ETL pipelines to gather and structure data from multiple sources. Explored variables correlation and dimensionality reduction to improve model accuracy. Built customer churn analysis and reporting tools for business use.

Education

Add your educational history here.

Qualifications

Add your qualifications or awards here.

Industry Experience

Financial Services, Healthcare, Manufacturing, Software & Internet, Transportation & Logistics, Retail

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more