I am a Machine Learning Engineer with over 6 years of experience creating robust and scalable machine learning pipelines and deep learning solutions across diverse industries. I specialize in designing end-to-end data workflows, large language model training and fine-tuning using TensorFlow, PyTorch, and Hugging Face, and optimizing pipelines on cloud platforms such as AWS and Snowflake. I excel at translating complex business challenges into impactful, production-ready AI systems. Throughout my career, I have developed expertise in MLOps, natural language processing, big data tools, and cloud-native infrastructure, consistently improving model accuracy, reducing latency, and automating deployment workflows. I am passionate about leveraging cutting-edge AI technologies and data engineering best practices to build scalable, efficient, and secure AI-driven applications that deliver measurable business value.

Praveen Kumar

I am a Machine Learning Engineer with over 6 years of experience creating robust and scalable machine learning pipelines and deep learning solutions across diverse industries. I specialize in designing end-to-end data workflows, large language model training and fine-tuning using TensorFlow, PyTorch, and Hugging Face, and optimizing pipelines on cloud platforms such as AWS and Snowflake. I excel at translating complex business challenges into impactful, production-ready AI systems. Throughout my career, I have developed expertise in MLOps, natural language processing, big data tools, and cloud-native infrastructure, consistently improving model accuracy, reducing latency, and automating deployment workflows. I am passionate about leveraging cutting-edge AI technologies and data engineering best practices to build scalable, efficient, and secure AI-driven applications that deliver measurable business value.

Available to hire

I am a Machine Learning Engineer with over 6 years of experience creating robust and scalable machine learning pipelines and deep learning solutions across diverse industries. I specialize in designing end-to-end data workflows, large language model training and fine-tuning using TensorFlow, PyTorch, and Hugging Face, and optimizing pipelines on cloud platforms such as AWS and Snowflake. I excel at translating complex business challenges into impactful, production-ready AI systems.

Throughout my career, I have developed expertise in MLOps, natural language processing, big data tools, and cloud-native infrastructure, consistently improving model accuracy, reducing latency, and automating deployment workflows. I am passionate about leveraging cutting-edge AI technologies and data engineering best practices to build scalable, efficient, and secure AI-driven applications that deliver measurable business value.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more

Work Experience

Machine Learning Engineer at eBay
May 1, 2024 - Present
Engineered scalable ML pipelines using PySpark and Airflow, improving product recommendation accuracy by 22%. Deployed PyTorch models with TorchServe, doubling API response speed and reducing cloud costs by 18%. Enhanced ETL and training data readiness by 3x using Pandas and Spark. Integrated spaCy NLP components to improve user insights by 60%. Reduced inference latency by 35% through containerized deployments on Kubernetes-managed GPU clusters. Accelerated training speed by 40% using PyTorch Lightning with mixed precision. Built real-time monitoring systems with Kafka and Spark Streaming, increasing anomaly detection coverage by 60%. Improved search relevancy by integrating FAISS vector search. Automated retraining workflows with MLflow and SageMaker, cutting downtime by 70%. Developed a RAG-based low-latency recommendation system, increasing upsell conversions by 15%. Modernized deployment pipelines, reducing infrastructure costs by 25%. Migrated analytics to AWS EKS and Terraform,
Machine Learning Engineer at TCS | Client: Evernorth
May 1, 2022 - August 27, 2025
Built scalable XGBoost-based fraud detection models identifying $3M in suspicious claims. Developed an LLM-powered Q&A system using LangChain and HuggingFace, boosting internal query resolution speed by 50%. Created real-time anomaly detection pipelines to mitigate $1M annual losses. Improved churn prediction accuracy by 22% with Random Forest and XGBoost ensembles. Automated training workflows via Airflow and MLflow, reducing workload by 40%. Enhanced ICD-10 coding accuracy by 27% with BERT-based NLP pipelines. Reduced HEDIS penalties by 19% using LightGBM on longitudinal patient data. Processed 10 TB weekly with Apache Spark and Synapse. Automated HIPAA-compliant model deployment with SageMaker. Boosted chronic disease detection by 21% using LSTM models. Enabled 100% SLA compliance by monitoring ML drift with Prometheus and Grafana. Facilitated synthetic patient data generation for robust experiments. Re-architected pipelines with Kubeflow and GitLab CI/CD, halving deployment time.
Machine Learning Engineer at TCS | Client: JP Morgan Chase
May 1, 2020 - August 27, 2025
Improved loan approval accuracy by 28% with Random Forest credit risk models. Reduced fraud detection latency from 2.3s to 0.8s by deploying quantized models on SageMaker. Accelerated foundation model training by 30% using PyTorch DDP and DeepSpeed. Strengthened trade surveillance with Kafka and PySpark streaming pipelines improving anomaly detection by 40%. Enhanced customer sentiment classification by 32% via spaCy and Transformer NLP models. Automated data pipelines using Snowflake, dbt, and Power BI, cutting reporting delays by 70%. Developed secure LLM-powered portfolio assistants with GPT-NeoX and Triton Inference Server. Improved audit compliance integrating Great Expectations in Airflow. Reduced analyst triaging time by 35% using RAG-based transaction summarizers. Built real-time loan default predictors mitigating $4.1M in risk. Increased model release frequency 4x with CI/CD pipelines using Jenkins and Terraform.
Machine Learning Engineer at eBay
May 1, 2024 - Present
At eBay, I engineered scalable ML pipelines using PySpark and Airflow for real-time clickstream data processing, improving product recommendation accuracy by 22%. I optimized PyTorch model serving with TorchServe, doubling API response speed and reducing cloud resource costs by 18%. I enhanced ETL processes and accelerated training cycles using Pandas and Spark. Integrated NLP capabilities leveraging spaCy for named entity recognition and topic modeling, boosting user insight extraction by 60%. Containerized models via Docker and Kubernetes to reduce inference latency by 35%. Implemented mixed precision training with PyTorch Lightning, speeding up training by 40% and improving accuracy. Built real-time monitoring and anomaly detection systems using Kafka and Spark Streaming, increasing detection coverage by 60%. Integrated FAISS vector search for improved product metadata lookup, raising search relevancy by 18%. Automated model retraining workflows with MLflow and SageMaker, cutting ma
Machine Learning Engineer at TCS | Client: Evernorth
May 1, 2022 - August 27, 2025
At Evernorth, I built an XGBoost fraud detection model identifying $3M in suspicious claims, enhancing financial risk management. Developed an LLM-powered Q&A system boosting query resolution by 50% using LangChain and Hugging Face. Created AI fraud detection pipelines using Isolation Forest and Autoencoder reducing annual financial loss by $1M. Improved customer churn prediction accuracy by 22% with SHAP features and ensemble models. Automated training workflows with Airflow and MLflow, reducing manual work by 40%. Used BERT for NLP pipelines that enhanced ICD-10 coding accuracy by 27%. Decreased HEDIS penalties by 19% employing LightGBM on longitudinal patient data. Processed 10TB patient data weekly with Apache Spark and Synapse. Automated HIPAA-compliant deployments with AWS SageMaker, reducing overhead by 60%. Improved early chronic disease detection by 21% via LSTM models. Implemented observability with Prometheus and Grafana to guarantee 100% SLA compliance. Designed synthetic d
Machine Learning Engineer at TCS | Client: JP Morgan Chase
May 1, 2020 - August 27, 2025
Developed Random Forest credit risk scoring model improving loan approval accuracy by 28%. Reduced fraud detection latency from 2.3s to 0.8s via model quantization and SageMaker endpoints. Accelerated foundation model training by 30% using PyTorch DDP and DeepSpeed distributed training on SageMaker. Engineered Kafka and PySpark streaming pipelines enhancing real-time anomaly detection by 40%. Built NLP sentiment classifier using spaCy and Transformers improving accuracy by 32%. Automated data pipelines with Snowflake, dbt, and Power BI to cut reporting delays by 70%. Designed secure LLM-powered assistant with GPT-NeoX and Triton Server for portfolio insights. Improved audit compliance integrating Great Expectations for data lineage. Developed Retrieval-Augmented Generation financial summarizer decreasing triage time by 35%. Built real-time loan default predictor with TabNet reducing default risk by $4.1M. Orchestrated CI/CD with Jenkins and Terraform increasing model release frequency

Education

Master of Science, Data Analytics at Webster University
January 11, 2030 - May 1, 2025
Bachelor of Technology, Mechanical Engineering at VNR VJIET
January 11, 2030 - May 1, 2018
Master of Science at Webster University
January 1, 2023 - May 1, 2025
Bachelor of Technology at VNR VJIET
January 1, 2014 - May 1, 2018

Qualifications

AWS Certified Machine Learning - Specialty
January 11, 2030 - August 27, 2025
Google Professional Machine Learning Engineer
January 11, 2030 - August 27, 2025
Microsoft Certified: Azure AI Engineer Associate
January 11, 2030 - August 27, 2025
AWS Certified Machine Learning - Specialty
January 11, 2030 - August 27, 2025
Google Professional Machine Learning Engineer
January 11, 2030 - August 27, 2025
Microsoft Certified: Azure AI Engineer Associate
January 11, 2030 - August 27, 2025

Industry Experience

Financial Services, Healthcare, Software & Internet, Retail, Professional Services