Available to hire
I’m a Machine Learning Engineer with 5+ years of experience building and deploying large-scale ML systems for payments, fraud detection, and high-performance AI platforms. I specialize in end-to-end ML pipelines, real-time inference services, and GPU-accelerated training workflows.
I collaborate with cross-functional teams to translate business needs into production-ready AI solutions, optimize models for accuracy and latency, and scale deployments to millions of users. I enjoy hands-on experimentation with PyTorch, TensorFlow, Kubernetes, and MLOps best practices.
Skills
Experience Level
Language
English
Fluent
Work Experience
AI/ML Engineer at Nvidia
November 1, 2024 - PresentDesigned and implemented end-to-end ML training and inference pipelines optimized for GPU-accelerated environments, supporting large-scale deep learning workloads across multiple teams. Built and optimized distributed training workflows using PyTorch, CUDA, and NCCL, improving multi-GPU training efficiency and reducing model training time by 30%+. Developed high-performance data preprocessing and feature pipelines using Python, Spark, and custom CUDA kernels, enabling TB-scale dataset processing. Collaborated with research scientists to productionize state-of-the-art transformer- and vision-based models, ensuring reproducibility and scalability. Integrated models into real-time and batch inference systems using Triton Inference Server and TensorRT, achieving low-latency inference. Optimized model performance through mixed-precision training, model parallelism, and quantization, reducing GPU memory usage by 20–25%. Built and maintained MLOps workflows for experiment tracking, model ve
Machine Learning Engineer at Accenture
April 1, 2020 - November 1, 2023Developed and deployed machine learning models supporting fraud detection, risk scoring, and user personalization across Paytm’s payments platform. Built end-to-end ML pipelines for data ingestion, feature engineering, training, and inference on billions of transaction records. Implemented supervised and unsupervised models, improving fraud detection accuracy by 28% while reducing false positives by 20%. Designed real-time inference services handling millions of transactions per day with low latency and high reliability. Optimized feature stores and data pipelines to reduce model inference latency by 30% under peak traffic conditions. Collaborated with product, risk, and data engineering teams to translate business requirements into deployable ML solutions. Implemented model monitoring, drift detection, and automated retraining, improving long-term model stability and performance. Deployed ML models using Docker and Kubernetes, enabling scalable and fault-tolerant production inferenc
Education
Masters of Science in information technology and management at Webster University
January 11, 2030 - December 1, 2025Qualifications
Industry Experience
Financial Services, Software & Internet, Professional Services
Skills
Experience Level
Hire a AI Engineer
We have the best ai engineer experts on Twine. Hire a ai engineer in San Francisco today.