Arslan Minhas

Available to hire

Hi, I’m Arslan Minhas — a Senior AI Engineer with 8+ years of building production machine learning systems, from traditional ML and deep learning to modern GenAI architectures. I specialize in multi-agent RAG platforms, text-to-SQL engines, and computer vision, with a track record of delivering enterprise AI applications from concept to production. I thrive on turning complex data problems into scalable solutions across healthcare, finance, logistics, and retail, with a strong focus on responsible and HIPAA-compliant deployments.

I’m passionate about LLM fine-tuning (LoRA/QLoRA), distributed training optimization, and edge AI solutions. I enjoy mentoring teams, shaping MLOps practices, and creating impactful AI systems that balance performance, privacy, and business value.

Experience Level

Expert (15 entries)
Intermediate (2 entries)

Language

English
Fluent

Work Experience

Senior AI Engineer at Technumen
July 1, 2022 - Present
Remote
Built multi-agent RAG clinical decision support system using Llama 3 70B with LoRA adapters; integrated 20M+ medical records and UMLS knowledge graphs; achieved 94% diagnostic accuracy and reduced physician lookup time by 65%.
Developed financial Text-to-SQL platform using fine-tuned CodeLlama handling derivatives, portfolio analytics, and risk metrics; processes 50K+ queries daily with sub-second latency.
Implemented CLIP-based multimodal visual search across 5M+ retail products; optimized inference with TensorRT, achieving <80 ms latency and increasing conversion by 40%.
Designed PPO-based reinforcement learning system for last-mile route optimization using graph neural networks; reduced delivery times by 30% across 10K daily routes.
Deployed YOLOv5 + SAM defect detection across 15 production lines; achieved 97% mAP at 60 FPS with NVIDIA Jetson edge deployment.
Automated 75% of insurance claims using GPT-4 fine-tuning with RLHF and fraud detection via gr

Senior Machine Learning Engineer at Marketfuel
January 1, 2019 - June 1, 2022
Remote
Built real-time e-commerce recommendation engine using two-tower neural networks and GRU4Rec session modeling; scaled to 20M users and improved CTR by 35%.
Implemented hierarchical demand forecasting models (N-BEATS, TFT) capturing seasonality and promotions; reduced stockouts by 45% while optimizing inventory costs.
Deployed multi-armed bandits for explore/exploit optimization in pricing and personalization systems.
Engineered streaming fraud detection pipeline processing 100M+ transactions daily using an ensemble of isolation forests, autoencoders, and graph neural networks; achieved 0.02% false positive rate, preventing significant financial losses.
Developed dynamic pricing engine using contextual bandits with competitor price monitoring and elasticity modeling; increased gross margins by 18% through a statistically rigorous A/B testing framework.
Built customer service chatbot using BERT variants with retrieval-augmented generation for intent classificat

Machine Learning Engineer at FoodBAM
November 1, 2017 - December 1, 2019
New York, NY
Built restaurant analytics platform with LSTM-based forecasting for order volume, prep time, and ingredient usage; reduced food waste by 35%.
Developed menu optimization system using collaborative filtering and price elasticity modeling; increased average order value by 22%.
Created delivery time prediction pipeline combining gradient boosting and deep learning; achieved 85% accuracy within a 5-minute window.
Implemented CNN-based food recognition and quality assessment deployed via Core ML; reduced complaints by 40%.
Built NLP sentiment analysis dashboard with aspect extraction and topic modeling; improved ratings by 0.5 stars.
Architected real-time anomaly detection system for kitchen operations using streaming data from IoT sensors and POS systems; reduced service disruptions by 28% through proactive intervention alerts.

Education

Master of Science at Metropolitan College of New York
January 1, 2022

Qualifications

AWS Certified Machine Learning
January 1, 2019 - June 1, 2022
Google Cloud Professional ML Engineer
Deep Learning Specialization (deeplearning.ai)
NVIDIA Deep Learning Institute
MLOps Specialization (Coursera)

Industry Experience

Media & Entertainment, Software & Internet, Professional Services, Education, Other, Healthcare, Financial Services, Retail, Transportation & Logistics, Manufacturing