I am a research-oriented technical consultant with 6+ years of experience in data engineering and data science. I specialize in building end-to-end ML and NLP systems, data engineering cloud solutions, and delivering scalable AI agents and data-driven solutions for Fortune 500 companies, retail giants, and capital-markets clients. I have led migrations of large-scale data warehouses to cloud platforms, engineered production ETL pipelines, and built quality frameworks to ensure data integrity. My work spans semantic search, time series forecasting, and real-time inference at production scale, with a focus on delivering measurable business value across retail, finance, and other domains. I am comfortable bridging business needs with technical execution and enjoy applying GEN AI solutions, AI agents, and back-end development to create impactful outcomes.

Rudresh Mehta

I am a research-oriented technical consultant with 6+ years of experience in data engineering and data science. I specialize in building end-to-end ML and NLP systems, data engineering cloud solutions, and delivering scalable AI agents and data-driven solutions for Fortune 500 companies, retail giants, and capital-markets clients. I have led migrations of large-scale data warehouses to cloud platforms, engineered production ETL pipelines, and built quality frameworks to ensure data integrity. My work spans semantic search, time series forecasting, and real-time inference at production scale, with a focus on delivering measurable business value across retail, finance, and other domains. I am comfortable bridging business needs with technical execution and enjoy applying GEN AI solutions, AI agents, and back-end development to create impactful outcomes.

Available to hire

I am a research-oriented technical consultant with 6+ years of experience in data engineering and data science. I specialize in building end-to-end ML and NLP systems, data engineering cloud solutions, and delivering scalable AI agents and data-driven solutions for Fortune 500 companies, retail giants, and capital-markets clients.

I have led migrations of large-scale data warehouses to cloud platforms, engineered production ETL pipelines, and built quality frameworks to ensure data integrity. My work spans semantic search, time series forecasting, and real-time inference at production scale, with a focus on delivering measurable business value across retail, finance, and other domains. I am comfortable bridging business needs with technical execution and enjoy applying GEN AI solutions, AI agents, and back-end development to create impactful outcomes.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Language

English
Fluent

Work Experience

Senior Technical Consultant - AI Data Engineer at MI9 Retail
October 1, 2022 - Present
Architected an AI-powered, product-expert solution based on Retrieval-Augmented Generation (RAG) leveraging Azure OpenAI and Pinecone. It processed 100K+ product queries daily, delivering natural language search capabilities that enhanced product discovery and generated over $3M in incremental revenue through semantic search with hybrid dense and sparse retrieval. Automated JIRA-based Windows server access provisioning using NLP-driven AI agents and self-hosted LLM/GPT-OSS on internal infrastructure, reducing manual effort. Built end-to-end production ETL pipelines with Apache Airflow orchestrating data workflows across AWS Redshift and Azure Synapse, processing 5TB+ daily financial transactions with 99.8% reliability, cutting ETL time by ~30% (8h to 5.5h). Designed data quality framework with DBT including line-level tests, checksum validation, and schema compatibility checks that reduced data integrity issues by 95% and prevented roughly $1.2M in downstream errors during migrations.
Big Data Engineer at Hitech I Solutions
April 30, 2021 - October 2, 2025
Led customer-centric data-driven solutions for Fortune 500 clients by integrating cutting-edge ML, DL, and NLP models with Azure cloud architecture to boost operational efficiency and informed decision-making. Built automated vendor pricing analysis system for FMCG manufacturers leveraging OCR and NLP, processing millions of flyer data points daily and saving $10M+ annually. Produced production-grade ETL pipelines in cloud environments processing 5GB+ data daily with 99.5% success using Azure Data Factory and AWS Glue. Deployed predictive time-series forecasting models for credit behavior, loan demand, and user trends, enabling more targeted marketing offers and improving cross-sell rates by 10%. Developed a KYC verification system with offline capabilities using CV & NLP, enhancing onboarding and reducing onboarding times by 40%. Orchestrated a Spark-based predictive modelling pipeline on Databricks and Azure, accelerating real-time model deployment by 50% and supporting client decisi
Data Scientist at HSBC Banking Innovation
April 30, 2021 - October 2, 2025
Developed real-time fraud detection systems using ensemble ML models processing 5M+ transactions daily with low latency; engineered features on transaction patterns, geo-location anomalies, and behavioral signals, achieving high detection accuracy while preventing losses exceeding $26M annually with minimal false positives. Built a credit risk scoring engine using gradient boosting and deep learning on a decade of historical loan data with hundreds of engineered features, improving default prediction. Architected a customer churn prediction platform using NLP sentiment analysis on support interactions and behavioral analytics, deploying optimized LSTM models on Azure ML to identify at-risk customers early and enable proactive retention campaigns saving over $3M annually. Implemented automated KYC validation combining CV for document verification and NLP for entity extraction to handle thousands of onboarding requests with improved straight-through processing.
Data Science Intern at Royal Technosoft
March 31, 2020 - October 2, 2025
Collaborated with BFSI leadership to design AI-driven research ops; contributed to real-time CCTV analytics with YOLO, Mask R-CNN, and custom object detection on 1M+ frames daily. Achieved 93% accuracy in predicting cargo no-shows through feature engineering. Led retrieval-augmented generation (RAG) to contract documents answering user queries from 5M embedded docs using Azure Search. Built an automated license plate recognition system using YOLOv4 and OCR achieving 98% accuracy in real-world conditions. Drove automation of complex document analysis workflows (invoicing, email summaries) and predictive financial forecasting, reducing manual workloads by ~60%.
Senior Technical Consultant - AI Data Engineer at MI9 Retail
October 1, 2022 - Present
Architected an AI-powered product-search system based on RAG using Azure OpenAI and Pinecone, processing 100K+ product queries daily. Delivered natural language search capabilities that increased product discovery and generated over $3M in additional revenue through semantic search with hybrid dense/sparse vector retrieval. Automated Windows server provisioning via NLP-powered AI agents and self-hosted LLMs, saving substantial manual effort and costs. Built production ETL pipelines across AWS Redshift and Azure Synapse with 5TB+ daily transactions and 99.8% success rate; reduced ETL processing time by 30% through parallel processing and optimized transformations. Implemented data quality frameworks with DBT ensuring data integrity and preventing downstream errors worth $1.2M. Led cloud migration of 15TB of legacy databases to Azure Synapse and Microsoft Fabric. Developed feature engineering pipelines on Microsoft Fabric (PySpark) handling 10M+ daily transactions, accelerating model dev
Big Data Engineer at Hitech I Solutions
April 1, 2021 - October 2, 2025
Orchestrated data-driven solutions for Fortune 500 clients by integrating ML, DL, and NLP models within Azure cloud architecture. Built automated vendor pricing analysis using OCR and NLP, processing millions of data points daily and saving $10M+ annually. Developed production-grade ETL pipelines in cloud environments with 99.5% success and 5 GB+ data daily using Azure Data Factory and AWS Glue. Deployed predictive time-series models for customer credit behavior and loan demand, enabling targeted marketing and improving cross-sell rates. Implemented a production-grade LLM-based document parser and an end-to-end KYC/IE workflow, accelerating onboarding and decision-making.
Data Scientist at HSBC Banking Innovation
April 1, 2021 - October 2, 2025
Implemented real-time fraud detection using ensemble ML models with engineered features from transaction patterns and geo-location signals, achieving high accuracy and preventing multi-million-dollar losses annually. Built credit risk scoring using gradient boosting and deep learning on a decade of historical data, improving default prediction. Developed a churn prediction platform leveraging NLP sentiment analysis and Azure ML for proactive retention, saving millions in customer acquisition costs. Created automated KYC validation using CV and NLP to streamline onboarding and reduce manual review.
Data Science Intern at Royal Technosoft
March 1, 2020 - October 2, 2025
Collaborated with senior leaders to deliver AI solutions across operations, including real-time CCTV analytics, RAG-based contract retrieval over 5M docs, and automated document processing workflows that reduced manual effort by 60%. Demonstrated strong cross-functional collaboration and contributed to several predictive and computer vision initiatives.

Education

Master in Big Data Analytics at Trent University
May 1, 2021 - September 1, 2022
Bachelor in Computer Science at Silver Oak College
August 1, 2015 - March 1, 2019
Master in Big Data Analytics at Trent University
May 1, 2021 - September 1, 2022
Bachelor in Computer Science at Silver Oak College
August 1, 2015 - March 1, 2019

Qualifications

Microsoft Azure AI Fundamentals (AI-900)
January 11, 2030 - October 2, 2025
Microsoft Azure Fundamentals (AZ-900)
January 11, 2030 - October 2, 2025
Microsoft Azure AI Engineer Associate (AI-102)
January 11, 2030 - October 2, 2025
HackerRank SQL & Python (5 Stars)
January 11, 2030 - October 2, 2025
Oracle Java EE & SE 6 Web Component Developer
January 11, 2030 - October 2, 2025
Blockchain Essentials (Coursera)
January 11, 2030 - October 2, 2025
Microsoft Azure AI Fundamentals (AI-900)
January 11, 2030 - October 2, 2025
Microsoft Azure Fundamentals (AZ-900)
January 11, 2030 - October 2, 2025
Microsoft Azure AI Engineer Associate (AI-102)
January 11, 2030 - October 2, 2025
HackerRank SQL & Python (5 Stars)
January 11, 2030 - October 2, 2025
Oracle Java EE & SE 6 Web Component Developer
January 11, 2030 - October 2, 2025
Blockchain Essentials (Coursera)
January 11, 2030 - October 2, 2025

Industry Experience

Retail, Financial Services, Consumer Goods, Professional Services, Other, Software & Internet