Available to hire
Hi, I’m Milind Yadav, an experienced professional with expertise in Python, SQL, and cloud technologies. I enjoy applying machine learning, natural language processing, and computer vision to solve real-world problems and deliver impactful insights. I’m passionate about using data-driven approaches to optimize processes and drive innovation.
Currently, I’m working as a Software Engineering Intern focused on AI, where I develop domain-specific large language models and conversational AI systems. I am always eager to learn and contribute to exciting data science projects that create value in industrial and commercial environments.
Skills
Language
English
Advanced
Hindi
Advanced
Work Experience
Software Engineering Intern - AI at Brothers Drink Co LTD
June 1, 2025 - PresentDeveloping domain-specific LLM by fine-tuning Code Llama 7B using Azure ML on manufacturing shift data, targeting 95% query accuracy for natural language to SQL conversion. Engineering AI text-to-SQL system using transformer architecture and Azure Cognitive Services, eliminating manual query writing. Implementing RAG pipeline with Azure AI Search vector embeddings for automated knowledge updates, reducing model retraining overhead while maintaining accuracy. Optimizing neural network deployment using Azure GPU compute and model quantization, achieving sub-2-second inference times for hybrid cloud-on-premise LLM applications. Building conversational AI system integrating Azure OpenAI Services and Functions APIs, processing 200+ daily queries for scalable industrial LLM deployment.
Solution Engineer at Onit India Pvt LTD
August 31, 2024 - July 21, 2025Architected scalable ETL pipelines extracting real-time and historical data from Dun & Bradstreet and Onit ELM APIs, processing 10M+ records daily using Google Cloud Dataproc with PySpark. Developed automated data processing workflows transforming legal invoice and business data using PySpark and Spark SQL, loading into Cloud SQL (PostgreSQL) and BigQuery for analytics. Optimized database performance and data integrity implementing strategic partitioning, indexing, and schema validation systems, achieving 60% faster query response times while preventing data corruption. Implemented incremental loading and automated monitoring reducing processing time by 40% and deployment issues by 80% through intelligent data synchronization across DEV, UAT, and PROD environments. Designed dual-storage architecture and governance protocols integrating processed data for transactional queries and analytical reporting, ensuring schema compatibility and supporting business intelligence initiatives.
Data Analyst Intern at NsembleAi
March 31, 2022 - July 21, 2025Developed computer vision models for construction site safety compliance using supervised learning algorithms on annotated image datasets, achieving 15% improvement in PPE detection accuracy. Engineered comprehensive training dataset through manual annotation of 5,000+ construction site images, implementing data labeling workflows and quality assurance protocols. Applied machine learning frameworks including YOLO, SVM, Random Forest, and OpenCV for real-time object detection and image processing, creating automated safety detection systems. Delivered data-driven recommendations through statistical analysis of image and metadata patterns, enabling proactive safety measures and regulatory compliance improvements.
Software Engineering Intern - AI at Brothers Drink Co LTD
June 1, 2025 - PresentDeveloped a domain-specific large language model by fine-tuning Code Llama 7B on manufacturing shift data using Azure ML, targeting 95% query accuracy for natural language to SQL conversion. Engineered an AI text-to-SQL system using transformer architecture and Azure Cognitive Services, eliminating manual query writing for real-time shop floor data access. Implemented RAG pipeline with Azure AI Search vector embeddings for automated knowledge updates, reducing model retraining overhead by 100% while maintaining accuracy. Optimized neural network deployment using Azure GPU compute and model quantization, achieving sub-2-second inference times for hybrid cloud-on-premise LLM applications. Built a conversational AI system integrating Azure OpenAI Services and Functions APIs, processing over 200 daily queries to enable scalable industrial LLM deployment.
Solution Engineer at Onit India Pvt LTD
August 31, 2024 - July 21, 2025Architected scalable ETL pipelines extracting real-time and historical data from Dun & Bradstreet and Onit ELM APIs, processing over 10 million records daily using Google Cloud Dataproc with PySpark. Developed automated data processing workflows transforming legal invoice and business data using PySpark and Spark SQL, loading into Cloud SQL (PostgreSQL) and BigQuery for analytics. Optimized database performance and data integrity by implementing strategic partitioning, indexing, and schema validation systems, achieving 60% faster query response while preventing data corruption. Implemented incremental loading and automated monitoring, reducing processing time by 40% and deployment issues by 80% through intelligent data synchronization across DEV, UAT, and PROD environments. Designed dual-storage architecture and governance protocols integrating processed data for transactional queries and analytical reporting, supporting business intelligence initiatives.
Data Analyst Intern at NsembleAi
March 31, 2022 - July 21, 2025Developed computer vision models for construction site safety compliance using supervised learning on annotated image datasets, improving PPE detection accuracy by 15% for helmets, boots, and safety gear. Engineered comprehensive training datasets through manual annotation of over 5,000 construction site images, managing data labeling workflows and quality assurance protocols for object detection model development. Applied machine learning frameworks including YOLO, SVM, Random Forest, and OpenCV for real-time object detection and image processing, creating automated safety detection systems for industrial environments. Delivered data-driven recommendations through statistical analysis of image and metadata patterns, enabling proactive safety measures and regulatory compliance improvements.
Education
Bachelor of Engineering at Priyadarshini College of Engineering
January 1, 2019 - July 31, 2022Bachelor of Engineering at Priyadarshini College of Engineering
June 1, 2019 - July 31, 2022Qualifications
AWS Academy Cloud Foundations – Amazon Web Services
January 1, 2024 - December 31, 2024Python Programming – EDYODA
March 1, 2023 - March 31, 2023AWS Academy Cloud Foundations – Amazon Web Services
January 1, 2024 - December 31, 2024Python Programming – EDYODA
March 1, 2023 - March 31, 2023Industry Experience
Manufacturing, Software & Internet, Financial Services, Professional Services, Real Estate & Construction
Skills
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Bristol today.