Available to hire
Hi, I’m Muzammal Ali, a data scientist and AI engineer based in Berlin with 3.5 years of experience building production-grade machine learning and AI systems. I specialize in LLM-based applications, scalable data pipelines, and cloud-native deployments on GCP and AWS. I have a proven track record of delivering end-to-end solutions that improve product accuracy, automation, and business decision-making.
I enjoy collaborating with cross-functional teams to translate user needs into AI features, quantify impact, and maintain strong client relationships. I love turning complex data into actionable insights and building AI-driven tools that scale.
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Language
English
Advanced
Work Experience
Data Scientist / AI Consultant at Helm X
December 1, 2025 - Present
Delivered AI and data science solutions, including LLM-based chatbots and data pipelines, significantly improving operational efficiency. Collaborated on product features, achieving measurable impact and maintaining strong client relationships across engagements.
Data Scientist at CarBee
August 1, 2024 - November 30, 2025
Remotely designed and deployed a production-grade Retrieval-Augmented Generation (RAG) system using live MongoDB data, significantly improving chatbot response accuracy and relevance. Built an AI-powered natural language search system that translates user queries into dynamic MongoDB queries for precise vehicle matching. Developed AI-generated vehicle descriptions, reducing manual content creation time by 40% and increasing user engagement. Designed a distributed asynchronous subtitle generation system using Celery and AWS SQS, enabling scalable multilingual processing. Engineered serverless data pipelines (MongoDB -> BigQuery, Firestore -> GA -> BigQuery) using GCP Cloud Functions for real-time analytics and dashboard reporting.
Research Assistant at National Dong Hwa University (NDHU)
October 1, 2024 - February 28, 2025
Built a real-time theft detection system using YOLOv8, achieving 95% detection accuracy through multi-factor threat analysis. Automated analytics workflows using BigQuery and Apache Airflow, supporting batch and event-driven processing. Designed and maintained RESTful APIs using FastAPI and Django for backend integration. Implemented CI/CD pipelines, reducing deployment time and improving release reliability. Developed scalable ETL pipelines with PySpark and MapReduce to optimize data processing and analytics.
Data Scientist at iPlex
September 1, 2023 - August 31, 2024
Built a real-time unauthorized person detection system using YOLOv8 with 90%+ accuracy. Automated analytics workflows using BigQuery and Apache Airflow for batch and event-driven processing. Designed and maintained RESTful APIs using FastAPI and Django. Implemented CI/CD pipelines, reducing deployment time by 50% and improving release reliability. Engineered scalable ETL pipelines with PySpark and MapReduce for faster data processing.
Data Scientist at ML Sense
July 1, 2022 - September 30, 2023
Built an AI-powered customer support chatbot using LangChain, OpenAI, and vector databases, enabling semantic search across multiple document formats. Developed SubMagic, an automated video captioning system with grammar correction and emoji suggestions. Designed real-time analytics dashboards for telecom inventory data, enabling anomaly detection for fuel theft and inventory misuse. Architected scalable ETL pipelines using PySpark and MapReduce, reducing data processing time from 500 hours to 6 hours.
Machine Learning Engineer at NC AI
August 1, 2021 - May 31, 2022
Developed a real-time unauthorized person detection system with high accuracy, and built predictive models using decision trees and other ML techniques to support security applications.
Data Scientist / AI Engineer at Helm X
December 1, 2025 - Present
Delivered AI and data science solutions including LLM-based chatbots and data pipelines to improve operational efficiency. Collaborated on product features, achieved measurable impact, and maintained strong client relationships.
Data Scientist / Data Engineer at ML Sense
July 1, 2022 - September 1, 2023
Developed an AI-powered customer support chatbot using LangChain, OpenAI, and vector databases, enabling semantic search across multiple document formats. Built SubMagic, an automated video captioning system with grammar corrections and emoji suggestions. Designed real-time analytics dashboards for Telnor inventory to detect fuel theft and inventory misuse. Engineered scalable ETL pipelines with PySpark and MapReduce, reducing data processing time.
Education
Bachelor of Science in Computer Engineering at University of Engineering and Technology, Peshawar
January 1, 2018 - December 31, 2022
Bachelor's degree at University of Engineering and Technology, Peshawar
January 1, 2018 - January 1, 2022
Qualifications
IBM Machine Learning Professional Certification
January 11, 2030 - January 5, 2026
CodeChef Top 50 Programmer (Competitive Programming)
January 11, 2030 - January 5, 2026
IBM Machine Learning Professional Course
January 11, 2030 - February 9, 2026
Industry Experience
Software & Internet, Professional Services, Media & Entertainment, Education, Computers & Electronics