Hi, I’m Muzammal Ali, a data scientist and AI engineer based in Berlin with 3.5 years of experience building production-grade machine learning and AI systems. I specialize in LLM-based applications, scalable data pipelines, and cloud-native deployments on GCP and AWS. I have a proven track record of delivering end-to-end solutions that improve product accuracy, automation, and business decision-making. I enjoy collaborating with cross-functional teams to translate user needs into AI features, quantify impact, and maintain strong client relationships. I love turning complex data into actionable insights and building AI-driven tools that scale.

Muzammal Ali

Available to hire

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Language

English
Advanced

Work Experience

Data Scientist / AI Consultant at Helm X
December 1, 2025 - Present
Delivered AI and data science solutions, including LLM-based chatbots and data pipelines, significantly improving operational efficiency. Collaborated on product features, achieving measurable impact and maintaining strong client relationships across engagements.
Data Scientist at CarBee
August 1, 2024 - November 30, 2025
Working remotely, designed and deployed a production-grade Retrieval-Augmented Generation (RAG) system using live MongoDB data, significantly improving chatbot response accuracy and relevance. Built an AI-powered natural language search system that translates user queries into dynamic MongoDB queries for precise vehicle matching. Developed AI-generated vehicle descriptions, reducing manual content creation time by 40% and increasing user engagement. Designed a distributed asynchronous subtitle generation system using Celery and AWS SQS, enabling scalable multilingual processing. Engineered serverless data pipelines (MongoDB -> BigQuery, Firestore -> GA -> BigQuery) using GCP Cloud Functions for real-time analytics and dashboard reporting.
Research Assistant at National Dong Hwa University (NDHU)
October 1, 2024 - February 28, 2025
Developed a real-time theft detection system using YOLOv8, achieving 95% detection accuracy through multi-factor threat analysis. Automated analytics workflows using BigQuery and Apache Airflow, supporting batch and event-driven processing. Designed and maintained RESTful APIs using FastAPI and Django for backend integration. Implemented CI/CD pipelines, reducing deployment time and improving release reliability. Developed scalable ETL pipelines with PySpark and MapReduce to optimize data processing and analytics.
Data Scientist at iPlex
September 1, 2023 - August 31, 2024
Built a real-time unauthorized person detection system using YOLOv8 with 90%+ accuracy. Automated analytics workflows using BigQuery and Apache Airflow for batch and event-driven processing. Designed and maintained RESTful APIs using FastAPI and Django. Implemented CI/CD pipelines, reducing deployment time by 50% and improving release reliability. Engineered scalable ETL pipelines with PySpark and MapReduce for faster data processing.
Data Scientist at ML Sense
July 1, 2022 - September 30, 2023
Built an AI-powered customer support chatbot using LangChain, OpenAI, and vector databases, enabling semantic search across multiple document formats. Developed SubMagic, an automated video captioning system with grammar correction and emoji suggestions. Designed real-time analytics dashboards for telecom inventory data, enabling anomaly detection for fuel theft and inventory misuse. Architected scalable ETL pipelines using PySpark and MapReduce, reducing data processing time from 500 hours to 6 hours.
Machine Learning Engineer at NC AI
August 1, 2021 - May 31, 2022
Developed a real-time unauthorized person detection system with high accuracy, and built predictive models using decision trees and other ML techniques to support security applications.
Data Scientist / AI Engineer at Helm X
December 1, 2025 - Present
Delivered AI and data science solutions including LLM-based chatbots and data pipelines to improve operational efficiency. Collaborated on product features, achieved measurable impact, and maintained strong client relationships.
Data Scientist / Data Engineer at ML Sense
July 1, 2022 - September 1, 2023
Developed an AI-powered customer support chatbot using LangChain, OpenAI, and vector databases enabling semantic search across multiple document formats. Built SubMagic, an automated video captioning system with grammar corrections and emoji suggestions. Designed real-time analytics dashboards for Telnor inventory to detect fuel theft and inventory misuse. Engineered scalable ETL pipelines with PySpark and MapReduce, reducing data processing time.

Education

Bachelor of Science in Computer Engineering at University of Engineering and Technology, Peshawar
January 1, 2018 - December 31, 2022
Bachelor's degree at University of Engineering and Technology, Peshawar
January 1, 2018 - January 1, 2022

Qualifications

IBM Machine Learning Professional Certification
January 11, 2030 - January 5, 2026
CodeChef Top 50 Programmer — Competitive Programming
January 11, 2030 - January 5, 2026
IBM Machine Learning Professional Course
January 11, 2030 - February 9, 2026

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Education, Computers & Electronics