I am a Master’s student in Data Science at the University of Milano-Bicocca with a strong technical foundation in applied mathematics, deep learning, and large-scale data processing. My work spans distributed learning, NLP, and LLM-based infrastructures, with a focus on building scalable, production-ready AI systems that integrate machine learning with robust backend and MLOps components. I am passionate about applied AI engineering and enjoy transforming complex technical concepts into practical, high-impact solutions—from image and signal processing models to full MLOps pipelines and agentic LLM architectures. I am proficient in Python, PyTorch, FastAPI, Docker, and cloud-native tooling, thriving in environments that value autonomy, clean architecture, and performance-driven execution.

ISMAIL AHOUARI

I am a Master’s student in Data Science at the University of Milano-Bicocca with a strong technical foundation in applied mathematics, deep learning, and large-scale data processing. My work spans distributed learning, NLP, and LLM-based infrastructures, with a focus on building scalable, production-ready AI systems that integrate machine learning with robust backend and MLOps components. I am passionate about applied AI engineering and enjoy transforming complex technical concepts into practical, high-impact solutions—from image and signal processing models to full MLOps pipelines and agentic LLM architectures. I am proficient in Python, PyTorch, FastAPI, Docker, and cloud-native tooling, thriving in environments that value autonomy, clean architecture, and performance-driven execution.

Available to hire

I am a Master’s student in Data Science at the University of Milano-Bicocca with a strong technical foundation in applied mathematics, deep learning, and large-scale data processing. My work spans distributed learning, NLP, and LLM-based infrastructures, with a focus on building scalable, production-ready AI systems that integrate machine learning with robust backend and MLOps components.

I am passionate about applied AI engineering and enjoy transforming complex technical concepts into practical, high-impact solutions—from image and signal processing models to full MLOps pipelines and agentic LLM architectures. I am proficient in Python, PyTorch, FastAPI, Docker, and cloud-native tooling, thriving in environments that value autonomy, clean architecture, and performance-driven execution.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

Arabic
Fluent
English
Fluent
French
Fluent
Italian
Advanced

Work Experience

Data Science Intern at Luxembourg Institute of Socio-Economic - LISER
April 30, 2025 - August 7, 2025
Developed a scalable multilingual semantic classification pipeline using pandas and Polars, improving efficient processing of large-scale text data for NLP tasks. Conducted data preprocessing including HTML content extraction with BeautifulSoup, text normalization, and deduplication using spaCy to enhance input quality for machine learning models. Integrated Stanza for language-specific sentence segmentation and modular multilingual NLP processing. Built a keyword extraction system using Sentence-Transformers with semantic similarity techniques for domain-specific AI indicator identification. Optimized pipeline throughput by using batch processing, data parallelism, and language-specific processing branches, enhancing model inference speed. Benchmarked pipeline performance against GPT-based models to inform model selection for practical AI deployment.
Data Science Intern at University of Luxembourg – C2DH
December 31, 2024 - August 7, 2025
Deployed a Wikibase-based knowledge graph within Docker containers to ensure isolated and reproducible environments. Automated data ingestion pipelines using Python and the Wikibase API for processing over 10,000 structured records with semantic annotations. Developed a modular relational data model managing RDF triples across 19 reusable properties. Implemented multilingual support with automated translation targeting Luxembourgish accessibility. Established data reconciliation workflows with OpenRefine for entity linking across platforms. Created SPARQL query interfaces enabling complex historical research queries combined with data visualization tools.
Web Dev Intern at Numidia Collection
February 28, 2022 - August 7, 2025
Developed and managed CMS platforms including WordPress and Prestashop. Built and optimized websites using HTML, CSS, and JavaScript, ensuring compliance with W3C standards. Handled backend development projects using Django and Flask frameworks. Implemented SEO strategies to enhance search engine rankings and improve website visibility.
Data Science Associate at Luxembourg Institute of Socio-Economic - LISER
January 1, 2025 - April 1, 2025
Implemented a scalable multilingual semantic classification pipeline using pandas and Polars for large-scale text NLP tasks. Built data preprocessing modules for HTML content extraction (BeautifulSoup), text normalization, and deduplication with spaCy to improve input quality for ML models. Integrated Stanza for language-specific sentence segmentation and developed modular components to support multilingual NLP processing across diverse corpora. Developed a keyword extraction system using Sentence-Transformers (Hugging Face) with semantic similarity techniques to identify AI-related indicators within domain-specific corpora. Benchmarked the semantic similarity pipeline against GPT-based models (OpenAI GPT, Mixtral) to assess performance accuracy and approach fidelity.
Web Dev Intern at Numidia Collection
November 1, 2021 - February 1, 2022
Developed and managed CMS platforms, including WordPress and Prestashop. Built and optimized websites using HTML, CSS, and JavaScript, ensuring compliance with W3C standards. Handled backend development for specific projects using Flask. Implemented SEO strategies to improve search engine rankings and enhance website visibility.

Education

Master Degree in Data Science at University of Milan - Bicocca
August 1, 2022 - July 1, 2025
Diploma in Web Technologies at EWA School
February 1, 2020 - December 1, 2021
Bachelor Degree in Applied Mathematics at University of Ibn Zohr
August 1, 2016 - November 1, 2020
Master Degree in Data Science at University of Milan - Bicocca
January 1, 2023 - October 1, 2025
Diploma in Web Technologies at EWA School
February 1, 2020 - December 1, 2021
Bachelor Degree in Applied Mathematics at University of Ibn Zohr
August 1, 2016 - November 1, 2020

Qualifications

AWS Certified Solutions Architect - Associate
January 11, 2030 - August 1, 2025
MLops Zoomcamp
January 11, 2030 - August 1, 2025
IBM Professional Data Scientist
September 1, 2022 - August 7, 2025
IBM Professional Data Analyst
March 1, 2022 - August 7, 2025
AWS Certified Solutions Architect - Associate
November 1, 2025 - December 21, 2025
MLops Zoomcamp
November 1, 2025 - December 21, 2025
IBM Professional Data Scientist & Data Analyst
September 1, 2022 - December 21, 2025

Industry Experience

Education, Professional Services, Software & Internet, Media & Entertainment, Other, Non-Profit Organization