I'm a data engineer based in Paris with over 7 years of experience designing scalable data pipelines, integrating cloud platforms, and orchestrating complex data workflows. I specialize in Python, SQL, Spark, and Airflow, and I have hands-on experience with AWS, GCP, and Azure to deliver data platforms that unlock business value. I thrive in cross-functional teams, mentoring engineers, and collaborating with product owners and data scientists to design robust data models, ensure data quality, and automate end-to-end analytics. I am passionate about turning diverse data sources into trusted insights that drive measurable impact.

Yassine MKHININI

I'm a data engineer based in Paris with over 7 years of experience designing scalable data pipelines, integrating cloud platforms, and orchestrating complex data workflows. I specialize in Python, SQL, Spark, and Airflow, and I have hands-on experience with AWS, GCP, and Azure to deliver data platforms that unlock business value. I thrive in cross-functional teams, mentoring engineers, and collaborating with product owners and data scientists to design robust data models, ensure data quality, and automate end-to-end analytics. I am passionate about turning diverse data sources into trusted insights that drive measurable impact.

Available to hire

I’m a data engineer based in Paris with over 7 years of experience designing scalable data pipelines, integrating cloud platforms, and orchestrating complex data workflows. I specialize in Python, SQL, Spark, and Airflow, and I have hands-on experience with AWS, GCP, and Azure to deliver data platforms that unlock business value.

I thrive in cross-functional teams, mentoring engineers, and collaborating with product owners and data scientists to design robust data models, ensure data quality, and automate end-to-end analytics. I am passionate about turning diverse data sources into trusted insights that drive measurable impact.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Advanced

Work Experience

Data Engineer at DATA OFFICE - Groupe Etama
August 1, 2023 - Present
Led the creation of a Data Platform unifying SAP/BW4, GA4, SME and corporate data. Authored Terraform scripts to deploy infrastructure on AWS and Snowflake, created S3 data stores, and deployed Cloud Run services for serverless processing and data APIs. Optimized ECS/Cloud Run workloads, ingested GA4 data into S3, managed IAM roles, and developed Python automation for ETL tasks. Configured Airflow pipelines and DBT to orchestrate ETL workflows, integrated Snowflake as the data warehouse, implemented CI/CD with GitLab, and produced KPI dashboards. Collaborated with DevOps for deployment and infrastructure maintenance; set up monitoring and alerts for data pipelines.
Data Engineer at Bouygues Telecom
April 1, 2022 - August 1, 2023
In the Data Factory, ingested data from SAP BO, radio reference data, Lynx, and REST APIs; built large-scale data pipelines using Azure Data Factory, Spark/Scala on Databricks; automated data ingestion with Python; processed 5G-related data; optimized Spark jobs; established CI/CD with GitLab; maintained codebase and documentation; loaded data into Snowflake and created Power BI reports to monitor KPIs.
Data Engineer at Groupe Eiffage
January 1, 2019 - April 1, 2022
Contributed to the dematerialization of invoices project by building and maintaining ETL pipelines, ingesting data from API Rest, Oracle SQL, FTP, and other sources; implemented data quality controls, scheduled ETL processes with Jenkins, and supported supplier self-service initiatives via electronic invoicing platforms; collaborated with multiple stakeholders to ensure data accuracy and integration with ERP systems; reported KPIs via Power BI.
Data Scientist / Data Analyst (Stage + CDD) at Blent AI
January 1, 2017 - December 31, 2018
Engaged in data science and analytics projects including price prediction for rentals (Python, ML) and Big Data architecture design for Uber-like data applications (BigQuery, Spark Streaming/Batch). Developed Python tooling to streamline data deployment and integration with applications, supporting data-driven product decisions.

Education

Master of Science – Applied Mathematics & Statistics at Université de Besançon
January 1, 2017 - December 31, 2017
Master of Science – Applied Mathematics & Statistics at Université de Besançon
January 1, 2018 - December 31, 2018
Master de mathématiques appliquées à la finance at Université de Tunis El Manar
January 1, 2016 - December 31, 2016

Qualifications

Data Engineer Associate (Databricks)
January 1, 2025 - January 8, 2026
MLOps Practitioner (Dataiku)
January 1, 2025 - January 8, 2026
Developer Certificate (Dataiku)
January 1, 2025 - January 8, 2026
ML Practitioner (Dataiku)
January 1, 2025 - January 8, 2026
Generative AI Practitioner (Dataiku)
January 1, 2025 - January 8, 2026
ML Practitioner (Dataiku)
January 1, 2025 - January 8, 2026
Advanced Designer Core (Dataiku)
January 1, 2025 - January 8, 2026
Professional Data Engineer (Google)
January 1, 2024 - January 8, 2026
Databricks Lakehouse Fundamentals
January 1, 2023 - January 8, 2026
Spark with Scala – Udemy
January 1, 2023 - January 8, 2026
Developing a Big Data solution with Azure – LinkedIn Learning
January 1, 2023 - January 8, 2026
DBT Fundamentals
January 1, 2023 - January 8, 2026
Snowflake Fundamentals
January 1, 2023 - January 8, 2026
Azure DevOps Essentials
January 1, 2023 - January 8, 2026
Essentiel Apache PySpark
January 1, 2022 - January 8, 2026
Data Analyst with Tableau
January 1, 2022 - January 8, 2026
Blent AI – Data Scientist
January 1, 2021 - January 8, 2026
Blent AI – Data Engineer
January 1, 2020 - January 8, 2026
Coursera – Python for Data Science and IA (Certification)
January 1, 2020 - January 8, 2026

Industry Experience

Software & Internet, Telecommunications, Professional Services, Other