Abderrahmane Hajji

Available to hire

Data Engineer with 5+ years of hands-on experience building, maintaining, and modernizing data pipelines and analytics platforms across cloud environments. I work at the intersection of data engineering and analytics, turning raw, messy data into reliable, well-modeled datasets that teams can actually trust and use.

My experience spans ELT/ETL pipelines, data warehousing, orchestration, and cloud-native tooling, with a strong focus on reliability, performance, and automation. I enjoy improving legacy systems, simplifying complex workflows, and collaborating closely with analysts, engineers, and business stakeholders to deliver practical data solutions.

Curious by nature and continuously learning, I’m particularly interested in scalable data architectures, analytics engineering, and the growing overlap between data platforms and AI-driven use cases.

Experience Level

Expert

Language

English
Fluent
French
Advanced
Arabic
Fluent

Work Experience

Data Engineer Intern at Deep Echo
February 1, 2021 - June 1, 2021
Worked on segmentation of fetal anatomical structures in 2D ultrasound images. Established a workflow for anonymizing and pre-processing DICOM and text data. Implemented a cloud-based architecture on AWS for data storage and model deployment, and prepared data pipelines for ML workflows. Automated ETL processes across multiple datasets with Airflow and Azure Data Factory, reducing manual workload by over 50%. Used Python (Pandas) for data cleaning and built dashboards for data exploration. Tools: AWS, Airflow, Azure Data Factory, Pandas, Dash.
Data Engineer at Deep Echo
June 1, 2021 - June 1, 2022
Collected, cleaned, and analyzed more than 500 GB of structured and unstructured data. Built modeling pipelines and dashboards that supported analytics-driven decisions in finance, marketing, inventory, and customer operations. Implemented a cloud-based architecture on AWS covering storage through model deployment, and standardized ETL on Airflow and Azure Data Factory to reduce manual work and improve data quality. Developed ML tooling and APIs for inference.
Data Engineer at Outfittery GmbH
June 1, 2022 - December 1, 2025
Maintained and managed the entire data ecosystem independently for 6+ months, resolving operational and pipeline issues. Migrated legacy ETL from Talend to dbt, optimizing workflows for reliability and efficiency. Built and automated hourly and 24/7 pipelines (Airflow, dbt, Spark/Trino, AWS EMR/S3) across data sources including Zendesk, Personio, CrossEngage, and Snowplow. Ensured data quality, deduplication, and incremental processing for analytics-ready datasets. Enabled business insights for finance, marketing, inventory, and customer operations through reliable, scalable pipelines. Designed and built a REST API using Azure ML deployment services; monitored server performance and iterated on models.

Education

M.S. in Data Engineering at National Institute of Posts and Telecommunications
September 1, 2018 - July 1, 2021
Technology and Industrial Sciences (TSI) CPGE at Lycée Mohammedia
September 1, 2016 - June 1, 2018

Qualifications

Data Engineering Project (SQL, Python, Airflow)
Udemy Terraform for AWS
Google Cloud Fundamentals / Big Data & Machine Learning Fundamentals

Industry Experience

Software & Internet, Media & Entertainment, Telecommunications, Professional Services