Federico Fiorio

Available to hire

I’m Federico Fiorio, a Data and AI Engineer with a strong foundation in software engineering and data engineering. I specialize in integrating large language models and machine learning to solve complex data challenges and to deploy scalable, cloud-based systems. I focus on delivering enterprise-grade data pipelines, open-source contributions, AI agents, AI-driven insights, and governance-friendly architectures.

I enjoy turning data into actionable business insights and collaborating with cross-functional teams to build reliable, scalable AI solutions that balance performance, governance, and business impact across complex environments.


Language

Italian: Fluent
English: Advanced
German: Beginner

Work Experience

Data and AI Engineer Consultant at Data Reply
May 1, 2024 - Present
Designed and led enterprise-scale, high-volume data ingestion pipelines for Prada on Databricks using the Medallion Architecture; optimized historical tracking with SCD Type 2, reducing query time by 80%. Refined a sanctions screening pipeline on Databricks from permutation-based matching to token sorting, reducing storage by 75% and speeding up execution. Engineered Databricks platform governance with Unity Catalog, Azure DevOps CI/CD, Terraform IaC, and automated data quality checks (DQx). Built an agentic workflow with Mosaic AI and MLflow that delivers real-time, data-driven insights to Prada store managers across European stores and reduces response latency. Migrated Airflow DAGs to Google Cloud Composer 2.x, improving scheduling efficiency by 10x. Implemented edge AI car bonnet detection with a fine-tuned YOLOv8 model on edge devices, automating safety protocols and customer compliance monitoring in car-wash operations.
Data Engineer Consultant at Management Solutions
October 1, 2023 - April 1, 2024
Optimized IBM DataStage ETL workflows and SQL for banking data warehouses. Enforced data governance policies across cross-functional teams.
Software Engineer Intern at Dilium
February 1, 2021 - May 1, 2021
Researched GANs for deepfake generation, with an emphasis on ethical considerations.

Education

MSc in Computer Science at University of Milan (Erasmus+ at University of Copenhagen)
BSc in Computer Science at University of Milan

Qualifications

Databricks Certified Data Engineer Professional
Databricks Certified Associate Developer for Apache Spark
Databricks Certified Data Engineer Associate
Databricks Certified Generative AI Engineer Associate
Neo4j Certified Professional
AWS Certified Machine Learning – Specialty
Google Professional Machine Learning Engineer

Industry Experience

Software & Internet, Retail, Professional Services, Media & Entertainment, Financial Services, Other