I am Alessio Cesaretti, a data engineer and AI enthusiast focused on building scalable data platforms and AI-powered products. I have hands-on experience with RAG pipelines, LLMs, ETL on Azure, Spark-based data processing, and migrating large data warehouses to the cloud. I enjoy solving complex data problems and turning them into practical, value-driven solutions. I thrive in cross-functional teams, continuously learning new AI and data engineering techniques to deliver reliable, maintainable systems and impactful insights for the business.

Alessio Cesaretti

I am Alessio Cesaretti, a data engineer and AI enthusiast focused on building scalable data platforms and AI-powered products. I have hands-on experience with RAG pipelines, LLMs, ETL on Azure, Spark-based data processing, and migrating large data warehouses to the cloud. I enjoy solving complex data problems and turning them into practical, value-driven solutions. I thrive in cross-functional teams, continuously learning new AI and data engineering techniques to deliver reliable, maintainable systems and impactful insights for the business.

Available to hire

I am Alessio Cesaretti, a data engineer and AI enthusiast focused on building scalable data platforms and AI-powered products. I have hands-on experience with RAG pipelines, LLMs, ETL on Azure, Spark-based data processing, and migrating large data warehouses to the cloud. I enjoy solving complex data problems and turning them into practical, value-driven solutions.

I thrive in cross-functional teams, continuously learning new AI and data engineering techniques to deliver reliable, maintainable systems and impactful insights for the business.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more

Language

English
Fluent
Italian
Fluent
French
Intermediate
Portuguese
Intermediate

Work Experience

Data Engineer (Contract) at Isabel Group
January 1, 2025 - October 7, 2025
Led development of AI-powered data products, including Retrieval Augmented Generation (RAG) pipelines and fine-tuned LLMs for entity extraction, classification, and other NLP tasks. Designed and implemented ETL pipelines using Azure Data Factory, Databricks, Apache Spark, and Python. Migrated a 100+ TB enterprise Data Warehouse from on-prem to Azure Cloud, improving analytics query performance by 45%. Proposed and deployed a cost-efficient architecture, replacing Azure Synapse with direct Power BI-Databricks connectivity, reducing cloud spend and maintenance overhead. Built metadata-driven, automated data pipelines with CI/CD, testing, and monitoring, achieving a failure rate below 5%.
Data Engineer at Lineas
January 1, 2020 - October 7, 2025
Contributed to the implementation of a company-wide Data Warehouse to support operational and analytical reporting. Developed and maintained ETL pipelines using Apache Spark and Scala, ensuring reliable and timely data ingestion from multiple sources. Built end-to-end data processing workflows, improving data availability and enabling more consistent business insights. Collaborated with senior engineers on data modeling and integration strategies.
Software Engineer at HCL Technologies
January 1, 2019 - October 7, 2025
Contributed to the development and maintenance of IBM BigFix, a large-scale systems management platform for desktop, mobile, and server environments. Implemented high-performance C++ features, improved existing codebase robustness, and resolved critical software defects. Worked within Agile development teams to ensure timely delivery and adherence to quality standards.
Co-Founder, Full-Stack Developer at Parkidle
January 1, 2018 - October 7, 2025
Co-founded and developed Parkidle, an Android application enabling users to automatically share and find parking spots without manual interaction. Led full-stack development using Java (Android), Node.js, and cloud-hosted backend services. Designed geolocation and real-time availability features, improving parking search time for users in urban areas.
Data Engineer (Contract) at Isabel Group
January 1, 2020 - Present
Developed AI-powered data products, including Retrieval Augmented Generation (RAG) pipelines and fine-tuned Large Language Models (LLMs) for entity extraction, classification, and other NLP tasks. Designed and implemented ETL pipelines using Azure Data Factory, Databricks, Apache Spark, and Python. Migrated a 100+ TB enterprise Data Warehouse from on-prem to Azure Cloud, improving analytics query performance by 45%. Proposed and deployed a cost-efficient architecture, replacing Azure Synapse with direct Power BI–Databricks connectivity, reducing cloud spend and maintenance overhead. Built metadata-driven, automated data pipelines with a strong focus on CI/CD, testing, and monitoring, achieving a failure rate below 5%.
Data Engineer at Lineas
December 31, 2020 - October 7, 2025
Contributed to the implementation of a company-wide Data Warehouse to support operational and analytical reporting. Developed and maintained ETL pipelines using Apache Spark and Scala, ensuring reliable and timely data ingestion from multiple sources. Built end-to-end data processing workflows, improving data availability and enabling more consistent business insights. Collaborated with senior engineers on data modeling and integration strategies.
Software Engineer at HCL Technologies
December 31, 2019 - October 7, 2025
Contributed to the development and maintenance of IBM BigFix, a large-scale systems management platform for desktop, mobile, and server environments. Implemented high-performance C++ features, improved existing codebase robustness, and resolved critical software defects. Worked within Agile development teams to ensure timely delivery and adherence to quality standards.
Co-Founder, Full-Stack Developer at Parkidle
December 31, 2018 - October 7, 2025
Co-founded and developed Parkidle, an Android application enabling users to automatically share and find parking spots without manual interaction. Led full-stack development using Java (Android), Node.js, and cloud-hosted backend services. Designed geolocation and real-time availability features, improving parking search time for users in urban areas.

Education

Bachelor's Degree in Computer Engineering at Sapienza University of Rome
January 1, 2015 - January 1, 2018
Bachelor's Degree in Computer Engineering at Sapienza University of Rome
January 1, 2015 - January 1, 2018

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Professional Services, Computers & Electronics

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more