I am Amit Almagor, a Lead Data & AI Platform Engineer with 6+ years of experience designing large-scale data infrastructure and distributed processing systems on AWS, GCP, and Azure. I specialize in metadata-driven ETL/ELT pipelines, data lake architecture, data quality and governance frameworks, and scalable data platforms for AI/ML workloads. I have a proven track record of cutting pipeline runtimes by up to 80% and delivering production-grade data systems that handle multimodal datasets. I enjoy collaborating across backend, data engineering, and product teams to translate business requirements into robust, observable platforms. In my roles, I lead across the full delivery lifecycle—from backend services in Go and distributed processing to cloud-native pipelines and data governance. I am comfortable working across multiple clouds (AWS, GCP, Azure) and with modern lakehouse tooling (Delta Lake, Iceberg) to build scalable AI-enabled analytics platforms.

Amit Almagor

I am Amit Almagor, a Lead Data & AI Platform Engineer with 6+ years of experience designing large-scale data infrastructure and distributed processing systems on AWS, GCP, and Azure. I specialize in metadata-driven ETL/ELT pipelines, data lake architecture, data quality and governance frameworks, and scalable data platforms for AI/ML workloads. I have a proven track record of cutting pipeline runtimes by up to 80% and delivering production-grade data systems that handle multimodal datasets. I enjoy collaborating across backend, data engineering, and product teams to translate business requirements into robust, observable platforms. In my roles, I lead across the full delivery lifecycle—from backend services in Go and distributed processing to cloud-native pipelines and data governance. I am comfortable working across multiple clouds (AWS, GCP, Azure) and with modern lakehouse tooling (Delta Lake, Iceberg) to build scalable AI-enabled analytics platforms.

Available to hire

I am Amit Almagor, a Lead Data & AI Platform Engineer with 6+ years of experience designing large-scale data infrastructure and distributed processing systems on AWS, GCP, and Azure. I specialize in metadata-driven ETL/ELT pipelines, data lake architecture, data quality and governance frameworks, and scalable data platforms for AI/ML workloads. I have a proven track record of cutting pipeline runtimes by up to 80% and delivering production-grade data systems that handle multimodal datasets. I enjoy collaborating across backend, data engineering, and product teams to translate business requirements into robust, observable platforms.

In my roles, I lead across the full delivery lifecycle—from backend services in Go and distributed processing to cloud-native pipelines and data governance. I am comfortable working across multiple clouds (AWS, GCP, Azure) and with modern lakehouse tooling (Delta Lake, Iceberg) to build scalable AI-enabled analytics platforms.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

Hebrew (modern)
Fluent
English
Fluent
Japanese
Advanced

Work Experience

Contract Data Engineer at Lemon.io (Engineering Platform) / Watchful Technologies
February 1, 2026 - Present
Engaged through Lemon.io as an hourly contract engineer on data platform projects for venture-backed startup clients, focusing on data infrastructure for fast-scaling SaaS and platform companies. Separately engaged with a global insurance/financial services client building Azure-based analytics infrastructure: designed and implemented data ingestion and transformation pipelines on Azure Data Lake Storage Gen2, Azure Data Factory (ADF), and Azure Synapse Analytics. Built distributed Spark workloads on Synapse Spark pools for large-scale batch processing of structured and semi-structured datasets, with schema-on-read patterns over ADLS. Implemented ETL/ELT pipelines, data modeling, and partitioning strategies in ADLS to support downstream analytics and reporting workloads.
Associate Consultant — Data Engineering at Infosys Limited
June 1, 2022 - March 1, 2024
Designed, operated, and optimized large-scale data pipelines on Google Cloud (BigQuery, Dataflow, Datastream, Dataproc), supporting both real-time streaming and batch analytics workloads processing millions of records daily. Led a 14-person technical team covering BigQuery, Dataproc, Dataflow, Data Fusion, and Composer, driving faster resolution times and improved customer satisfaction. Built and maintained streaming data pipelines with Dataflow and Datastream for low-latency analytics and large-scale ETL, processing structured and semi-structured data at scale. Supported Dataform-based analytics workflows for enterprise customers, ensuring data pipeline reliability, correctness, and maintainability across production environments. Contributed to hybrid and multi-cloud architectures spanning GCP, AWS, and Azure, improving scalability and reducing vendor lock-in for distributed data services. Collaborated with Google engineers to identify and resolve global-level product bugs affecting e
Data Analyst at Rakuten Group, Inc.
August 1, 2021 - June 1, 2022
Designed and automated Python-based ETL pipelines integrating with Workday LMS, cutting manual enrollment workload by 70% and improving data accuracy. Built and deployed data pipelines on GCP (BigQuery, DOMO) to process penetration test results, accelerating security reporting cycles from weeks to days. Partnered with engineering and security teams to migrate on-premises databases to BigQuery and Azure SQL, reducing infrastructure cost and improving scalability. Mentored 20+ employees in Tableau and SQL best practices, increasing data self-service adoption and team-wide data literacy.
IT, Data & Web Development — Various Companies at Israel (Earlier Roles)
January 1, 2004 - December 31, 2017
Web development and IT roles across Israel, including migrating 20+ university websites to OpenScholar, building PHP modules, and modernizing research tools; IT Department Manager overseeing 10 admins and 200+ workstations, building ETL pipelines for IBM System i migration; Data Analyst work on SQL Server infrastructure.

Education

M.A.S. Information, Technology and Society in Asia at University of Tokyo
September 1, 2019 - August 31, 2021
University of Tokyo — Fully funded by Monbukagakusho (MEXT) Scholarship
Research Student — Urban Planning & Accessibility at University of Tokyo
April 1, 2018 - September 1, 2019
B.A. Anthropology, Sociology, Asian Studies at Heberw University Of Jerusalem
October 1, 2013 - June 1, 2017

Qualifications

Top Performance Rating
January 1, 2024 - June 20, 2026
Most Valuable Employee — Infosys
January 1, 2023 - June 20, 2026

Industry Experience

Financial Services, Software & Internet, Telecommunications, Professional Services