I'm a data engineer and cloud solutions architect focused on building scalable, automated data ecosystems that power analytics and ML. I specialize in dbt, PySpark, Airflow, Terraform, and AWS Glue, and I enjoy turning fragmented data into unified platforms that drive business impact. I lead end-to-end ELT solutions, optimize cloud costs, and mentor teams to build self-service data capabilities. I thrive on delivering reliable data pipelines with measurable improvements in performance, reliability, and speed.

Gaurav Khurana

I'm a data engineer and cloud solutions architect focused on building scalable, automated data ecosystems that power analytics and ML. I specialize in dbt, PySpark, Airflow, Terraform, and AWS Glue, and I enjoy turning fragmented data into unified platforms that drive business impact. I lead end-to-end ELT solutions, optimize cloud costs, and mentor teams to build self-service data capabilities. I thrive on delivering reliable data pipelines with measurable improvements in performance, reliability, and speed.

Available to hire

I’m a data engineer and cloud solutions architect focused on building scalable, automated data ecosystems that power analytics and ML. I specialize in dbt, PySpark, Airflow, Terraform, and AWS Glue, and I enjoy turning fragmented data into unified platforms that drive business impact.

I lead end-to-end ELT solutions, optimize cloud costs, and mentor teams to build self-service data capabilities. I thrive on delivering reliable data pipelines with measurable improvements in performance, reliability, and speed.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent
Hindi
Fluent
German
Advanced
French
Beginner

Work Experience

Data Engineer at The StepStone Group GmbH
May 1, 2024 - Present
Designed and developed a decoupled ETL module 'Data Bridges' for cross-cluster data movement using Python and dbt; migrated legacy Azure VM pipelines to an AWS-native stack; optimized pipelines by consolidating Redshift dbt transformations into Spark jobs; deployed ML model pipelines for email re-targeting across 5 marketplaces; automated Terraform-based infrastructure provisioning and dbt freshness checks with Slack alerts; improved data reliability and onboarding.
Working Student – Data Engineering at SellerX GmbH
September 1, 2022 - April 1, 2024
Engineered Python-based REST API data extraction workflows orchestrated with Apache Airflow and KubernetesPodOperators; designed and deployed 30+ SQL data models in Snowflake using dbt; built CI/CD pipelines via GitHub Actions; developed and maintained 15+ Power BI dashboards.
Consultant – Technology (Data & Analytics) at Ernst & Young (EY)
November 1, 2021 - April 1, 2022
Provided data & analytics consulting, supported data engineering initiatives and analytics projects in Mumbai; collaborated with client teams to implement robust data pipelines and analytics solutions.
Technical Consultant (Data Science & Engineering) at Yantra Inc.
June 1, 2020 - November 1, 2021
Worked on data science and engineering engagements, developing data pipelines and analytics solutions for clients in Mumbai.

Education

Master's in Big Data and Business Analytics at SRH Hochschule Heidelberg
April 1, 2022 - March 1, 2024
Bachelor's in Computer Engineering at NMIMS University, Mumbai, India
August 1, 2015 - May 1, 2019

Qualifications

Tableau Desktop Specialist
January 11, 2030 - January 7, 2026
SAS Visual Analytics 1 for SAS Viya: Basics (8.5)
January 11, 2030 - January 7, 2026
SAS Visual Analytics 2 for SAS Viya: Advanced (8.5)
January 11, 2030 - January 7, 2026

Industry Experience

Software & Internet, Professional Services, Education