Hi, I’m Ojas Shukla—a results-driven Senior Data Engineer with 6+ years of experience designing and scaling cloud-native data ecosystems across AWS, GCP, Azure, and Snowflake. I specialize in ETL/ELT pipelines, streaming architectures, and analytics engineering using Python, dbt, and Airflow. I’m currently delivering enterprise-scale data observability, governance, and reporting pipelines at a global marketing and data technology company. I love building resilient data platforms that power analytics, compliance, and decision-making across complex, multi-tenant environments. I thrive in cross-functional teams, mentoring junior engineers on modular dbt design and Snowflake optimization. My focus is on data quality, automation, and scalable architectures that enable rapid insights while maintaining security and audit readiness across multi-cloud deployments.

Ojas Shukla

Available to hire

Experience Level

Expert

Language

English
Fluent
Hindi
Advanced
Marathi (Marāṭhī)
Advanced

Work Experience

Senior Data Engineer at Kinesso (IPG)
August 1, 2025 - Present
Architect and optimize large-scale Snowflake pipelines integrating multi-channel marketing and analytics data across diverse ecosystems. Lead development of Airflow-orchestrated ingestion and transformation workflows, automating report delivery for global campaigns. Establish data quality monitoring frameworks using dbt tests and automated row-count checks, reducing manual QA cycles by 70%. Optimize warehouse performance and cost through adaptive scaling and partition strategies, improving dashboard responsiveness by 65%. Coordinate with cross-functional analytics, BI, and platform teams to ensure compliance, audit readiness, and service-level reliability. Document, version, and standardize schema management and parameter configurations across environments for better reproducibility. Mentor junior engineers on dbt modularity, Git branching standards, and Snowflake optimization techniques.
Senior Data Engineer at Stomble / Freelance Projects
January 1, 2025 - August 1, 2025
Architected and deployed production-ready, cross-cloud data pipelines across Azure and GCP, supporting AI-driven semantic search, retrieval-augmented generation (RAG) workflows, and stakeholder analytics dashboards. Built Azure SQL, BigQuery, and Snowflake warehouse layers optimized for reliability and performance. Designed dbt pipelines with Great Expectations-based testing and schema version control, achieving 99.9% data accuracy. Automated CI/CD deployments using Terraform, Docker, and GitHub Actions, reducing release time by 60%. Published open-source reference projects showcasing RAG pipelines, streaming ETL, and governance automation.
Data Engineer at Aluminium Stewardship Initiative (ASI)
February 1, 2023 - January 1, 2025
Built an end-to-end data warehouse on GCP, integrating multiple heterogeneous data sources into a centralized database. Reduced ETL processing time by 40% by optimizing Python scripts using Pandas, DuckDB, BigQuery, Snowflake, and dbt. Built both batch and streaming data pipelines using Python, Apache Kafka, and GCP Pub/Sub, enabling real-time and scheduled data integration from multiple sources (APIs, databases, and flat files). Built portable data infrastructure compatible with both AWS and GCP environments using Terraform, Docker, and Kubernetes for vendor-agnostic deployment. Led mentorship programs, training junior engineers on data pipeline development, coding best practices, and cloud automation. Designed and deployed RESTful APIs using FastAPI for seamless integration between data pipelines and frontend applications, ensuring low-latency access to processed datasets. Automated infrastructure deployment with Terraform-based Infrastructure-as-Code, reducing manual interventions by 70%.
Data Engineer at University of Wollongong/Dementia Training Australia (DTA)
June 1, 2022 - January 1, 2023
Developed automated ELT workflows, reducing manual data handling by 80% and enabling real-time analytics. Migrated critical financial and operational data to BigQuery, improving accessibility and performance for stakeholders. Integrated Cloud Functions and Apache Airflow to orchestrate scheduled and event-driven transformations. Enhanced data modeling techniques to support complex analytics and forecasting. Supported clinical research phases by building agile, reproducible pipelines with timestamp tracking. Participated in cross-functional planning with IT, educators, and government liaisons for impact reporting. Improved dashboard performance by 35% by optimizing SQL queries for Tableau, Power BI, and Looker.
Data Specialist at HERE Technologies
February 1, 2019 - December 1, 2021
Developed and optimized Hadoop-based ETL pipelines, utilizing AWS services such as S3, EC2, Glue, and Lambda, along with Apache Kafka for real-time data streaming. Automated infrastructure management using Terraform and CI/CD pipelines to improve deployment efficiency. Implemented Docker and Kubernetes for scalable and resilient data processing, enhancing cloud-based workflows. Ensured compliance with industry regulations by strengthening data security and governance. Designed and optimized large-scale distributed data processing workflows using Apache Spark, improving data transformation speeds and efficiency. Developed Hive-based data warehousing solutions to enhance query performance and structured data accessibility across the organization. Managed HDFS-based storage, optimizing data retention and retrieval strategies to improve system efficiency. Integrated GIS datasets into Hadoop workflows to enable geospatial analytics and location-based decision-making. Leveraged Tableau and P

Education

Master of Computer Science at University of Wollongong
January 1, 2020 - January 1, 2022
Bachelor of Computer Science at Mumbai University
January 1, 2013 - January 1, 2017
Diploma in Computer Technology at Maharashtra State Board
January 1, 2010 - January 1, 2013

Qualifications

Google Cloud Certified – Professional Data Engineer (In Progress)
AWS Certified Data Engineer – Associate (Planned Q4 2025)

Industry Experience

Software & Internet, Professional Services, Education, Media & Entertainment, Healthcare