I am a data engineer with 5+ years of experience designing, building, and optimizing real-time and batch ETL/ELT pipelines across AWS and hybrid cloud environments. I specialize in delivering scalable data lakes, data warehouses, and streaming architectures that empower analytics, AI, and ML workloads.\n\nI excel at Python-based orchestration (Airflow, Prefect) and data modeling, with a focus on observability (Datadog, Prometheus). I also drive data governance, quality initiatives, and CI/CD practices, while mentoring junior engineers to raise engineering standards and practices.

Suryatej J Akka

I am a data engineer with 5+ years of experience designing, building, and optimizing real-time and batch ETL/ELT pipelines across AWS and hybrid cloud environments. I specialize in delivering scalable data lakes, data warehouses, and streaming architectures that empower analytics, AI, and ML workloads.\n\nI excel at Python-based orchestration (Airflow, Prefect) and data modeling, with a focus on observability (Datadog, Prometheus). I also drive data governance, quality initiatives, and CI/CD practices, while mentoring junior engineers to raise engineering standards and practices.

Available to hire

I am a data engineer with 5+ years of experience designing, building, and optimizing real-time and batch ETL/ELT pipelines across AWS and hybrid cloud environments. I specialize in delivering scalable data lakes, data warehouses, and streaming architectures that empower analytics, AI, and ML workloads.\n\nI excel at Python-based orchestration (Airflow, Prefect) and data modeling, with a focus on observability (Datadog, Prometheus). I also drive data governance, quality initiatives, and CI/CD practices, while mentoring junior engineers to raise engineering standards and practices.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Advanced
Bashkir
Advanced

Work Experience

Data Engineer (Cloud & Data Architecture) at Wayfair
July 1, 2014 - October 31, 2025
Architected and maintained real-time and batch ETL pipelines using AWS Glue, Airflow, and PySpark, improving data latency and reliability by 40%. Designed and optimized AWS data lake and Redshift warehouse, achieving 30% cost reduction through performance tuning and tiered storage. Led the implementation of data observability and alerting using Datadog and CloudWatch; developed API integrations and event-driven ingestion pipelines to process streaming data from 10+ sources for analytics and machine learning workflows. Championed best practices for data governance, version control, and CI/CD, ensuring traceable, auditable deployments. Mentored junior engineers on pipeline design, automation, and coding standards, improving team efficiency by 25%.
Data Engineer (ETL/ELT & Cloud Migration) at HDFC
July 31, 2023 - July 31, 2023
Built automated ETL frameworks in Python and SQL, migrating critical financial datasets from on-prem systems to AWS-based Redshift data warehouse. Implemented data validation, error handling, and lineage documentation for all ETL jobs to ensure integrity and regulatory compliance. Partnered with analytics and reporting teams to define data models supporting dashboards and performance metrics. Established a reusable orchestration pattern for incremental data refresh, reducing manual intervention by 35%.
Python Developer / Data Engineer at Verizon
May 31, 2021 - May 31, 2021
Automated log ingestion and transformation pipelines in Python to support enterprise monitoring and analytics dashboards. Designed database-level performance tuning strategies, optimizing SQL queries and stored procedures. Supported production environments by maintaining high data availability and uptime for telecom systems. Collaborated with data analysts to integrate reporting APIs into business dashboards, reducing manual data refresh time.
Data Engineer (ETL/ELT & Cloud Migration) at HDFC
July 1, 2023 - July 1, 2023
Built automated ETL frameworks in Python and SQL, migrating critical financial datasets from on-prem systems to AWS-based Redshift data warehouse. Implemented data validation, error handling, and lineage documentation for all ETL jobs to ensure integrity and regulatory compliance. Partnered with analytics and reporting teams to define data models supporting dashboards and performance metrics. Established a reusable orchestration pattern for incremental data refresh, reducing manual intervention by 35%.
Python Developer / Data Engineer at Verizon
May 1, 2021 - May 1, 2021
Automated log ingestion and transformation pipelines in Python to support enterprise monitoring and analytics dashboards. Designed database-level performance tuning strategies, optimizing SQL queries and stored procedures. Supported production environments by maintaining high data availability and uptime for telecom systems. Collaborated with data analysts to integrate reporting APIs into business dashboards, reducing manual data refresh time.

Education

Bachelor of Science in Computers at Birla Institute of Technology and Science, India
January 1, 2018 - December 31, 2021
Master of Science in Technology Management at Lindsey Wilson College
January 1, 2023 - December 31, 2025
Bachelor of Science in Computers at Birla Institute of Technology and Science
January 1, 2018 - January 1, 2021
Master of Science in Technology Management at Lindsey Wilson College, Bowling Green, KY
January 1, 2023 - January 1, 2025

Qualifications

AWS Certified: Cloud Practitioner
January 11, 2030 - October 31, 2025
AWS Certified Cloud Practitioner
January 11, 2030 - October 31, 2025

Industry Experience

Software & Internet, Financial Services, Retail, Professional Services, Telecommunications