I am a Senior Data Engineer with over 8 years of experience designing and optimizing large-scale data pipelines, ETL workflows, and analytics platforms. I build real-time and batch data solutions using Apache Spark, Kafka, Airflow, and AWS Glue. I am proficient in data modeling, warehousing, and performance tuning with Snowflake, Redshift, and Hive, and I have strong cloud experience across AWS, Azure, and GCP, enabling scalable and cost-efficient data solutions. I thrive in collaborative teams delivering business-driven, high-impact data architectures and analytics.

Robin K

Available to hire

Experience Level

Expert (14 skills)
Intermediate (2 skills)

Language

English
Fluent

Work Experience

Senior Data Engineer at Theom
January 1, 2023 - Present
Architected and deployed high-performance real-time data pipelines using Kafka and Spark Structured Streaming, reducing data latency by 85% and enabling faster decision-making. Designed and automated large-scale ETL workflows with AWS Glue and Airflow, processing over 5 TB of data daily across multiple business domains. Developed and optimized end-to-end enterprise data warehouse models in Snowflake, improving query efficiency and accelerating the delivery of analytics and insights. Collaborated with data science and BI teams to operationalize predictive models and build KPI dashboards, supporting data-driven strategies across departments. Implemented a data quality framework and CI/CD automation, improving pipeline reliability and auditability and reducing manual intervention by 40%.
Data Engineer at Coalesce
May 1, 2016 - December 1, 2022
Engineered and maintained scalable batch and streaming data pipelines using Apache Spark, Hive, and Amazon Redshift, enabling faster analytics delivery. Migrated legacy ETL workflows from on-premises infrastructure to AWS (Glue, S3, Redshift), reducing infrastructure costs by 30% and improving scalability. Applied data lineage and version control through Apache Atlas and Git, enhancing data transparency and governance across teams. Tuned and optimized SQL and Spark jobs, improving ETL performance by up to 60% and ensuring high throughput for production pipelines. Partnered with business analysts and BI teams to automate data extraction, transformation, and visualization in Power BI, streamlining reporting workflows.

Qualifications

Bachelor's in Computer Science
January 1, 2012 - January 1, 2016

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Other