Robin K

Data Analyst, Data Scientist, AI Engineer, +3





Available to hire

I am a Senior Data Engineer with over 8 years of experience designing and optimizing large-scale data pipelines, ETL workflows, and analytics platforms. I build real-time and batch data solutions using Apache Spark, Kafka, Airflow, and AWS Glue.

I am proficient in data modeling, warehousing, and performance tuning with Snowflake, Redshift, and Hive. I have strong cloud experience across AWS, Azure, and GCP, enabling scalable and cost-efficient data solutions. I thrive in collaborative teams delivering business-driven, high-impact data architectures and analytics.

Language

English

Fluent

Work Experience

Senior Data Engineer at Theom

January 1, 2023 - Present

Architected and deployed high-performance real-time data pipelines using Kafka and Spark Structured Streaming, reducing data latency by 85% and enabling faster decision-making. Designed and automated large-scale ETL workflows with AWS Glue and Airflow, processing over 5 TB of data daily across multiple business domains. Developed and optimized end-to-end enterprise data warehouse models in Snowflake, improving query efficiency and accelerating the delivery of analytics and insights. Collaborated with data science and BI teams to operationalize predictive models and build KPI dashboards, supporting data-driven strategies across departments. Implemented a data quality framework and CI/CD automation, improving pipeline reliability and auditability and reducing manual intervention by 40%.

Data Engineer at Coalesce

May 1, 2016 - December 1, 2022

Engineered and maintained scalable batch and streaming data pipelines using Apache Spark, Hive, and Amazon Redshift, enabling faster analytics delivery. Migrated legacy ETL workflows from on-premises infrastructure to AWS (Glue, S3, Redshift), reducing infrastructure costs by 30% and improving scalability. Implemented data lineage tracking with Apache Atlas and version control with Git, enhancing data transparency and governance across teams. Tuned and optimized SQL and Spark jobs, improving ETL performance by up to 60% and ensuring high throughput for production pipelines. Partnered with business analysts and BI teams to automate data extraction, transformation, and visualization in Power BI, streamlining reporting workflows.