Available to hire
I am a Senior Data Engineer with over 5 years of experience designing and optimizing scalable data pipelines and modernizing legacy systems.
I specialize in Snowflake and cloud platforms to deliver high-quality, reliable data solutions. I love turning complex data challenges into measurable improvements in processing time and data quality, and I enjoy collaborating with cross-functional teams to drive business outcomes.
Skills
Language
Javanese
Advanced
Afar
Intermediate
Work Experience
Senior Data Engineer (Azure) at Fifth Third Bank
December 1, 2023 - October 31, 2025
Led the migration of legacy banking systems to a cloud-native Azure Data Lakehouse, designing and deploying 10+ ETL pipelines using Azure Data Factory, Databricks, and Informatica PowerCenter. Optimized Delta Lake storage and Spark SQL transformations to accelerate processing of regulatory data by 40%, and established data validation and automated quality monitoring with PySpark and Informatica Data Quality. Collaborated with data science teams to deploy credit risk and fraud detection models within Databricks for real-time scoring, and built self-service reporting with Power BI for loan portfolio performance and customer segmentation. Reduced infrastructure costs by 25% through cluster tuning and autoscaling, and introduced GenAI-based tools using OpenAI APIs to auto-summarize compliance reports.
AWS Data Engineer at Progressive Corporation
November 30, 2023 - November 30, 2023
Built a cloud-based claims data lake on AWS, integrating 12 upstream systems through Glue, Lambda, and Lake Formation. Implemented Kafka and Kinesis for near real-time event-driven architectures, enabling timely tracking of insurance claims and payments. Collaborated with data scientists to operationalize ML models (fraud detection, risk scoring) using SageMaker, and exposed outputs via RESTful APIs using Flask and Lambda. Strengthened data governance with Glue Data Catalog and enforced data lineage and access control for HIPAA compliance. Optimized Redshift with advanced partitioning and materialized views, reducing report times by 60%. Automated infrastructure provisioning with Terraform and CloudFormation, and set up proactive monitoring with CloudWatch and Prometheus. Documented architectures and ETL/MLOps pipelines to streamline audits.
Data Engineer (GCP & BI) at IBM INDIA & KYNDRYL INDIA PVT. LTD
September 30, 2022 - September 30, 2022
Migrated a large-scale telecom data warehouse to GCP BigQuery, engineering ELT pipelines using Dataflow, Dataproc (Spark), and Pub/Sub to process 5TB+ of daily CDR data. Integrated Informatica MDM (Customer 360) with GCP Data Catalog to maintain a unified customer master and reduce duplication by 85%. Developed real-time fraud detection and network performance monitoring using Apache Beam and BigQuery; partnered with ML engineers to deploy churn prediction and anomaly detection models on Vertex AI. Optimized BigQuery queries and costs via partitioning, clustering, and caching. Automated pipeline orchestration with Cloud Composer (Airflow) and built executive dashboards with Looker/Tableau. Implemented GenAI-enhanced tools to summarize network health reports and established comprehensive monitoring with Google Operations (Stackdriver).
Data Engineer at Micro Labs Limited
August 31, 2020 - August 31, 2020
Designed ETL workflows to consolidate market feeds, trading platforms, and internal databases into a centralized data warehouse. Used API, batch, and real-time streaming (Kafka) ingestion to streamline data loading and reduce integration errors. Built optimized data models and indexing for faster reporting, and created interactive dashboards in Tableau, Power BI, and Looker. Implemented monitoring and documentation to ensure data governance and rapid issue resolution.
Senior Data Engineer (Azure) at Fifth Third Bank
December 1, 2023 - November 6, 2025
Migrated legacy banking systems to a cloud-native Azure Data Lakehouse by designing and deploying 10+ efficient ETL pipelines using ADF, Databricks, and Informatica PowerCenter, aligning with scalable data pipeline best practices. Optimized Delta Lake storage and Spark SQL transformations, improving processing speed of large-scale regulatory and compliance datasets by 40% and ensuring data quality and performance. Integrated Azure Synapse Analytics with Informatica MDM (Customer 360) to establish a robust data integration workflow that maintained high-quality and governed customer data across business units. Developed data validation frameworks and automated data quality monitoring with PySpark and Informatica Data Quality, achieving 99.9% accuracy in production to support reliable analytics and reporting. Collaborated with cross-functional data science teams to deploy credit risk and fraud detection models within Databricks, translating business requirements into dependable data solut
AWS Data Engineer at Progressive Corporation
November 1, 2023 - November 1, 2023
Built a cloud-based claims data lake on AWS by integrating 12 upstream systems through Glue, Lambda, and Lake Formation, following scalable data pipeline design principles. Implemented Kafka and Kinesis for event-driven architectures, enabling near real-time tracking of insurance claims and payment updates while ensuring robust data flow monitoring and alerting. Collaborated with data scientists to operationalize machine learning models (fraud detection, risk scoring) using SageMaker, boosting model deployment speed by 30% and strengthening agile team practices. Developed RESTful risk scoring APIs using Flask and Lambda, supporting reliable data flows and integration with customer-facing applications within a CI/CD framework. Enhanced data governance by integrating AWS Glue Data Catalog and enforcing data lineage and access controls for HIPAA compliance, underlining commitment to security, privacy, and compliance standards. Optimized Redshift clusters and query performance with advance
Data Engineer (GCP & BI) at IBM India Pvt. Ltd
September 1, 2022 - September 1, 2022
Migrated a large-scale telecom data warehouse to GCP BigQuery, engineering ELT pipelines using Dataflow, Dataproc (Spark), and Pub/Sub to process 5TB+ of daily call detail records (CDRs). Integrated Informatica MDM (Customer 360) with GCP Data Catalog to maintain a unified customer master record, enhancing data integrity and reducing duplication by 85%. Developed real-time fraud detection and network performance monitoring using Apache Beam and BigQuery, reducing detection time from hours to minutes. Partnered with ML engineers to deploy churn prediction and anomaly detection models using Vertex AI, improving customer retention by 18%. Optimized BigQuery queries and cost strategies through partitioning, clustering, and caching, lowering monthly spend by 30%. Automated pipeline orchestration with Cloud Composer (Airflow), improving pipeline reliability and operational efficiency. Built executive dashboards with Looker and Tableau, visualizing telecom KPIs including usage, churn, and ser
Data Engineer at Micro Labs Limited
August 1, 2020 - August 1, 2020
Designed and implemented ETL workflows to integrate market feeds, trading platforms, and internal databases into a centralized data warehouse, improving data availability and consistency. Employed APIs, batch processing, and real-time streaming using Apache Kafka and traditional schedulers to streamline data ingestion and reduce integration errors. Developed optimized data models and indexing strategies within relational databases to enhance query performance and reporting efficiency. Built and maintained interactive dashboards using Tableau, Power BI, and Looker to deliver actionable insights to business stakeholders. Implemented monitoring and alerting using open-source tools and scripting, enabling proactive issue resolution and minimizing ETL pipeline downtime. Documented ETL pipelines, data mappings, and quality checks to facilitate maintenance and ensure data governance standards were met.
Education
Master's in Computer Science at University of Central Missouri
January 11, 2030 - October 31, 2025
Master's in Computer Science at University of Central Missouri
January 11, 2030 - November 6, 2025
Qualifications
Microsoft Certified: Azure Data Engineer Associate
January 11, 2030 - October 31, 2025
AWS Certified Data Engineer – Associate
January 11, 2030 - October 31, 2025
SnowPro Core Certification
January 11, 2030 - October 31, 2025
Microsoft Certified: Azure Data Engineer Associate
January 11, 2030 - November 6, 2025
AWS Certified Data Engineer - Associate
January 11, 2030 - November 6, 2025
SnowPro Core Certification
January 11, 2030 - November 6, 2025
Industry Experience
Financial Services, Telecommunications, Software & Internet, Professional Services