Available to hire
I am a Senior Data Engineer with 7+ years of experience architecting and delivering large-scale data platforms, cloud migrations, and enterprise analytics solutions across Banking, Healthcare, and Retail. I specialize in building scalable, governed data pipelines and Lakehouse architectures using AWS, Databricks, Snowflake, and DBT to accelerate data-driven decision making.
I enjoy collaborating with risk, fraud, clinical informatics, and DevOps teams to translate domain needs into robust data designs, mentor engineers, and drive modernization from on-premise to cloud-native platforms with a focus on governance, privacy, and reliability.
Skills
See more
Language
Bashkir
Advanced
Javanese
Advanced
Work Experience
Senior AWS Data Engineer at First National Bank
September 1, 2022 - PresentLed end-to-end modernization of on-prem SSIS/Hadoop pipelines to a Databricks + Snowflake Lakehouse on AWS, enabling scalable ingestion, automated transformations, and governed analytics for core banking products. Built ETL/ELT pipelines with AWS Glue, PySpark, Spark SQL, Lambda, Step Functions, and DBT, ingesting data from Finacle, Fiserv channels, and Salesforce. Implemented CDC & SCD Type-2 using Snowflake Streams/Tasks, Delta Lake, and DBT for near real-time insights and regulatory reporting. Developed Python-based ETL framework for validation, error tracking, and auto-alerting via SNS and CloudWatch, improving observability. Integrated Kafka + Spark Structured Streaming for real-time fraud/ AML triggers. Migrated Hive/HBase/Sqoop/MapReduce workloads to AWS EMR + Glue with a S3 data lake. designed multi-zone S3 data lake with Lake Formation RBAC and KMS encryption; modeled Snowflake & Redshift schemas with dimensional models and materialized views. Implemented Terraform IaC and CI/
Data Engineer / Cloud Engineer at UnitedHealth Group
August 1, 2020 - August 1, 2022Built HIPAA-compliant ETL/ELT pipelines with AWS Glue, PySpark, Spark SQL, Lambda, and Step Functions to ingest EHR, claims, and provider data across Epic and Cerner ecosystems. Implemented CDC & SCD-2 pipelines using Snowflake Streams/Tasks, DBT, and Delta Lake to create longitudinal patient profiles and continuous eligibility insights. Developed scalable PySpark frameworks for data harmonization, PHI masking, encryption, and auditability. Created real-time ADT event pipelines using Kinesis and Spark Structured Streaming to trigger clinical notifications and outreach workflows. Standardized FHIR/HL7 mappings and DBT models for member segmentation and risk analytics. Automated data quality with Deequ and custom validators; managed analytical layers in Athena/Redshift for HEDIS, Stars, and claims analytics. Established S3 lake zoning, Glue Catalog tagging, and PHI governance with Lake Formation, private subnets, and VPC controls. Integrated SageMaker ML scoring pipelines for risk identi
Data Engineer / Cloud Engineer at Walmart Global Tech
July 1, 2018 - June 1, 2020Developed enterprise-scale Hadoop-based data pipelines for customer analytics, operational reporting, and fraud-risk data consumption across distributed environments. Built PySpark/Scala batch pipelines for large-volume transformations; engineered Spark jobs with modular transformations, partitioning, and checkpointing to support real-time and batch use cases. Implemented Oozie for job orchestration; managed HBase for high-throughput reference data. Built ingestion pipelines with Sqoop/Flume and Kafka/Spark Streaming for real-time events. Implemented data quality checks, data dictionaries, and runbooks to support governance. Orchestrated data ingestion to AWS EMR + S3 with IAM-based security, encryption, and VPC isolation. Migrated Hive/HBase/Sqoop/MapReduce workloads to modern lakehouse patterns, enabling faster analytics and governance. Partnered with BI teams to deliver pre-aggregated datasets for Tableau dashboards.
Education
Master’s degree in computer science at Texas A&M Corpus Christi
January 11, 2030 - January 7, 2026Qualifications
Industry Experience
Financial Services, Healthcare, Retail, Software & Internet
Skills
See more
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Pittsburgh today.