I am a Senior Data Engineer with over a decade of experience building cloud-native data platforms on AWS, specializing in scalable ETL/ELT pipelines, big data frameworks, and advanced analytics across healthcare, finance, and retail. I design data lakes and lakehouses on S3 using Iceberg and Delta Lake to enable schema evolution, ACID transactions, and high-performance analytics. I enjoy turning complex data into reliable, governed pipelines and partnering with data scientists and business stakeholders to deliver measurable outcomes.

Sreenivasa Reddy Cuddapah

I am a Senior Data Engineer with over a decade of experience building cloud-native data platforms on AWS, specializing in scalable ETL/ELT pipelines, big data frameworks, and advanced analytics across healthcare, finance, and retail. I design data lakes and lakehouses on S3 using Iceberg and Delta Lake to enable schema evolution, ACID transactions, and high-performance analytics. I enjoy turning complex data into reliable, governed pipelines and partnering with data scientists and business stakeholders to deliver measurable outcomes.

Available to hire

I am a Senior Data Engineer with over a decade of experience building cloud-native data platforms on AWS, specializing in scalable ETL/ELT pipelines, big data frameworks, and advanced analytics across healthcare, finance, and retail.

I design data lakes and lakehouses on S3 using Iceberg and Delta Lake to enable schema evolution, ACID transactions, and high-performance analytics. I enjoy turning complex data into reliable, governed pipelines and partnering with data scientists and business stakeholders to deliver measurable outcomes.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Work Experience

Senior Data Engineer at Johnson & Johnson
August 1, 2024 - Present
Designed and developed end-to-end ETL pipelines in AWS Glue and PySpark, integrating clinical trial, patient, and telemetry data into S3 with Bronze–Silver–Gold layering to ensure data consistency. Integrated SAP, Salesforce, and OKTA into AWS S3 using Glue, Lambda, and AppFlow, standardizing and curating data for Redshift and downstream analytics. Built near-real-time streaming dataflows using Apache Flink integrated with Kinesis Data Streams to process patient telemetry events with sub-second latency for clinical monitoring and analytics. Deployed and managed Spark and API-based workloads on Kubernetes (EKS) clusters to achieve high scalability, fault tolerance, and CI/CD-driven automation of data services. Orchestrated ELT pipelines in Airflow integrated with AWS Glue and Redshift, improving scheduling, retries, and dependency handling for mission-critical healthcare analytics workloads. Optimized pipelines with Airflow for performance tuning, SLA compliance, and thorough runboo
Senior Data Engineer at Wells Fargo
October 1, 2022 - July 1, 2024
Migrated complex PL/SQL ETL pipelines to AWS Glue, S3, and Redshift, redesigning workflows for scalability, faster runtimes, and audit-compliant data management across financial ecosystems. Integrated SAP, Salesforce, and OKTA into S3/Redshift using Glue, Lambda, and AppFlow. Designed streaming data pipelines with Kinesis, Apache Flink, and MSK to enable real-time fraud detection and risk scoring with SLA-driven performance. Implemented Apache Hudi on S3 for incremental data lake ingestion with CDC support and ACID capabilities. Built and optimized Redshift schemas, Spectrum external tables, and data marts for unified analytics. Deployed Spark/API workloads on Kubernetes with CI/CD. Implemented a financial data lake with Apache Iceberg on S3 for schema evolution and ACID transactions. Partnered with quant teams to operationalize predictive ML models in SageMaker, embedding inference in pipelines and BI dashboards. Automated workflows with Step Functions and Lambda to orchestrate Glue,
Data Engineer at New York Life Insurance
October 1, 2019 - February 1, 2022
Built enterprise data lakes on S3 using Apache Hudi to support incremental upserts, CDC ingestion, and ACID transactions across actuarial, claims, and policyholder data sources. Developed ETL pipelines in AWS Glue and PySpark on EMR, standardizing policyholder, claims, and actuarial data for reporting and modeling. Modeled insurance KPIs in Redshift with star schemas and workload management. Implemented real-time claims ingestion via SQS/SNS/Lambda for fraud detection and notifications. Partnered with data scientists to train predictive pricing and churn models in SageMaker, deploying inference endpoints and integrating results into BI dashboards. Automated orchestration with Step Functions. Performed Redshift SQL optimization and governance using Lake Formation/Glue Catalog for HIPAA/SOX alignment. Delivered QuickSight dashboards with ML-driven insights. Established data quality frameworks and Terraform/CodePipeline automation. Collaborated with actuarial/finance teams to translate re
Data Engineer at New Vision Software
June 1, 2017 - June 1, 2019
Built scalable batch and streaming pipelines using AWS Glue, EMR, and Kinesis, processing millions of daily retail transactions and event logs. Designed analytical marts in Redshift with star schemas and partitioning. Implemented streaming ingestion with MSK and Firehose for near real-time dashboards and anomaly detection. Collaborated with ML teams to deploy NLP-based recommendations in SageMaker. Established data governance with Lake Formation/Glue Catalog and automated orchestration with Step Functions. Created QuickSight dashboards with ML anomaly detection and optimized costs with S3 storage policies. Performed PySpark optimizations, and implemented CI/CD pipelines with CodePipeline/CodeBuild for Glue and EMR deployments. Secured pipelines with IAM/KMS/VPC and mentored teams on cloud-native architectures. Authored architecture diagrams and runbooks.

Education

Bachelor's in Computer Science at Presidency University
May 1, 2013 - May 1, 2017

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Financial Services, Retail, Professional Services, Software & Internet