Available to hire
I am a data engineer with 9 years of experience delivering scalable data ingestion, batch processing, and real-time streaming pipelines across healthcare and financial data. I enjoy partnering with business teams to turn data into actionable insights and have built end-to-end ingestion frameworks, migrations from third-party tools to native AWS, and optimized pipelines for large daily data volumes.
In my work, I focus on building reliable, scalable data platforms, mentoring teammates, and continuously improving data quality and governance. Outside of work I enjoy learning about new AWS services and data technologies.
Work Experience
Data engineer at Clearsense
November 23, 2022 - November 29, 2024Data Engineer at Clearsense Inc
April 1, 2022 - November 24, 2025Real-time streaming application for financial data (One Checkout Platform); tech stack: Python, Spark, XML, JSON, AWS Glue, S3, Lambda, Redshift. Built OMOP models for health systems (GE, Medent, Epic). Batch processing of healthcare data for insurance and provider metrics; designed scalable batch processing using AWS Glue and Apache Spark to handle ~800k files per day.
Data Engineer at Diyotta ULC
March 1, 2022 - March 1, 2022On-prem to Redshift ingestion: migration from Diyotta to native AWS. Tech stack: Diyotta, S3, Lambda, Glue, Redshift, SNS. Developed a generic, scalable AWS ingestion framework for on-prem databases and file-to-Redshift pipelines; enabled migration from Diyotta and reduced licensing costs; framework reused by other units for onboarding Diyotta objects into Redshift. Incremental Ingestion pipeline for Employee Benefits Data (GB-scale daily/weekly) using Shell scripting, AWS CLI, S3, EMR, Glue, Redshift, PySpark. Improved pipeline execution time by ~50% and enhanced alerting.
Data Engineer at Diyotta India p.v.t ltd
December 1, 2019 - December 1, 2019Data Ingestion & Sync Process using Python, Hive, PySpark, Diyotta. Built data-sync with priority tagging (High/Medium/Low) based on criticality; implemented preemption logic. Designed REST API for retention of GA data to optimize cluster space. Collaborated with business users to define ingestion tables, PII handling, and data models for faster insertion/updation.
Data Engineer at Diyotta ULC
May 1, 2016 - May 1, 2016On-prem to Ingestion - Migration from Diyotta to Native AWS; Informatica PowerCenter 9.5.1, Teradata. Created detailed design specifications; analyzed user business and data models; documented requirements for traceability; participated in design specification QA with data architect and data integration designer; developed Informatica mappings, sessions and workflows as per requirements.
Education
Bachelor of Science at JNTU Kakinada
September 1, 2008 - April 1, 2012Qualifications
Industry Experience
Healthcare, Financial Services, Software & Internet, Professional Services
Hire a Database Developer
We have the best database developer experts on Twine. Hire a database developer in Toronto today.