I am Ravi Kiran Pal, a data engineer with 10+ years of experience designing secure, scalable, and automated data systems across Healthcare, Finance, Automotive, and Manufacturing. I specialize in cloud platforms (AWS, GCP, Snowflake, Databricks) and am proficient in building end-to-end ETL/ELT pipelines for both batch and real-time processing. I am skilled in data modeling, building data lakes, and ensuring high data integrity and availability through governance and quality frameworks. I excel at modernizing legacy systems, optimizing performance, and achieving significant cost savings. I enjoy collaborating with cross-functional teams to design scalable data solutions and deliver impactful insights through BI dashboards. I also leverage GenAI tools for automation, optimization, and documentation.

Ravi Kiran Pal

Available to hire

Experience Level

Expert

Work Experience

Project Lead, Senior Data Engineer (Automobile Domain) at R Systems International Ltd.
December 1, 2024 - December 1, 2024
Led the migration of 75+ legacy ETL pipelines from Informatica and SQL Server to cloud-native PySpark and BigQuery, delivering 50x–100x data processing speed improvements and optimizing AWS infrastructure. Built 70+ modular pipelines processing up to 15B records with robust schema validation and deduplication; optimized S3-to-warehouse ingestion patterns using BigQuery and Snowflake to ensure 99.9% data accuracy. Implemented observability and SLA monitoring with CloudWatch, Glue Data Quality, and SNS/SQS, reducing incident response times by 30% and increasing pipeline reliability by 25%. Led data governance initiatives for traceability and data validation using AWS Lake Formation and Glue; designed a config-driven data quality framework enabling historical validation, reducing data issues by 40% and improving data quality by 35%.
Senior Data Engineer (Healthcare Domain) at HARMAN CONNECTED SERVICES
February 1, 2022 - February 1, 2022
Redesigned 50+ legacy Talend/Informatica workflows into AWS-native pipelines (Glue, Redshift, Airflow), achieving 20x–50x performance improvements and up to 20x cost reduction. Built a dynamic, reusable PySpark ETL framework with runtime configuration, enabling scalable rollout across 75+ client pipelines and enhanced debugging via embedded log/event tracking. Engineered batch and near real-time pipelines processing 250M–500M healthcare records per month to power BI dashboards, improving reporting accuracy by 30%. Enabled observability and SLA enforcement via CloudWatch, with alerting through SNS and near real-time event tracking using Kinesis and PySpark. Built production QuickSight and development Spotfire dashboards to transform large datasets into actionable insights.
Senior Data Engineer (Healthcare Domain) at PHROG APP LABS PVT. LTD
December 1, 2017 - December 1, 2017
Built 30+ scalable PySpark ETL pipelines transforming ad performance data into Hive fact/dimension models, enabling reporting and BI dashboards and contributing to 25% faster insights for marketing and clinical reports. Developed the OMOP Data Model to standardize data delivery across pharma clients, reducing data discrepancies by 35%.
Senior Data Engineer (Manufacturing Domain) at DCODE TECHNOLOGIES
May 1, 2017 - May 1, 2017
Designed and implemented IoT ingestion pipelines using Pentaho for data extraction and MongoDB for storage, enabling real-time analytics and improving anomaly detection speed by 40%. Built Spotfire dashboards to visualize KPI trends and detect early anomalies in manufacturing metrics, leading to a 20% improvement in process efficiency. Automated data ingestion and summary generation using Cron scheduling and SQL logic in Oracle/SQL Server, optimizing data processing workflows and reducing processing time by 30%.
Analyst (Financial Domain) at HCL
September 1, 2014 - September 1, 2014
Developed, troubleshot, and debugged ETL pipelines for Toyota EDW enhancements, improving the accuracy of historical analysis and business reporting by 25%. Implemented Slowly Changing Dimensions (SCD) Types 1/2/3 and Change Data Capture (CDC) logic to support both historical and incremental data processing, increasing data freshness by 30%.

Education

B.Tech at Uttar Pradesh Technical University
January 1, 2009 - January 1, 2013

Qualifications

AWS Certified Data Engineer – Associate
July 1, 2025 - July 1, 2028
Microsoft Certified: Power BI Data Analyst Associate
July 1, 2025 - July 1, 2026

Industry Experience

Healthcare, Manufacturing, Financial Services, Professional Services, Other