I am a results-driven Data Engineer with 4.5+ years of experience building high-volume, production-grade data platforms for BI and analytics. I specialize in AWS-native ETL, PySpark-based transformations, and Redshift data warehousing. I take end-to-end ownership across ingestion, orchestration, validation, and monitoring, delivering reliable, scalable data pipelines and collaborating with BI teams to enable real-time dashboards.

Swapnil Bhakare

I am a results-driven Data Engineer with 4.5+ years of experience building high-volume, production-grade data platforms for BI and analytics. I specialize in AWS-native ETL, PySpark-based transformations, and Redshift data warehousing. I take end-to-end ownership across ingestion, orchestration, validation, and monitoring, delivering reliable, scalable data pipelines and collaborating with BI teams to enable real-time dashboards.

Available to hire
See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Fluent
Hindi
Advanced
Marathi (Marāṭhī)
Fluent

Work Experience

Data Engineer at NEWGEN TECHNOMATE LLP
January 1, 2024 - Present
Designed and deployed fully serverless ETL pipelines using PySpark, AWS Glue, Step Functions, and Lambda for analytics workloads. Built scalable S3 → Glue → Redshift pipelines with optimized schema design and partitioning. Automated end-to-end workflow orchestration using Step Functions + Lambda, eliminating manual job triggers and reducing operational failures. Developed SQL + PySpark-based data validation and reconciliation framework to ensure upstream vs downstream data accuracy. Implemented CloudWatch logging, alerting for proactive pipeline failure detection monitoring. Collaborated closely with BI teams to deliver production-ready datasets for real-time dashboards.
Data Engineer at Tata Consultancy Services (TCS)
January 1, 2022 - December 1, 2023
Data Engineering: Key contributor to a large-scale on-prem to AWS cloud data modernization program. Built and supported production ETL pipelines using AWS Glue, S3, Redshift, Athena, and EMR for enterprise analytics platforms. Developed and optimized Athena-based validation queries for schema checks, completeness, and transformation integrity. Automated ETL orchestration using Lambda and Step Functions, removing dependency on manual scheduling. Managed IAM roles, fine-grained permissions, and CloudWatch monitoring for secured, auditable pipelines. Worked on PySpark-based transformation jobs to improve large-scale data ingestion performance and consistency.
Frontend Developer at Octrans Technologies
May 1, 2021 - December 1, 2021
Developed reusable React components and implemented basic state management. Integrated frontend screens with REST APIs and handled form validations. Fixed UI bugs, improved responsiveness, and followed Git workflows. Collaborated with backend teams to understand application data flow.

Education

B.C.A at Dr. Babasaheb Ambedkar Marathwada University
June 1, 2017 - April 1, 2020

Qualifications

B.C.A
June 1, 2017 - April 1, 2020

Industry Experience

Software & Internet, Professional Services, Computers & Electronics

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert