I architect and deliver Azure-based data platforms and lakehouse architectures that turn massive TB-scale datasets into actionable insights across finance, marketing, and supply chain. I partner with stakeholders to define KPIs, design scalable pipelines, and cut processing times while reducing infrastructure costs. I champion automation, CI/CD, data governance, and cross-team collaboration to ensure reliable, compliant, and high-uptime data solutions for global clients.

Punith B S

I architect and deliver Azure-based data platforms and lakehouse architectures that turn massive TB-scale datasets into actionable insights across finance, marketing, and supply chain. I partner with stakeholders to define KPIs, design scalable pipelines, and cut processing times while reducing infrastructure costs. I champion automation, CI/CD, data governance, and cross-team collaboration to ensure reliable, compliant, and high-uptime data solutions for global clients.

Available to hire

I architect and deliver Azure-based data platforms and lakehouse architectures that turn massive TB-scale datasets into actionable insights across finance, marketing, and supply chain. I partner with stakeholders to define KPIs, design scalable pipelines, and cut processing times while reducing infrastructure costs. I champion automation, CI/CD, data governance, and cross-team collaboration to ensure reliable, compliant, and high-uptime data solutions for global clients.

See more

Experience Level

Expert
Expert
Expert
Expert

Language

English
Fluent

Work Experience

Azure Data Engineer at Coral
September 1, 2024 - November 5, 2025
Collaborated with stakeholders to gather data requirements, define KPIs, and build scalable data solutions using Azure Data Services, accelerating reporting turnaround time by 35% and strengthening cross-departmental alignment. Architected and executed greenfield projects using Databricks, Synapse, Microsoft Fabric, and Unity Catalog to enable real-time analytics and reduce data latency and operational delays by 40%. Leveraged PySpark to optimize large-scale ETL pipelines, cutting data processing times by 50% for CRM data from Dataverse and external APIs. Implemented automated ETL pipelines with Python and ADF to validate, test, and load data, improving data accuracy by 30%. Built metadata-driven ETL pipelines with Delta Lake, Purview, ADLS, and Python for scalable data movement across Databricks and ADF (50+ TB daily). Integrated Power BI dashboards with Delta tables to deliver near real-time CRM insights, reducing refresh and reporting delays by 45%. Established a Delta Lake-based la
Senior Data Engineer at IBM Private Ltd
December 1, 2023 - December 1, 2023
Directed data requirements gathering with finance and supply chain stakeholders, guiding project planning and reducing scope creep by 20% across 6+ concurrent initiatives. Led Databricks-based implementation (PySpark, Unity Catalog, Synapse) to establish an enterprise-grade data lake with 99.9% uptime and enhanced reliability. Optimized SSIS-to-ADF transformations with PySpark in Databricks, reducing daily job runtimes by 55% to accelerate downstream reporting. Generated Python-based test scripts to validate data lineage and integrity, boosting automation coverage by 60%. Automated data cataloging with Azure Purview to ensure GDPR compliance and audit readiness. Consolidated Power BI with Delta Lake and SAP BO layers, delivering interactive executive dashboards with 3-second refresh times. Implemented a Delta Lake/ADLS-based data lake to standardize ingestion from SAP BW and Azure SQL, and delivered SSAS Tabular models for OLAP queries, improving analytics performance. Governed CI/CD p
Data Engineer at Infosys Ltd
May 1, 2022 - May 1, 2022
Partnered with cross-functional teams to define data strategy and reporting requirements, boosting alignment and reducing change requests. Delivered production-ready pipelines using Databricks, Azure Synapse, Unity Catalog, PySpark, and Fabric, enabling scalable analytics across the enterprise. Processed millions of PySpark records daily, cutting data ingestion time from 2 hours to 30 minutes and achieving ~85% faster processing speeds. Created Python ETL modules and test frameworks to enable test-driven development, improving code reliability across multiple environments. Planned parameterized ETL frameworks for reuse across 6+ domains, shortening development time by 40%. Linked and published Power BI dashboards to Delta tables for real-time KPIs and optimized refresh efficiency by 50%. Transferred data from Cosmos DB to Azure SQL DB and then to S3/Redshift, reducing data redundancy and cloud storage costs by ~20% while maintaining 99.9% data availability. Implemented RBAC/IAM governa
Business Intelligence Developer at Hitachi Energy Ltd
August 1, 2021 - August 1, 2021
Led requirement gathering with finance and marketing to align KPIs and data models, improving report adoption by 35%. Re-engineered SSIS packages to optimize data flow and reduce runtime by 40% for 10GB+ daily SAP data loads. Built end-to-end BI solutions using Tableau and Power BI for campaign and revenue reporting, automating reporting processes and reducing manual effort by 60%. Developed ETL workflows in Alteryx for rapid prototyping of sales and marketing pipelines, boosting SLA compliance by 20% and improving delivery timelines. Connected Power BI dashboards to SSAS cubes for faster drill-downs and near real-time data refreshes, and launched SSRS-based reporting hub for automated distribution, reducing daily email load by 70%. Enforced data governance across SSAS and Tableau dashboards, improving data accuracy to 99% and ensuring auditability. Enhanced metadata visibility by integrating Alteryx with SAP metadata for better transformation traceability. Provided ongoing production

Education

Master of Sciences (MSc) – Data Science at University of Salford
January 1, 2024 - May 1, 2025
Bachelor of Technology (B. Tech) – Electronics and Communication Engineering at Visvesvaraya Technological University
January 11, 2030 - November 5, 2025

Qualifications

Microsoft Certified: Azure Data Engineer Associate (DP-203)
January 11, 2030 - November 5, 2025

Industry Experience

Financial Services, Professional Services, Manufacturing, Transportation & Logistics, Software & Internet