I am Prabhu Kumar, a Data Engineer with 6+ years of hands-on experience building and operating cloud-scale data engineering solutions on Azure. I specialize in end-to-end data pipelines using Azure Data Factory, Databricks (PySpark), Delta Lake, Snowflake, and Azure Synapse Analytics, with a focus on performance, reliability, and data governance. I also build semantic models and analytical datasets to support BI tools like Power BI, SSRS, and SSAS, and I implement data quality, validation, and CI/CD practices to ensure production readiness.

Prabhu Kumar

I am Prabhu Kumar, a Data Engineer with 6+ years of hands-on experience building and operating cloud-scale data engineering solutions on Azure. I specialize in end-to-end data pipelines using Azure Data Factory, Databricks (PySpark), Delta Lake, Snowflake, and Azure Synapse Analytics, with a focus on performance, reliability, and data governance. I also build semantic models and analytical datasets to support BI tools like Power BI, SSRS, and SSAS, and I implement data quality, validation, and CI/CD practices to ensure production readiness.

Available to hire

I am Prabhu Kumar, a Data Engineer with 6+ years of hands-on experience building and operating cloud-scale data engineering solutions on Azure.

I specialize in end-to-end data pipelines using Azure Data Factory, Databricks (PySpark), Delta Lake, Snowflake, and Azure Synapse Analytics, with a focus on performance, reliability, and data governance. I also build semantic models and analytical datasets to support BI tools like Power BI, SSRS, and SSAS, and I implement data quality, validation, and CI/CD practices to ensure production readiness.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Fluent

Work Experience

Azure and BI Engineer at Aptos
August 1, 2023 - Present
Architected and implemented enterprise-scale data ingestion and transformation pipelines using Azure Data Factory, supporting hundreds of tables across multiple source systems with automated dependency management, retries, and controlled reprocessing. Designed and optimized Spark-based ETL workloads in Azure Databricks using PySpark, implementing partition pruning, incremental loading strategies, and tuning for performance to reduce end-to-end pipeline execution time by approximately 30–45%. Built and maintained Delta Lake-based storage layers on Azure Data Lake Storage Gen2 with schema evolution, time travel, and optimized file layouts to support historical reprocessing and downstream analytics. Implemented analytical data platforms using Snowflake and Azure Synapse Analytics to enable high-performance SQL querying and scalable OLAP workloads for enterprise reporting. Developed a metadata-driven ETL/ELT framework using Azure Data Factory, Databricks notebooks, and configuration-driv
Cloud and BI Consultant at Axis Bank
March 1, 2021 - July 1, 2023
Architected and implemented a scalable data ecosystem using Apache Spark to process large volumes of data monthly, resulting in reduced operational costs and improved analytics turnaround. Developed end-to-end ingestion and transformation pipelines with Azure Data Factory and Spark-based jobs, enabling automated data flows from multiple sources. Ensured data security and privacy through data masking and anonymization within Azure Data Factory and controlled access to sensitive information. Participated in database code reviews as a DBA to maintain coding standards and best practices, and conducted BI data profiling and metadata management to gain comprehensive data understanding. Built and deployed a real-time fraud detection system using Spark and Scala to monitor transactions, detect anomalies, and trigger alerts. Crafted SSIS package designs to meet business requirements and managed data integration in Azure Synapse for advanced analytics. Implemented coding standards and automated
BI Consultant at Apollo
April 1, 2019 - March 1, 2021
Performed data validation, data blending, and SQL-driven transformations; offloaded heavy extracts to Tableau Server to optimize performance. Created dashboards with filters, calculations, and level-of-detail (LOD) expressions to deliver actionable insights and what-if analyses. Developed end-to-end ETL solutions using SSIS to integrate data from SQL Server, flat files, Oracle, and Excel into an enterprise data warehouse, supporting both full and incremental loads. Migrated older ETL packages to newer platforms and built an array of SSIS components (Lookup, Derived Column, Conditional Split, Aggregate, Pivot, Slowly Changing Dimension, Merge Join, Union All). Implemented custom logging for ETL processes and established robust data integration workflows. Created drill-down reports and visuals to enable comprehensive, business-ready analytics in a collaborative environment.

Education

Bachelor of Arts at Rabindranath Tagore University
January 11, 2030 - January 8, 2026
Bachelor of Arts at Rabindranath Tagore University
January 11, 2030 - January 8, 2026
Bachelor of Arts at Rabindranath Tagore University
January 11, 2030 - January 8, 2026

Qualifications

Microsoft Power BI Data Analyst
January 11, 2030 - January 8, 2026
Microsoft Power BI Data Analyst
January 11, 2030 - January 8, 2026
Microsoft Power BI Data Analyst
January 11, 2030 - January 8, 2026

Industry Experience

Software & Internet, Financial Services, Professional Services