Praveen Radhakrishnan

Available to hire

I am an experienced data engineer with 19+ years of experience building enterprise data platforms, leading cloud migrations, and developing ETL/ELT pipelines for Fortune 500 companies and government agencies. I’m Databricks certified and deeply skilled in Spark, Python, and cloud-native data ecosystems, as well as traditional platforms like Informatica, SQL, and Oracle. I’ve delivered large-scale data solutions processing billions of records, with cost savings surpassing $2M and performance improvements of up to 10x.

In recent roles, I’ve built AI-powered data platforms and real-time data pipelines, optimized SQL Server procedures, migrated Oracle workloads to cloud data warehouses, and led data governance and quality initiatives. I thrive in complex, regulated environments and enjoy translating business requirements into scalable data architectures that support data-driven decision-making.

Language

English
Fluent

Work Experience

Senior Consultant - Data Engineer / AI Solutions at SISU Solutions
December 1, 2024 - November 6, 2025
Built AI-powered data platforms for government child welfare agencies, including an Azure OpenAI Case Note Summarization System with multimodal inputs that reduced documentation time by 60%. Implemented real-time call center intelligence with streaming transcription at sub-500ms latency, designed data architectures for multimodal inputs, and developed a graph database for relationship mapping and network analysis.
Senior Consultant - Data Engineer at MUFG Pension & Market Services
June 2024
Worked on Australia's largest superannuation administration platform. Optimized critical SQL Server procedures, reducing batch runtime from 4 hours to 45 minutes. Developed a comprehensive data quality framework for proactive anomaly detection in production. Built Power BI data pipelines for regulatory reporting (APRA), implemented incremental loading patterns that reduced data transfer volumes by 90%, and created automated reconciliation processes for 100M+ monthly transactions. Designed and led security remediation for database compliance.
Senior Consultant - Data Engineer at Energy Australia
October 2023
Processed 10TB+ of data daily for 1.7M customers. Led the AWS Redshift migration from an on-premises Oracle data warehouse (5TB). Architected a Databricks Delta Lake with a Bronze/Silver/Gold medallion architecture. Developed a Python ETL framework replacing legacy Oracle Data Integrator. Implemented Kafka streaming pipelines for real-time energy trading data. Optimized Spark jobs with adaptive execution, reducing costs by 40%. Created S3 data lifecycle policies saving about $100K annually in storage costs. Built comprehensive data quality monitoring using Great Expectations.
Senior Consultant - Data Designer/Engineer at National Australia Bank (NAB)
July 2022
Worked at a Big Four bank processing billions in daily transactions. Designed real-time data ingestion processing 1M+ events/second from call centers. Built an AWS data lake using S3, Glue, and Athena for unstructured data analytics. Developed complex JSON flattening logic for hierarchical banking data. Created dimensional data models supporting 500+ KPIs for executive dashboards. Implemented CDC processes for near real-time data warehouse updates. Optimized SQL queries, improving report generation from hours to minutes. Established data governance practices for Open Banking compliance.
Technical Manager - Data Engineering at Republic Services
May 2019
Led the enterprise data platform (ODS) serving 35,000+ employees across 340 facilities. Led the Denodo virtualization platform implementation, eliminating data redundancy. Developed 500+ Informatica mappings for complex business transformations. Implemented Master Data Management for customer and asset data. Created real-time CDC pipelines using Informatica PowerExchange. Migrated legacy AS/400 data to a modern SQL Server architecture. Managed a data engineering team of 25+ developers. Achieved 99.9% uptime for critical data pipelines.
Project Manager - ETL Development at Discount Tire Company
January 2016
Supported 900+ retail locations with real-time inventory requirements. Led an SAP data migration affecting all retail locations. Developed real-time ETL processes for POS integration. Built Informatica-based data pipelines for financial reconciliation. Created data marts for sales analytics and inventory management. Automated 200+ manual processes, saving 1,000+ hours monthly. Managed an offshore ETL development team of 30+ engineers.
ETL Developer - Senior Associate at Various Fortune 500 Companies
2011
Developed ETL solutions using Informatica PowerCenter and SQL; built data warehouses following the Kimball methodology; created PL/SQL procedures for data transformations; implemented data quality checks and error handling frameworks; specialized in Oracle and SQL Server database development.

Education

Master of Science in Computer Science at Georgia Institute of Technology, USA
January 1, 2016 - January 1, 2018
Bachelor of Technology in Information Technology at Mahatma Gandhi University, India
January 1, 2001 - January 1, 2005

Qualifications

Databricks Certified Data Engineer Associate
January 1, 2024 - November 6, 2025
Microsoft Azure Data Engineer Associate
January 1, 2023 - November 6, 2025
ETL Offload in Hadoop for Data Warehouse Optimization (Informatica)
January 1, 2016 - November 6, 2025
Machine Learning (Stanford University/Coursera)
January 1, 2018 - November 6, 2025

Industry Experience

Government, Financial Services, Energy & Utilities, Professional Services, Software & Internet, Manufacturing, Retail, Education, Healthcare, Transportation & Logistics, Media & Entertainment, Other