I’m Himanshu Maheshwari, a data engineering professional with 12+ years of experience delivering large-scale ETL pipelines, high-volume data ingestion frameworks, and regulatory-grade data workflows across Banking, AML, Insurance, and Healthcare. I specialize in building and optimizing DataStage-style pipelines and migrating legacy systems to cloud architectures to enable near real-time analytics. I’m passionate about automating data quality, improving SLA reliability, and delivering auditable datasets for ABAC, FINTRAC, and AML programs. I enjoy collaborating with business stakeholders, aligning with SME needs, and mentoring teams to ensure production-readiness and compliance across enterprise data landscapes.

Himanshu Maheshwari

Available to hire

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Language

English
Fluent

Work Experience

GCP Data Engineer (BigQuery/DBT/Workbench) at TATA Consultancy Services
September 1, 2024 - August 1, 2025
Developed ELT pipelines on GCP BigQuery integrating AML AI for high-risk transaction detection with explainable outputs. Designed AI-supported ELT pipelines processing 50M+ transaction records for AML surveillance. Leveraged Tangerine customer & transaction data from BigQuery as source and performed ELT transformations using Jupyter Notebook/Workstation, reloading transformed data back into BigQuery. Automated ingestion, model training, and risk scoring workflows, improving detection accuracy by 20% and reducing manual intervention by 20+ hours/month. Collaborated with business stakeholders for validation, fine-tuning, and deployment of AML AI use cases. Reduced BigQuery job costs by 25% through query tuning, partition pruning, and storage optimization.
Data Engineer (Azure Databricks/GCP) at TATA Consultancy Services
March 1, 2023 - August 1, 2025
Built and orchestrated 25+ DLT pipelines processing ~30M records/day with near real-time availability. Implemented CDC and SCD Type 2 across 30+ subject areas, improving regulatory accuracy by 30%. Optimized Delta Lake using Z-ordering, clustering, and OPTIMIZE, achieving 35% faster query performance. Automated ingestion using Auto Loader, enabling schema evolution and fault tolerance for streaming workloads. Key Achievements: Reduced overall batch latency by 40% and enabled real-time actuarial analytics. Modernized legacy ETL to a cloud DLT framework, decreasing operational issues by 50%.
ETL Developer (DataStage + Hadoop) at TATA Consultancy Services
April 1, 2022 - September 1, 2024
Engineered DataStage pipelines consuming 100M+ transactions/month from Coupa, PeopleSoft & internal banking systems. Automated DQ checks (completeness, timeliness, reconciliation) reducing manual QA by ~50%. Led Data Quality initiatives within the Enterprise Data Management Office: defined DQ metrics, implemented automated checks, and established lineage tracing for CDEs. Tuned DataStage jobs (partitioning, buffering, lookup optimization) reducing runtimes by 30–50%. Designed lineage, runbooks, and recovery workflows increasing pipeline reliability to 99.5%+ SLA. Implemented data integration and transformation logic to populate business semantic layers (customers, accounts, lookups), enabling standardized reporting across enterprise systems and supporting ABAC monitoring use cases.
ETL Developer & Production Support (DataStage/Teradata) at NTT Data Services
January 1, 2019 - April 1, 2022
Developed DataStage jobs for claims/healthcare data, processing 20M+ records per weekly cycle. Enhanced ETL performance by 40% via workload restructuring and parallelization. Created an automated reporting framework, reducing manual cycle reporting effort by 70%. Supported 300+ ETL jobs, ensuring 24/7 availability and a 99% SLA. Implemented SQL automation for cycle status and interim status reporting, and contributed to failure analyses and migration of fixes.
ETL Developer & Support (DataStage/Teradata) at NTT Data Services
August 1, 2016 - January 1, 2019
Managed 200+ DataStage jobs across daily/weekly/monthly cycles, ensuring high availability. Tuned performance using Magic Wand/Viewpoint, reducing system bottlenecks by 25%. Coordinated deployments, change requests, and incident resolutions. Conducted knowledge transfer and maintained JIRA for failure resolution tracking.
ETL Developer & Support at Dell International Services
April 1, 2013 - August 1, 2016
Developed DataStage jobs for healthcare data integration with zero deployment downtime. Performed daily monitoring and root-cause analysis (RCA), improving system stability by 20%. Monitored and controlled performance with Magic Wand/Viewpoint, ensuring optimum utilization and timely root-cause analysis.
GCP Data Engineer at Tata Consultancy Services
September 1, 2024 - August 31, 2025
Developed ELT pipelines on GCP BigQuery using DBT for AML use cases at Scotiabank, processing 50M+ transactions. Automated ingestion, model training, and risk-scoring workflows; improved AML detection accuracy by 20% and reduced manual intervention by 20+ hours/month. Optimized queries with partitioning and storage improvements, achieving 25% cost reduction. Delivered AML lineage and reproducibility framework to support audit readiness and improved pipeline SLA compliance by 35%.

Education

Bachelor of Engineering (Computer Science – Honours) at RGPV University, India
September 1, 2008 - June 29, 2012

Qualifications

Databricks Certified Data Engineer Associate
January 1, 2025 - January 1, 2027
Big Data & Hadoop
January 1, 2017 - December 12, 2025
Python Certification
January 1, 2021 - December 12, 2021
Teradata Architecture Training
January 1, 2019 - December 12, 2025

Industry Experience

Financial Services, Healthcare, Professional Services, Other, Software & Internet