SUNIL RANGWANI

Available to hire

I am a Senior Data Engineer with over 25 years of experience building production data platforms and AI/ML systems at scale. My expertise lies in the modern data stack, streaming architectures, graph databases, and Generative AI implementations. I specialize in cloud architectures across GCP, AWS, and Azure, and full-stack development using Python, Scala, Java, and React.

I pride myself on delivering complex data pipelines and AI-driven solutions that align technical execution with business objectives. I enjoy collaborating across teams to create innovative solutions, particularly in cybersecurity and AI compliance, that have real-world impact and long-lasting client partnerships.

Language

English
Fluent
Hindi
Advanced

Work Experience

Senior Architect at Vodafone Group
February 1, 2021 - Present
Led architecture and technical strategy for production systems at Vodafone as a hands-on architect. Spearheaded a Cyber Security Analytics platform on GCP for SME customers, developed semantic search applications using Vertex AI for security insights, and built Gen AI analytics platforms with natural language query and automated visualization capabilities. Designed customer onboarding for Cyber Security MDR offerings, streamlining customer acquisition through integrations with Google SecOps platforms. Developed a Graph RAG solution for EU AI Act compliance using the AIRO ontology and Neo4j, in collaboration with academic researchers. Architected data ingestion frameworks and optimized Apache Spark pipelines, including troubleshooting expensive ETL pipelines to achieve a 99.99% cost reduction. Drove deliverables for the group-level Responsible AI Program covering risk management, AI compliance, and supply chain risk evaluation.
Senior Big Data Engineer at Vodafone Group
November 30, 2020 - July 25, 2025
Built a comprehensive ingestion platform supporting 17 PB of storage and thousands of data feeds across a 600-node Hadoop cluster. Delivered the cloud migration from on-premises Hadoop to GCP, including a successful PoC and rollout of the migration framework. Participated in early GCP Data Engineering training delivered by Google. Developed a Data Governance platform using Scala Play with asynchronous processing and functional programming, deployed on GCP. Established the Constellation microservices platform with an Istio service mesh on Google Kubernetes Engine (GKE), and led the definition of API integration standards for Data Governance.
Data Engineer at Groupon
December 31, 2022 - July 25, 2025
Delivered the migration strategy for the Supply Density project, replacing legacy Teradata ETL with AI/Analytics solutions. Standardized streaming pipelines by migrating from Spark 1.3 and Kafka 0.8 to EMR Spark 3+ and Kafka 2.6+, achieving monthly cost savings. Implemented backpressure mechanisms to keep pipelines stable during peak loads. Created real-time demo dashboards using Wavefront and Kafka offset manipulation to visualize performance, accelerating stakeholder approvals.
Data Engineer at VodafoneZiggo
May 31, 2022 - July 25, 2025
Led real-time platform strategy built on Kafka, with AWS MSK infrastructure defined in CDK and deployed through GitOps. Pioneered SASL/SCRAM security automation in Java within GitOps pipelines, an industry first. Executed a rapid AWS migration of 600+ Spark applications, serving as data engineer, platform engineer, and solutions architect across multiple stakeholder groups.
Lead Big Data Engineer at HSBC Bank
November 30, 2017 - July 25, 2025
Redesigned the Spark ETL framework for risk-weighted asset calculations, achieving a 10x performance improvement. Built a configurable parallelisation framework with throttle controls, extending Spark's capabilities, within 4 weeks of joining. Simplified legacy Type 2 SCD processing into concise Scala using SHA2 fingerprinting and generic algorithms.
Contractor at Comparethemarket.com
July 31, 2017 - July 25, 2025
Redesigned ETL workflows, replacing the legacy Kafka/Camus/Oozie/PIG stack with Spark/Scala solutions that improved reliability and met SLAs. Delivered a migration solution that saved £300K in licensing and data center costs and avoided extensive historic data backfills. Developed an autonomous ETL application using distributed Akka actors, which received positive business feedback for its efficiency.

Education

BSc at University of Mumbai
January 1, 1990 - December 31, 1994
BSc at University of Mumbai
January 1, 1990 - December 31, 1993
Advanced Diploma in Software Engineering
January 1, 1994 - December 31, 1995
Diploma at Narsee Monjee Institute of Management Studies
January 1, 1996 - December 31, 1997
BSc at University of Mumbai
January 1, 1995 - December 31, 1998
BSc Physics with Computer Programming and System Analysis at University of Mumbai
January 1, 1996 - December 31, 1999
Diploma in Management Studies at Narsee Monjee Institute of Management Studies
January 1, 2000 - December 31, 2001

Qualifications

GCP Generative AI Leader certification
January 1, 2025 - December 31, 2025
Google Cloud Skills Boost
January 1, 2020 - December 31, 2020
Generative AI for Business Leaders, LinkedIn
January 1, 2023 - December 31, 2023
AI-First Product Leader, LinkedIn
January 1, 2023 - December 31, 2023
Google Cloud Platform Fundamentals: Core Infrastructure
January 1, 2019 - December 31, 2019
Google Cloud Platform Big Data and Machine Learning Fundamentals
January 1, 2019 - December 31, 2019
Data Engineering on Google Cloud Platform - 5-day training at Google
January 1, 2019 - December 31, 2019
Functional Program Design in Scala by École Polytechnique Fédérale de Lausanne
January 1, 2017 - December 31, 2017
Functional Programming Principles in Scala by École Polytechnique Fédérale de Lausanne
January 1, 2016 - December 31, 2016
Spark development bootcamp by Databricks
January 1, 2015 - December 31, 2015
Splunk training
January 1, 2014 - December 31, 2014
Risk Management: Principles And Applications by SOAS, University of London
January 1, 2012 - December 31, 2012
Coding the Architecture certification from Skills Matter
January 1, 2008 - December 31, 2008
Agile certification from Valtech
January 1, 2008 - December 31, 2008
Sun Certified Programmer for the Java 2 Platform
January 1, 2002 - December 31, 2002

Industry Experience

Telecommunications, Financial Services, Software & Internet, Government, Professional Services