Rahul Sohandani

Available to hire

I am a Data Engineer with 3 years of experience architecting large-scale data pipelines, real-time streaming infrastructure, and cloud-native solutions on AWS, Spark, and Kubernetes. I design and scale distributed data systems, automate real-time processing workflows, and tune pipeline performance and data warehouse architectures for scalability and reliability.

I bring critical thinking and problem-solving skills to complex data models, and I am dedicated to engineering robust backend platforms that turn data into actionable insights. My work supports business intelligence, speeds up decision-making, and drives innovation in data analytics.

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate

Work Experience

Graduate Research Assistant at Stevens Institute of Technology
July 31, 2024 - August 26, 2025
Built geospatial data processing workflows with GeoPandas and Shapely to optimize processing of 100 GB+ of GBFS, GIS, and Census data in support of equity-focused mobility policy decisions. Engineered large-scale ETL pipelines with PySpark on GCP Dataproc to ingest, process, and analyze over 500 million geospatial records, accelerating analytics for cloud-driven urban mobility insights. Designed scalable data architecture for the Dataproc pipelines, including data modeling, partitioning, and efficient storage formats, improving processing speed and supporting downstream geospatial analytics.
Senior Software Engineer at LTIMindtree
August 31, 2023 - August 26, 2025
Developed real-time ETL pipelines with Apache Spark Structured Streaming, SQL, and Kafka, reducing event stream processing time by 40%. Implemented convergence solutions for IT Service Management (ITSM) tickets and Enterprise Service Management (ESM) alerts, reducing data processing time by 90%. Led a cross-functional effort to migrate Spark and Kafka clusters from Azure HDInsight to on-premises infrastructure, cutting operational costs by 70% and boosting analytics throughput. Configured Prometheus and Grafana monitoring, enabling real-time issue detection and improving system reliability. Architected a predictive analytics project on time-series data that achieved 94.3% model accuracy to improve business forecasting. Orchestrated end-to-end Kubernetes deployment of containerized machine learning applications, improving system scalability, resilience, and uptime.
Software Trainee at Netwin Infosolutions
July 31, 2020 - August 26, 2025
Built automated data ingestion pipelines to preprocess 10,000+ facial images, preparing structured datasets for MobileNet model training on AWS SageMaker. Engineered end-to-end model training and deployment workflows on SageMaker, achieving 91% accuracy in customer interest detection for targeted marketing optimization. Designed and provisioned cloud infrastructure with Terraform, automating infrastructure-as-code deployment of 10+ data services and cutting hours of manual setup to support scalable, reproducible ML workflows. Implemented CI/CD pipelines to automate data ingestion and infrastructure provisioning, reducing manual intervention, increasing deployment speed, and supporting agile data development practices.

Education

MS - Computer Science at Stevens Institute of Technology
August 1, 2023 - May 1, 2025
BEng - Computer Engineering at University of Mumbai
July 1, 2017 - May 1, 2021

Qualifications

AWS Solutions Architect Associate Certification
January 11, 2030 - August 26, 2025

Industry Experience

Software & Internet, Government, Education
