

Rahul Sohandani

Data Scientist, Developer, AI Engineer, +1






Available to hire

I am an accomplished Data Engineer with 3 years of experience specializing in architecting large-scale data pipelines, real-time streaming infrastructure, and cloud-native solutions using the AWS, Spark, and Kubernetes ecosystems. I excel at designing and scaling distributed data solutions, automating real-time processing workflows, and optimizing data pipeline performance and data warehouse architectures for scalability and reliability.

I apply critical thinking and problem-solving skills to complex data models, and I am dedicated to engineering robust backend platforms that transform data into actionable insights. My work empowers business intelligence, accelerates decision-making, and drives innovation in data analytics.

Skills

Experience Level

Expert: 11 skills

Intermediate: 12 skills

Work Experience

Graduate Research Assistant at Stevens Institute of Technology

July 31, 2024 - August 26, 2025

Built geospatial data processing workflows using GeoPandas and Shapely to optimize 100 GB+ of GBFS (General Bikeshare Feed Specification), GIS, and Census data, facilitating equity-focused mobility policy decisions.
Engineered large-scale ETL pipelines using PySpark on GCP Dataproc to ingest, process, and analyze over 500 million geospatial data records, accelerating analytics for cloud-driven urban mobility insights.
Designed scalable data architecture strategies, including data modeling, partitioning, and efficient storage formats within Dataproc pipelines, improving processing speed and supporting downstream geospatial analytics.

Senior Software Engineer at LTIMindtree

August 31, 2023 - August 26, 2025

Developed real-time ETL pipelines using Apache Spark Structured Streaming, SQL, and Kafka, reducing event stream processing time by 40%.
Implemented IT Service Management (ITSM) ticket and Enterprise Service Management (ESM) alert convergence solutions, reducing data processing time by 90%.
Led a cross-functional effort to migrate Spark and Kafka clusters from Azure HDInsight to on-premises infrastructure, cutting operational costs by 70% and boosting analytics throughput.
Configured Prometheus and Grafana monitoring, enabling real-time issue detection and enhanced system reliability.
Architected a predictive analytics project on time-series data, achieving 94.3% model accuracy to improve business forecasting.
Orchestrated end-to-end Kubernetes deployment of containerized machine learning applications, enhancing system scalability, resilience, and uptime.

Software Trainee at Netwin Infosolutions

July 31, 2020 - August 26, 2025

Built automated data ingestion pipelines to preprocess 10,000+ facial images, preparing structured datasets for MobileNet model training on AWS SageMaker.
Engineered end-to-end model training and deployment workflows on SageMaker, achieving 91% accuracy in customer interest detection for targeted marketing optimization.
Designed and provisioned cloud infrastructure using Terraform, automating infrastructure-as-code deployment of 10+ data services and cutting hours of manual setup time to support scalable, reproducible ML workflows.
Implemented CI/CD pipelines to automate data ingestion and infrastructure provisioning, reducing manual intervention, increasing deployment speed, and supporting agile data development practices.