

Sindhuja Suresh

Data Scientist, Data Analyst, Database Developer






Available to hire

Hi there! I’m Sindhuja Suresh, a data engineer/analyst with 4 years of experience delivering end-to-end data pipelines and analytics solutions. I enjoy turning complex data into actionable insights using Python, PySpark, SQL, and cloud-based data platforms.

In my roles at Vosyn and TCS, I designed ingestion pipelines, optimized transformations across bronze-silver-gold lakehouse layers, implemented data quality checks, and partnered with stakeholders to enable self-service analytics with Power BI dashboards and reliable data models.

Skills

Experience Level: Expert

Language

English: Fluent

Work Experience

Data Engineer at Vosyn Inc

April 1, 2025 - April 1, 2025

Designed and maintained pipelines that ingest employee data from diverse S3 sources into Azure Data Factory. Developed and optimized PySpark and SQL-based ETL transformations across the bronze, silver, and gold layers of the lakehouse. Monitored data transfer performance and implemented partitioning, indexing, and caching, reducing query latency by 30%. Automated full and incremental loads in Azure Data Factory with triggers, scheduling, and failure handling. Created data quality checks and automated tests to validate schema, completeness, and consistency. Partnered with the data team to design schemas, relationships, and mapping logic for business-critical datasets. Standardized performance metrics across product and marketing dashboards so that data is interpreted consistently.

Data Engineer at Tata Consultancy Services

December 31, 2023 - December 31, 2023

Client: Nationwide Insurance Company. Designed and implemented XML ingestion pipelines on the Databricks platform to process insurance policy data (≈2,000 policies per file), parsing XML structures into relational tables with parent–child relationships linked by primary and foreign keys. Built a Raw Layer to store ingested data without transformation for traceability. Developed Spark SQL transformations to convert raw XML data into a Harmonized Layer, applying business rules and client-specific requirements. Integrated external API services to enrich harmonized datasets with additional attributes (e.g., premium details), updating records dynamically. Orchestrated workflows in Databricks, loading curated data into Snowflake for downstream analytics and consolidated policy-level reporting in Power BI. Led quarterly analytics sessions with clients to clarify KPI interpretations and enable self-service analytics.