Available to hire
Hi there! I’m Sindhuja Suresh, a data engineer/analyst with 4 years of experience delivering end-to-end data pipelines and analytics solutions. I enjoy turning complex data into actionable insights using Python, PySpark, SQL, and cloud-based data platforms.
In my roles at Vosyn and TCS, I designed ingestion pipelines, optimized transformations across bronze-silver-gold lakehouse layers, implemented data quality checks, and partnered with stakeholders to enable self-service analytics with Power BI dashboards and reliable data models.
Skills
Language
English
Fluent
Work Experience
Data Engineer at Vosyn Inc
April 1, 2025 - April 1, 2025Designed and maintained data ingestion pipelines from diverse employee data sources from S3 into Azure Data Factory. Developed and optimized PySpark and SQL-based ETL transformations across bronze, silver, and gold layers of the lakehouse. Monitored data transfer performance, implementing partitioning, indexing, and caching to reduce query latency by 30%. Automated full and incremental loads in Azure Data Factory with triggers, scheduling, and failure handling. Created data quality checks and automated tests to validate schema, completeness, and consistency. Partnered with Data team to design schemas, relationships, and mapping logic for business-critical datasets. Standardized performance metrics across product and marketing dashboards, ensuring consistency in data interpretation.
Data Engineer at Tata Consultancy Services
December 31, 2023 - December 31, 2023Client: Nationwide Insurance Company. Designed and implemented XML ingestion pipelines on the Databricks platform to process insurance policy data (≈2000 policies per file), parsing XML structures into relational tables with parent–child relationships using primary/foreign keys. Built a Raw Layer to store ingested data without transformation for traceability. Developed Spark SQL transformations to convert raw XML data into a Harmonized Layer, applying business rules and client-specific requirements. Integrated external API services to enrich harmonized datasets with additional attributes (e.g., premium details), updating records dynamically. Orchestrated workflows using Databricks, moving curated data into Snowflake for downstream analytics and consolidated policy-level reporting using Power BI. Led quarterly analytics sessions with clients to clarify KPI interpretations and enable self-service analytics.
Education
Master of Science in Big Data Analytics at Trent University, Ontario, Canada
January 1, 2024 - April 1, 2025Bachelor of Technology in Chemical Engineering at Anna University, Chennai, India
August 1, 2017 - January 1, 2021Qualifications
Databricks Data Engineer Certification
January 11, 2030 - November 22, 2025Industry Experience
Software & Internet, Professional Services, Media & Entertainment
Skills
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Mississauga today.