Available to hire
I am Pandia Rajakumari, a Senior Data Engineer with over 11 years of IT experience, including 5+ years in modern data engineering. I specialize in building scalable cloud-native data pipelines, ETL/ELT automation, and large-scale data processing.
I have hands-on experience with Python, SQL, PySpark, Snowflake, AWS Glue, Databricks, Iceberg, and Hadoop ecosystems; I have built medallion data architectures and optimized ETL to reduce cloud costs while processing tens of millions of records daily. I work across data governance and analytics, collaborating with cross-functional teams to deliver robust data solutions.
Skills
Language
English
Fluent
Work Experience
Data Analyst at Mindmap Technologies
April 1, 2025 - April 1, 2025Maintained Recon Art (Reconciliation) application to process outstanding records; analyzed input data from RDBMS and flat files, loaded into the Recon Art tool, and analyzed final outputs. Developed stored procedures/functions in SQL Server, Oracle PL/SQL, and Snowflake to purge old data. Created and tested Unix shell scripts and scheduled the Recon Art jobs using Control-M. Worked with Snow SQL, PostgreSQL, and T-SQL queries; maintained and scheduled ETL SSIS jobs.
PL/SQL/ETL Developer at U3 Infotech
March 1, 2024 - March 1, 2024Created CI/CD pipeline with Jenkins for Snowflake deployment. Assisted by developing Python scripts for ETL data loading. Created ETL jobs to load data from Oracle to Snowflake and scheduled in CTRL-M. Involved in extracting data from flat files and relational databases into staging area. Supported trading application built with front end as Java/Oracle. Worked in Snow SQL for data warehouse and had working knowledge of stored procedures and functions in Snowflake.
Data Engineer/Data Analyst at Rapid Data Technologies
August 1, 2023 - August 1, 2023AWS Glue ETL jobs developed to load data from databases to S3 and from S3 to Athena/Redshift for various data management services; worked with AWS RDS. Executed POC by reading data from S3 and publishing to endpoints using Python FastAPI and DuckDB. Designed scalable big data processing pipelines using Hadoop, PySpark, and other distributed technologies to process 1TB daily from RDBMS sources (Oracle, SQL Server) and various file formats into a Cloudera-based data lake. Wrote efficient Python ETL scripts (Pandas/Numpy); built/maintained 20 Power BI dashboards for data quality and KPI monitoring. Collaborated with data scientists to translate requirements into data models and architecture. Worked with Hadoop (HDFS), Hive/Impala; wrote advanced SQL/ T-SQL queries; developed Snowflake/SQL Server stored procedures; used Oozie for workflow orchestration.
PL/SQL Developer at Virtusa
June 1, 2022 - June 1, 2022Maintained PL/SQL procedures/functions for Investment system applications; modified Oracle Forms/Reports 12c (along with WebLogic) in existing modules. Used AWR to identify performance issues and implemented solutions. Performed BAU tasks and provided regular status updates. Monitored server performance metrics using ITRS; maintained datawarehouse using Snowflake and wrote ad hoc queries as required.
Data Engineer / Data Analyst / Database Developer at HCL Technologies
December 1, 2021 - December 1, 2021Life Insurance project: designed and implemented database structures to improve data integrity; extensively used ETL to load data from flat files, SQL Server, and Oracle to Oracle 11g; developed SSIS packages for ETL; strong knowledge in dimensional modeling and SCD types; developed Data Mart and used snowflake/star schemas; created ~20 Power BI reports with DAX and Power Query. Drug Lifecycle project: developed Scala Spark code for real-time streaming data, used Kafka and NoSQL HBase to store drug post-marketing feedback; ETL pipelines with SSIS for batch streaming; contributed across SDLC from requirements to testing.
SQL Developer at 3i Infotech
October 1, 2013 - October 1, 2013Developed/debugged and tested PL/SQL stored procedures, functions, packages, triggers, cursors, and collections; designed database structures based on Functional Design Specifications; developed forms and reports using Oracle Forms and Reports. Contributed to interfaces, conversions, and enhancements in receivables, payables, and project accounting; performed unit and integration testing.
Data Engineer / Data Analyst at Mind Map Technologies, Singapore
January 1, 2025 - April 1, 2025Managed enterprise reconciliation platform ensuring data completeness and accuracy. Developed and optimized Snowflake SQL and PL/SQL procedures. Automated workflows using Control-M and developed shell scripts. Improved performance through indexing and query tuning. Maintained SSIS ETL workflows supporting analytics pipelines.
Data Engineer at U3 Infotech, Singapore
November 1, 2023 - March 1, 2024Built CI/CD pipelines using Jenkins for Snowflake deployments. Developed ETL pipelines in Python for source-to-target migrations, including Oracle to Snowflake data warehouse. Designed staging and ingestion pipelines for RDBMS and flat files.
Senior Data Engineer at Rapid Data Technologies, India
August 1, 2022 - August 1, 2023Designed and developed AWS Glue (PySpark) ETL pipelines to load data across bronze/silver/gold layers and target Athena/Redshift. Created Glue workflows for S3 → Athena/Redshift pipelines. Reduced ETL costs by 40% through optimization and partition pruning. Built pipelines processing 30M+ rows/day using Cloudera (Hive/Impala) & PySpark. Developed Databricks notebooks and orchestrated workflows with PySpark. Built 20+ Power BI dashboards tracking KPIs, performance, and data quality. Developed Python FastAPI + DuckDB proof-of-concept to expose S3 datasets via APIs.
Data Engineer at HCL Technologies, India
August 1, 2015 - December 1, 2021Project-based data engineering and BI work across multiple domains. Implemented ETL via SSIS, designed data models and enterprise data marts using Star/Snowflake schemas with SCD Type 1 & Type 2. Delivered 20+ Power BI dashboards with DAX and Power Query. Created PL/SQL packages, procedures, and data validation logic. Projects included near real-time streaming and reporting components for life sciences and insurance domains.
SQL Developer at 3i Infotech Pvt Ltd, India
June 1, 2008 - October 1, 2013Developed PL/SQL procedures, data models, and insurance reporting solutions.
Education
M.Sc in Computer Science at Bharathidasan University, India
January 11, 2030 - November 16, 2025M.Sc. Computer Science at Bharathidasan University, India
January 11, 2030 - January 17, 2026Qualifications
Industry Experience
Software & Internet, Financial Services, Professional Services, Computers & Electronics
Skills
Hire a Data Analyst
We have the best data analyst experts on Twine. Hire a data analyst today.