Available to hire
I am a results-driven data engineer with 6+ years of experience designing and building scalable data pipelines, ETL frameworks, and cloud-native solutions across the AWS and Azure ecosystems. I thrive on turning complex data into actionable insights and robust data products.
I excel at leveraging Kafka, Spark, Snowflake, and Airflow to handle large-scale data for analytics and real-time decision-making, with hands-on Python and PySpark for ETL development, data validation across systems, and enabling AI/ML readiness. I enjoy collaborating across teams to deliver reliable data solutions that accelerate business decisions.
Skills
Experience Level
Language
English: Fluent
Javanese: Advanced
Afar: Intermediate
Work Experience
Data Engineer - Financial Domain at Wealth Simple
July 1, 2023 - November 7, 2025
Designed and developed data migration workflows from on-premises systems (Teradata, MySQL, DB2, Informix, Hadoop) to Azure Blob Storage and Google Cloud Storage using Apache Airflow. Defined data mapping, governance, and transformation rules for Master Data Management and OLTP/ODS layers. Built and maintained large-scale data processing pipelines with Pandas/NumPy and PySpark, and configured Snowflake for cloud data warehousing. Implemented Spark-based analytics in Python and built streaming data pipelines with Spark Structured Streaming (Kafka as source, Cassandra as sink). Automated cloud provisioning with Pulumi and deployed ML solutions in a DevOps style. Collaborated with cross-functional teams to support analytics, dashboards, and ML workflows.
Data Platform Engineer at Geotab
June 30, 2023 - June 30, 2023
Developed and managed data pipelines using Azure Data Factory for ingestion and transformation of 50+ TB monthly. Used Azure PolyBase to shape raw data into relational formats for Azure Synapse, improving storage efficiency and processing times. Built and optimized data workflows with PySpark, Spark SQL, and Spark Streaming; processed 100M+ records daily. Implemented data ingestion pipelines into Azure Data Lake Storage Gen2 and designed Databricks ETL pipelines. Contributed to scalable data architectures for finance applications and real-time analytics.
Data Analytical Engineer at PointClickCare
December 31, 2021 - December 31, 2021
Developed JSON scripts to deploy Azure Data Factory pipelines, designed data pipelines for diverse sources, and applied statistical techniques for market trends and forecasting. Wrote complex SQL queries (including CTEs and stored procedures) to support Power BI reports. Orchestrated Databricks data preparation and loading into SQL Data Warehouse; automated data validation scripts. Proficient in Azure data movement and scheduling for Blob Storage and SQL Database.
BI/Data Analyst at Lightspeed Commerce
June 30, 2020 - June 30, 2020
Gained hands-on experience with Linux administration and database tuning. Performed data mining, data modeling reviews, and report development in Power BI and Tableau. Created ADA-compliant dashboards, collaborated with enterprise data modeling teams, and supported data warehouse design, extraction, transformation, and loading strategies.
Data Engineer at Wealth Simple
July 1, 2023 - Present
Designed and developed data migration workflows from on-premises systems (Teradata, MySQL, DB2, Informix, and Hadoop) to cloud storage (Azure Blob Storage and Google Cloud Storage) using Apache Airflow for orchestration. Built and maintained large-scale data processing pipelines with Pandas/NumPy and PySpark, enabling efficient ingestion, transformation, and validation of diverse data formats. Automated network configuration and troubleshooting with PowerShell, implemented Spark-based analytics, and developed containerized prototypes for ML models using Docker. Contributed to AWS infrastructure reliability with CloudFormation and Ansible, supporting automated deployments and code reviews; supported ML workflows and data readiness. Designed and governed data mapping, transformation, and cleansing rules for Master Data Management (OLTP/ODS) and deployed containerized apps on Kubernetes.
Data Platform Engineer at Geotab
January 1, 2022 - June 1, 2023
Developed and managed data pipelines using Azure Data Factory to ingest and transform 50+ TB of data monthly from on-premises into the Azure cloud. Leveraged Azure PolyBase to transform raw data from Data Lakes into relational formats for storage and analytics in Azure Synapse, improving storage efficiency by 40% and processing times by 25%. Used Airflow for task orchestration with Python/Bash operators, processing 100M+ records daily and handling Avro, Parquet, and JSON formats with logging for monitoring. Optimized data workflows using PySpark, Spark SQL, and Spark DataFrames; implemented secure ingestion pipelines into ADLS Gen2 and built Databricks ETL pipelines. Designed RESTful APIs and RabbitMQ messaging to enable reliable, real-time data exchange across distributed systems.
Data Analytical Engineer at PointClickCare
July 1, 2020 - December 1, 2021
Developed and deployed Azure Data Factory pipelines to extract, transform, and load data from sources such as Azure SQL, Blob storage, and Azure SQL Data Warehouse. Migrated application logic to Azure Data Lake and Data Factory, orchestrating Databricks ETL via JDBC connectors. Created Python scripts to validate data in Databricks and automated these checks through Azure Data Factory. Participated in sprint planning and translated business requirements into scalable data solutions with cross-functional collaboration.
BI/Data Analyst at Lightspeed Commerce
January 1, 2019 - June 1, 2020
Administered Linux (RHEL7) for data platform environments; created stored procedures with PL/SQL, performance-tuned backend processes, and conducted data mining to identify correlations. Enforced data naming standards, conducted logical/physical design reviews, and produced dashboards using MDX, DAX, Power BI, and Tableau. Supported the data warehouse lifecycle from source data analysis to ETL design, collaborating with enterprise data modeling teams to build logical models.
Education
Bachelor of Engineering, Computer Science at Jawaharlal Nehru Technological University Hyderabad (IND)
January 11, 2030 - November 7, 2025
Diploma, Programming & Networking at Humber College Toronto, ON
January 11, 2030 - November 7, 2025
Bachelor of Engineering at Jawaharlal Nehru Technological University Hyderabad (IND)
January 11, 2030 - January 9, 2026
Diploma at Humber College Toronto (ON)
January 11, 2030 - January 9, 2026
Qualifications
Generative AI with Python – Udemy
January 11, 2030 - January 1, 2025
Generative AI Fundamentals – Databricks
January 1, 2025 - January 9, 2026
Generative AI with Python – Udemy
January 1, 2025 - January 9, 2026
Data Engineering with Snowflake – Coursera
January 1, 2025 - January 9, 2026
Industry Experience
Financial Services, Software & Internet, Professional Services, Education, Other, Computers & Electronics