Hi, I am Smruti Prava Sahu, an Azure Data Engineer with over 7 years of IT experience, including 5 years specializing in Azure data engineering. I am proficient in SQL, PySpark, SparkSQL, and various Azure services like Azure Data Factory, Azure Data Lake, Azure Synapse, and Azure Databricks. I have a strong track record of designing, developing, and optimizing scalable ETL pipelines for data migration and transformation across various sectors. I have experience working in agile and DevOps environments and have contributed to projects in Research, Banking, and Retail industries. My expertise includes migrating data from on-premises sources to the cloud, processing diverse data formats, optimizing data pipeline performance, and implementing data integration solutions for large enterprises. I enjoy collaborating with stakeholders to deliver efficient and reliable data engineering solutions.

Smruti Prava Sahu

Hi, I am Smruti Prava Sahu, an Azure Data Engineer with over 7 years of IT experience, including 5 years specializing in Azure data engineering. I am proficient in SQL, PySpark, SparkSQL, and various Azure services like Azure Data Factory, Azure Data Lake, Azure Synapse, and Azure Databricks. I have a strong track record of designing, developing, and optimizing scalable ETL pipelines for data migration and transformation across various sectors. I have experience working in agile and DevOps environments and have contributed to projects in Research, Banking, and Retail industries. My expertise includes migrating data from on-premises sources to the cloud, processing diverse data formats, optimizing data pipeline performance, and implementing data integration solutions for large enterprises. I enjoy collaborating with stakeholders to deliver efficient and reliable data engineering solutions.

Available to hire

Hi, I am Smruti Prava Sahu, an Azure Data Engineer with over 7 years of IT experience, including 5 years specializing in Azure data engineering. I am proficient in SQL, PySpark, SparkSQL, and various Azure services like Azure Data Factory, Azure Data Lake, Azure Synapse, and Azure Databricks. I have a strong track record of designing, developing, and optimizing scalable ETL pipelines for data migration and transformation across various sectors.

I have experience working in agile and DevOps environments and have contributed to projects in Research, Banking, and Retail industries. My expertise includes migrating data from on-premises sources to the cloud, processing diverse data formats, optimizing data pipeline performance, and implementing data integration solutions for large enterprises. I enjoy collaborating with stakeholders to deliver efficient and reliable data engineering solutions.

See more

Experience Level

Expert
Expert
Expert
Expert
Intermediate

Work Experience

Azure Data Engineer at Infosys (Remote)
May 31, 2025 - August 26, 2025
Migrated data from an on-premise server to Azure Data Lake using Azure Data Factory. Processed data formats including CSV, JSON, and XML using PySpark and Spark SQL within Databricks. Wrote processed data to Azure SQL Database and Delta Lake and archived data in separate containers. Orchestrated end-to-end data pipelines with Azure Data Factory and developed dashboards on Delta tables. Designed data integration solutions to migrate from Dynamics 365 on-premise to Azure cloud. Implemented data partitioning in ADLS Gen2 and optimized Spark execution using broadcast joins.
Azure Data Engineer at ADNEC
April 30, 2024 - August 26, 2025
Developed and maintained ETL data pipelines using Azure Data Factory and Databricks. Collaborated with stakeholders for requirements gathering and pipeline design. Ingested data from on-premise SQL servers and REST APIs into ADLS Gen2 with ADF and Databricks. Implemented multi-layer data architecture (landing, parking, archive, bronze, silver, gold) in ADLS Gen2 using mount points with SAS tokens and service principals. Created views for final data layers based on user requirements, including incremental data loading.
Azure Data Engineer at Tech-Mahindra (Scotia Bank)
April 30, 2023 - August 26, 2025
Developed scalable data solutions using PySpark, SQL, and Delta Lake. Orchestrated pipelines with Azure Data Factory integrating multiple data sources. Optimized ETL processes to load data into Azure SQL Database and Synapse Analytics. Managed Delta Lake tables and resolved performance bottlenecks. Implemented full and incremental data loading strategies from on-premises SQL servers and developed SCD-1 and SCD-2 solutions for streaming data.
Azure Data Engineer at Accenture (Allianz)
March 31, 2022 - August 26, 2025
Ingested data from Salesforce, SFTP, and SharePoint into Azure Blob Storage. Created Linked Services and Datasets for varied data structures. Developed Azure Data Factory pipelines and dataflows applying PySpark for data ingestion and transformation. Managed ADF activities such as Copy and Stored Procedure with configuration management. Facilitated data migration and deployment between on-premise and cloud environments.
Azure Data Engineer at Wipro Technologies (Corning Incorporated)
October 31, 2020 - August 26, 2025
Designed and optimized ETL pipelines with Azure Data Factory to extract, transform, and load data into a data lake. Configured Azure integration runtime and self-hosted runtime to connect on-premise systems with Azure cloud. Created pipelines to copy data from on-premise SQL servers to Azure Data Lake in CSV format. Used Spark SQL and Python in Azure Databricks to transform data into fact and dimension tables. Utilized ADF transformations including Lookup, Derived Column, Aggregate, and Conditional Split extensively.

Education

MCA at Centre for Post Graduate Studies, Orissa University of Agriculture and Technology, Odisha
January 11, 2030 - August 26, 2025
BSC at College of Basic Science and Humanities, Orissa University of Agriculture and Technology, Odisha
January 11, 2030 - August 26, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Financial Services, Retail