Samhitha Mali

Available to hire

I am a Data Engineer with over 6 years of experience designing, building, and optimizing large-scale data pipelines and platforms across cloud and on-premises environments. I have a strong background in modernizing data platforms, implementing ETL/ELT processes, and delivering actionable insights through data visualization. My expertise includes Azure, AWS, Snowflake, the Hadoop ecosystem, and real-time streaming technologies.

Throughout my career, I have successfully led cloud migrations, optimized data processing performance, and implemented comprehensive data governance frameworks, ensuring compliance with data privacy regulations. I enjoy collaborating with cross-functional teams, mentoring junior engineers, and integrating machine learning models to enable advanced analytics, fueling data-driven decision-making.

Experience Level

Expert

Work Experience

Sr Azure Data Engineer at Fulton Bank
August 1, 2022 - Present
Designed and implemented end-to-end data pipelines using Azure Data Factory, Databricks, and Synapse Analytics to ingest, transform, and load data into Azure Data Lake and Azure SQL Data Warehouse. Led migration of on-premises data warehouses to Azure Synapse and Snowflake, optimizing query performance and reducing costs by 30%. Developed PySpark and Spark SQL scripts for real-time analytics and reporting, and implemented a Data Mesh architecture to decentralize data ownership and improve data quality. Built CI/CD pipelines in Azure DevOps, integrated Snowflake with multi-cloud environments, and developed Power BI dashboards for performance visualization. Optimized Spark applications to reduce processing time by 40%, enforced data governance and security controls including RBAC and encryption, and implemented real-time streaming using Kafka and Azure Stream Analytics. Mentored junior engineers and collaborated with data scientists to operationalize machine learning models. Conducted pe
Azure Data Engineer at OSF Health Care
July 31, 2022 - August 28, 2025
Designed scalable ETL pipelines with Azure Data Factory and Apache Airflow to process structured and semi-structured data. Architected data pipelines that extracted data from APIs, databases, and flat files, transformed it with PySpark in Azure Databricks, and loaded it into Azure Data Lake and Snowflake, reducing processing time by 30%. Led migration to Azure Cloud with minimal downtime and integrated machine learning models for predictive analytics. Optimized storage in Azure Data Lake and query performance in Azure Synapse. Used Azure Stream Analytics and Event Hub for real-time processing. Implemented RBAC and encryption for security and for compliance with HIPAA, GDPR, and CCPA. Automated data cataloging with Azure Purview, monitored pipelines with Azure Monitor, and developed CI/CD pipelines using Azure DevOps. Reduced overall cloud costs by 20% via resource scaling and query tuning. Developed disaster recovery strategies and improved query performance by 40%. Collaborated with multidisciplina
Hadoop Developer at Aptiv
August 31, 2020 - August 28, 2025
Designed and optimized Hadoop-based data pipelines using HDFS, MapReduce, Hive, and HBase for large-scale data analytics. Developed Spark Streaming pipelines with Kafka for real-time ingestion and analytics, improving decision-making speed and accuracy. Implemented ETL processes with Sqoop for database-to-HDFS migration. Optimized Hive queries for actionable insights and used Avro, Parquet, and ORC file formats to improve storage and processing efficiency. Ensured data quality via validation and cleansing techniques. Developed batch and stream processing with MapReduce, HBase, and Spark. Integrated AWS S3 and Azure Blob Storage into Hadoop workflows for scalable solutions and containerized applications with Docker. Automated workflows with Apache Oozie and enforced data governance policies for GDPR and CCPA compliance. Created dynamic reports with Power BI and Tableau, optimized query performance, integrated Git for version control, and implemented CI/CD pipelines with Jenkins. Conducted benchmar

Education

Masters in Computer and Information Science at University of Texas
January 11, 2030 - August 28, 2025

Industry Experience

Healthcare, Financial Services, Software & Internet, Manufacturing
