I am a driven and detail-focused Data Engineer with over three years of experience delivering scalable, high-performance data solutions in cloud and big data ecosystems. I specialize in designing end-to-end data architectures, building distributed data pipelines, and optimizing ETL workflows to enable real-time analytics and business intelligence at scale. My expertise spans Hadoop, Spark, PySpark, SQL, and Python, combined with hands-on proficiency in AWS and Azure services such as Redshift, Lambda, Kinesis, Azure Data Factory, and Databricks. I bring a deep understanding of data systems engineering, paired with a strong sense of ownership, precision, and commitment to engineering excellence. Whether building new platforms or modernizing legacy infrastructure, I focus on delivering efficient, maintainable solutions aligned with long-term business strategy.

Pravalika Pasham

Available to hire

Experience Level

Expert: 9 skills listed
Intermediate: 4 skills listed

Work Experience

Data Engineer at Blue Cross Blue Shield
May 1, 2024 - Present
Built scalable data pipelines using Python and Apache Spark for real-time processing from AWS S3 to RDS. Developed and automated ETL workflows and advanced SQL queries for ingestion and transformation of structured and unstructured data. Integrated NLP-based models into Flask APIs for client sentiment classification. Developed ML pipelines with scikit-learn and SHAP for explainable AI. Created dynamic Power BI dashboards supporting executive reporting. Ensured data validation and integrity using SQL and Python. Collaborated with stakeholders to build analytical solutions. Delivered application experience metrics and leadership insights to improve IT service delivery and digital experience.

Assistant System Engineer – Data Analyst at Tata Consultancy Services (TCS)
August 31, 2023 - August 5, 2025
Analyzed user behavior and performance metrics using SQL and Python to guide business decisions. Created and maintained Power BI and Tableau dashboards for business intelligence reporting. Developed Python scripts for ML model evaluation. Documented and tested analytics use cases in an Agile environment. Used SAS and SAP BW for statistical analysis and customer segmentation. Participated in ETL and API automation to streamline data ingestion from external sources.

Data Science Associate at Wavetronic Solutions Private Limited
August 31, 2021 - August 5, 2025
Supported customer transaction analysis using Python and SQL to detect fraud patterns. Created dashboards with Tableau and Excel for monitoring high-risk activities. Assisted in building predictive models to classify transaction types and customer behavior segments. Performed data cleaning, exploratory analysis, and visualization using Pandas and Seaborn. Delivered weekly progress updates and presented findings internally.

Education

Master of Science in Information Technology and Management at St. Francis College, NY
January 1, 2023 - December 31, 2025

Bachelor of Engineering in Electronics & Communication at Teegala Krishna Reddy Engineering College, India
January 1, 2017 - January 1, 2021

Qualifications

Microsoft Certified: Azure Data Fundamentals (DP-900)
Data Engineering Foundations – LinkedIn Learning
Data Analysis using PySpark – Great Learning

Industry Experience

Healthcare, Financial Services, Software & Internet, Professional Services, Education
