I am a skilled data engineer with extensive experience designing and building scalable ETL pipelines using modern cloud platforms and big data technologies. I enjoy optimizing data workflows and automating processes to deliver fast, cost-effective, and reliable insights that drive business impact. Currently, I focus on cloud native data engineering solutions in retail and enterprise environments, leveraging tools like Apache Spark, Airflow, and BigQuery to transform raw data into actionable analytics. My background combines hands-on technical expertise with collaboration across engineering, analytics, and product teams to deliver robust data platforms. I am passionate about continuous learning and applying best practices in cloud data engineering, infrastructure as code, and CI/CD to build efficient and scalable data ecosystems. I am excited about leveraging emerging technologies and methodologies to help businesses unlock the full potential of their data assets.

Pranitha Kamishetty

I am a skilled data engineer with extensive experience designing and building scalable ETL pipelines using modern cloud platforms and big data technologies. I enjoy optimizing data workflows and automating processes to deliver fast, cost-effective, and reliable insights that drive business impact. Currently, I focus on cloud native data engineering solutions in retail and enterprise environments, leveraging tools like Apache Spark, Airflow, and BigQuery to transform raw data into actionable analytics. My background combines hands-on technical expertise with collaboration across engineering, analytics, and product teams to deliver robust data platforms. I am passionate about continuous learning and applying best practices in cloud data engineering, infrastructure as code, and CI/CD to build efficient and scalable data ecosystems. I am excited about leveraging emerging technologies and methodologies to help businesses unlock the full potential of their data assets.

Available to hire

I am a skilled data engineer with extensive experience designing and building scalable ETL pipelines using modern cloud platforms and big data technologies. I enjoy optimizing data workflows and automating processes to deliver fast, cost-effective, and reliable insights that drive business impact. Currently, I focus on cloud native data engineering solutions in retail and enterprise environments, leveraging tools like Apache Spark, Airflow, and BigQuery to transform raw data into actionable analytics.

My background combines hands-on technical expertise with collaboration across engineering, analytics, and product teams to deliver robust data platforms. I am passionate about continuous learning and applying best practices in cloud data engineering, infrastructure as code, and CI/CD to build efficient and scalable data ecosystems. I am excited about leveraging emerging technologies and methodologies to help businesses unlock the full potential of their data assets.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate

Work Experience

Senior Data Engineer at Walmart
June 1, 2023 - Present
Designed and implemented scalable ETL pipelines using Scala and Apache Spark on Google Cloud Platform to enhance the Know Your Store Insights program, significantly reducing reporting latency from hours to minutes. Automated data workflows using Apache Airflow on Cloud Composer, achieving 99.9% SLA compliance. Secured data processing and lowered storage costs through optimized Cloud Storage and IAM policies. Developed Spark applications that improved query performance three-fold by efficiently transforming raw data to Hive Parquet format. Architected partitioned Hive tables and Spark SQL aggregations to reduce load times by 40%. Automated Dataproc cluster provisioning, saving $120k annually by eliminating idle nodes. Tuned Spark job configurations for up to 40% faster runtimes. Ensured responsive UI performance under peak load with stress testing. Optimized Druid clusters for high concurrency with zero downtime. Served as liaison between engineering and API teams to deliver high-speed
Data Engineer at Accenture
December 31, 2021 - August 26, 2025
Built and maintained cloud-ready ETL pipelines using Informatica PowerCenter and Informatica Cloud, integrating data from over 40 sources into a Snowflake data warehouse to enable rapid global reporting. Developed serverless data ingestion frameworks with AWS Glue, Lambda, and Step Functions, reducing time-to-insight from 24 hours to 2 hours. Managed daily data workloads orchestrated by AWS CloudWatch Events and Apache Airflow, ensuring 99.9% on-time SLA across more than 120 pipelines. Containerized legacy PowerCenter workflows with Docker and deployed on Amazon EKS, decreasing provisioning lead time from days to hours. Migrated 15 TB of data from on-premises SQL Server to Amazon Redshift, achieving significant licensing cost savings. Implemented real-time streaming data ingestion with Snowpipe and S3 event notifications. Introduced infrastructure as code via Terraform and GitLab CI/CD to guarantee environment reproducibility and minimize manual errors.

Education

Master of Science in Data Science at University of Texas at Arlington
January 11, 2030 - August 26, 2025
B.Tech. in Computer Science at Sreenidhi Institute of Science and Technology
January 11, 2030 - August 26, 2025

Qualifications

Microsoft Certified Fabric Data Engineer Associate
July 1, 2025 - August 26, 2025

Industry Experience

Retail, Software & Internet, Financial Services