I am a data engineer with 5+ years of experience designing and building scalable data infrastructure for storage, transformation, and analytics, including datasets exceeding 5 TB. I have hands-on experience with relational and NoSQL databases, cloud-native architectures on Azure and AWS, and modern data tooling such as Databricks, Airflow, Tableau, and Snowflake. I excel at delivering robust pipelines, streamlining deployments with Docker/Kubernetes and CI/CD, and enabling data-driven decision-making for stakeholders. I enjoy collaborating with cross-functional teams to translate business needs into reliable data solutions, optimize performance, and drive measurable impact across enterprise data ecosystems.

RAVALIKA MARAPALLY

Available to hire

Experience Level

Expert

Language

English
Fluent

Work Experience

Data Engineer at McKinsey & Company
October 1, 2024 - November 6, 2025
Designed batch processing applications leveraging PostgreSQL to automate workflows and manage 3 TB datasets within an S3 Data Lake, reducing processing latency and enhancing scalability for data operations. Revamped data pipelines by integrating Tableau for daily processing of 100 GB of data, cutting query times and delivering near real-time analytics for dashboards. Automated multiple data pipelines with Hadoop, ensuring uptime and reliability. Built cloud-native pipelines using Visual Studio, EMR, and AWS Lambda, shortening ETL cycles and streamlining analytics integration. Implemented Snowflake-based warehousing to optimize queries and reduce storage costs.
Data Engineer/Cloud Analyst at Accenture
December 1, 2022 - December 1, 2022
Architected a data pipeline to ingest and process semi-structured data from 5 sources, with end-to-end processing in 5 minutes. Engineered real-time streaming with Apache Kafka, sustaining 50 messages/second with sub-second latency. Designed Databricks/Spark workflows processing 1 TB daily, completing data wrangling in 4 hours and producing 20,000 analysis-ready rows. Developed and version-controlled Python data solutions, cutting sprint timelines. Deployed Kubernetes clusters across nodes to maintain uptime and scale to 500 requests/min during peak loads. Led Agile practices for a team of 6, delivering multiple high-impact features. Implemented ETL workflows with Informatica and optimized cloud data operations on Azure with Databricks.
Data Engineer/Senior Analyst at Capgemini
September 1, 2021 - September 1, 2021
Implemented Airflow ETL workflows ingesting 1 TB of data from multiple APIs, enhancing pipeline reliability. Built dashboards with Seaborn to monitor software analytics. Managed Azure infrastructure supporting the software ecosystem with real-time monitoring, reducing manual configuration effort. Used Cassandra to store 6 TB of user data, supporting high query throughput; leveraged Teradata for BI across 8 TB of historical data to accelerate reporting. Orchestrated 7 Azure Data Factory pipelines ingesting telemetry data daily, enabling real-time performance metrics.
Data Analyst at Adani
January 1, 2019 - January 1, 2019
Streamlined SQL-driven data workflows to strengthen internal reporting. Implemented CI/CD pipelines with Jenkins and AWS CodePipeline/CodeDeploy to improve deployment speed. Managed Oracle databases, improving query times and supporting daily queries. Automated data transformation tasks with Excel-based tools and collaborated with cross-functional teams to modernize data pipelines.

Education

Master of Science in Computer Science at University of Central Missouri
January 1, 2023 - May 1, 2024
Bachelor of Technology in Computer Science and Engineering at SR University
June 1, 2014 - May 1, 2018

Industry Experience

Computers & Electronics, Software & Internet, Professional Services