Hi, I’m Gopala Krishna, a data engineer with 4 years of experience designing and optimizing ETL pipelines, real-time streaming, and cloud data platforms. I enjoy turning complex data into reliable, business-driven insights using Python, SQL, Spark, Kafka, Airflow, and Databricks, with hands-on experience across Snowflake, Azure Synapse, Google BigQuery, and AWS S3. I focus on data modeling, governance, and security, and I’m committed to performance optimization, CI/CD automation, and Agile delivery to enable secure, scalable data solutions that empower risk management, fraud prevention, and customer analytics. I thrive when collaborating with data scientists, analysts, and product managers to deliver datasets that meet evolving business needs.

Gopala Krishna

Available to hire

Experience Level

Expert

Language

English
Fluent

Work Experience

Data Engineer at PayPal
January 1, 2024 - November 21, 2025
Supported the design and maintenance of data pipelines that moved high-volume payment transaction data from source systems into Snowflake and Google BigQuery, enabling analytics for finance, fraud detection, and compliance. Built and optimized ETL workflows using Python, SQL, and Apache Spark, improving pipeline efficiency and reducing processing time by 20%. Implemented data quality checks and validation rules across ingestion layers to increase data accuracy and consistency for downstream reporting. Partnered with data scientists, analysts, and product managers to deliver datasets tailored to fraud prevention and customer experience initiatives, and supported ad-hoc analyses. Contributed to the cloud migration from legacy systems using GCP services (Cloud Storage, Cloud Functions, BigQuery).
Data Engineer at Accenture
December 1, 2021 - December 1, 2021
Developed ETL pipelines in Python and Apache Airflow to load healthcare datasets from EHR systems into a centralized warehouse, adding incremental loading, parallel processing, and retries to boost throughput by 40%. Built dimensional models on AWS S3 using star and snowflake schemas, and optimized storage, partitioning, and indexing to reduce query times by 35% and improve dashboard refreshes. Processed multi-terabyte datasets with Apache Spark on Databricks, applying aggregations, joins, and transformations for compliance reporting, with checkpointing and error handling to improve stability. Automated data quality checks in Python and SQL to validate schema, null values, and business rule compliance, reducing downstream reporting errors by 30%. Implemented CDC pipelines in Azure Data Factory using SQL triggers and watermarking for real-time synchronization between on-prem databases and cloud storage, ensuring zero duplication and minimal latency. Configured Azure Purview to manage metada

Education

Master of Science in Computer Science at Lamar University, Beaumont, Texas
January 1, 2022 - December 1, 2023

Qualifications

AWS Certified Data Engineer
January 11, 2030 - November 21, 2025
Databricks Certified Data Engineer
January 11, 2030 - November 21, 2025

Industry Experience

Financial Services, Software & Internet, Professional Services