I am a Data Engineer with over 4 years of experience in designing and deploying large-scale ETL pipelines, real-time streaming architectures, and cloud-native data platforms, primarily in financial services and manufacturing domains. I am proficient in tools like Apache Spark, Kafka, AWS, Snowflake, Airflow, and Python, and have a strong background in data modeling, cloud migration, and compliance with regulations such as SOX, CCAR, and Basel III. My expertise includes optimizing batch and streaming workflows to reduce infrastructure costs and enhance decision intelligence through advanced analytics and BI dashboards created with Tableau and Power BI. I am passionate about working in Agile environments and have successfully led data integration and governance initiatives across globally distributed teams, driving operational efficiency and business insights.

VIJAY YARABOLU

I am a Data Engineer with over 4 years of experience in designing and deploying large-scale ETL pipelines, real-time streaming architectures, and cloud-native data platforms, primarily in financial services and manufacturing domains. I am proficient in tools like Apache Spark, Kafka, AWS, Snowflake, Airflow, and Python, and have a strong background in data modeling, cloud migration, and compliance with regulations such as SOX, CCAR, and Basel III. My expertise includes optimizing batch and streaming workflows to reduce infrastructure costs and enhance decision intelligence through advanced analytics and BI dashboards created with Tableau and Power BI. I am passionate about working in Agile environments and have successfully led data integration and governance initiatives across globally distributed teams, driving operational efficiency and business insights.

Available to hire

I am a Data Engineer with over 4 years of experience in designing and deploying large-scale ETL pipelines, real-time streaming architectures, and cloud-native data platforms, primarily in financial services and manufacturing domains. I am proficient in tools like Apache Spark, Kafka, AWS, Snowflake, Airflow, and Python, and have a strong background in data modeling, cloud migration, and compliance with regulations such as SOX, CCAR, and Basel III.

My expertise includes optimizing batch and streaming workflows to reduce infrastructure costs and enhance decision intelligence through advanced analytics and BI dashboards created with Tableau and Power BI. I am passionate about working in Agile environments and have successfully led data integration and governance initiatives across globally distributed teams, driving operational efficiency and business insights.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert

Work Experience

Data Engineer at State Street
January 1, 2025 - Present
Designed and optimized large-scale financial data ETL pipelines processing over 500 GB/day using Apache Spark, Airflow, Talend, NiFi, and Informatica, improving batch processing latency by 40%. Developed real-time ingestion and fraud detection workflows using Apache Kafka and Spark Streaming, reducing false positives by 15%. Architected scalable data lake platforms on Hadoop and AWS, enabling regulatory and credit risk analytics on petabyte-scale securities. Conducted credit risk modeling supporting Basel III, CCAR, and trading decision intelligence. Migrated legacy infrastructure to AWS RDS and Google BigQuery, improving query performance by 40% and reducing costs by 25%. Delivered automated BI dashboards with PySpark, Hive, and Tableau for real-time insights into trading and operational KPIs. Administered distributed databases enforcing data governance and compliance with SOX, CCAR, and FINRA.
Data Engineer at HCL Tech
July 1, 2023 - August 26, 2025
Designed and optimized scalable ETL pipelines with PySpark, Apache Spark, and AWS Glue handling IoT telemetry and behavioral data, reducing latency by 25% and increasing throughput by 30%. Built distributed data frameworks utilizing Hadoop, Hive, Pig, Kafka, and HBase managing over 5TB/day for compliance and performance analytics. Architected cloud-native data platforms with AWS and Snowflake improving scalability and reducing infrastructure costs by 20%. Developed RESTful APIs integrating 50+ microservices enhancing operational efficiency by 40%. Automated data ingestion pipelines using Airflow, Talend, and NiFi, enabling supply chain and production data workflows. Implemented CI/CD pipelines aligning with Agile and DevOps, reducing deployment time by 70%. Led cloud migration and data governance initiatives with full compliance and developed Power BI dashboards tracking KPIs.

Education

Master of Science at University of Central Missouri
January 11, 2030 - May 1, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Financial Services, Manufacturing