I'm a data engineer who loves designing and maintaining scalable ELT pipelines in cloud environments. I specialize in Snowflake, Airflow-style orchestration, PySpark, AWS, and dbt to deliver analytics-ready data and reliable data platforms. I partner with BI and data architecture teams to improve data reliability, observability, and performance, automate data integrations, and support large-scale cloud migrations.

Sai Anuhya Bandi

I'm a data engineer who loves designing and maintaining scalable ELT pipelines in cloud environments. I specialize in Snowflake, Airflow-style orchestration, PySpark, AWS, and dbt to deliver analytics-ready data and reliable data platforms. I partner with BI and data architecture teams to improve data reliability, observability, and performance, automate data integrations, and support large-scale cloud migrations.

Available to hire

I’m a data engineer who loves designing and maintaining scalable ELT pipelines in cloud environments. I specialize in Snowflake, Airflow-style orchestration, PySpark, AWS, and dbt to deliver analytics-ready data and reliable data platforms.

I partner with BI and data architecture teams to improve data reliability, observability, and performance, automate data integrations, and support large-scale cloud migrations.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Language

English
Fluent

Work Experience

Data Engineer at Broadband Insights
July 1, 2024 - November 27, 2025
Engineered an enterprise AI Genie using LangChain, OpenAI GPT-4, and Pinecone (RAG) to automate broadband analytics and executive reporting. Developed ETL pipelines with Python, Dagster, and Snowflake handling 10M+ records daily with 60% improvement in data freshness. Built microservices for AI-driven anomaly detection using scikit-learn and AWS Lambda, reducing downtime by 45%. Automated deployment and testing workflows with Docker, Kubernetes, and GitLab CI/CD enabling zero-downtime releases. Technologies include Python, LangChain, OpenAI API, Pinecone, Snowflake, Dagster, AWS, GCP, Docker, Kubernetes, GitLab CI/CD.
Back End Developer Intern at SkyIT Services
August 1, 2024 - August 1, 2024
Designed a Flask API orchestration of BigQuery pipelines on Kubernetes, improving query throughput by 50%. Implemented IAM and LDAP security controls, lowering unauthorized access by 40%. Automated BigQuery data ingestion workflows with Python and Cloud Functions, improving reliability by 40%.
Data Consultant at Data Consulting Services (Hyderabad)
August 1, 2023 - August 1, 2023
Built and maintained 15+ Python microservices supporting 10K+ users for ITSM analytics. Automated 80% of ServiceNow workflows via REST API integration, cutting ticket processing time by 60%. Migrated 5 TB+ legacy data to AWS Data Lake and implemented CI/CD (Jenkins + GitLab), reducing release errors by 70%.
AI DevOps Engineer (Pilot) at SkyIT Services
July 1, 2024 - July 1, 2024
Designed and deployed Flask APIs orchestrating BigQuery pipelines on Kubernetes; improved query throughput by 50%. Implemented IAM + LDAP security controls; automated BigQuery data ingestion with Python and Cloud Functions; integrated RAG workflows with AWS/GCP.
Data Engineer at Broadband Insights
June 1, 2025 - January 1, 2026
Designed, implemented, and maintained scalable ELT pipelines using Snowflake and Python, processing 10M+ records daily and improving end-to-end data reliability by 60%. Orchestrated and monitored production data workflows using Dagster with Airflow-equivalent patterns, reducing data latency by 45% and ensuring SLA adherence. Managed integration and migration of legacy data sources into modern AWS- and GCP-based architectures, reducing manual data handling by 50% and enabling cloud-first analytics. Implemented comprehensive data quality checks, observability metrics, and automated alerts, decreasing pipeline failures by 40% and improving BI data trust. Optimized Snowflake schemas, transformations, and query performance, reducing analytics query latency by 30% and lowering warehouse compute costs by 25%. Automated ingestion and transformation workflows using Python and dbt, accelerating reporting delivery timelines by 35%. Authored and maintained detailed documentation for ELT pipelines,
Backend Developer Intern at SkyIT Services
May 1, 2024 - August 1, 2024
Built Python-based data ingestion and transformation services supporting cloud analytics workloads, increasing pipeline throughput by 50%. Assisted in orchestrating scheduled data workflows and automated jobs, reducing manual operational effort by 40%. Developed SQL-driven transformations and supported cloud warehouse queries, accelerating BI report and ad-hoc analysis delivery. Deployed containerized data services on Kubernetes, improving deployment consistency and reducing data pipeline issues by 40%. Implemented IAM and RBAC controls to secure data access, improving compliance and reducing unauthorized access incidents by 40%.
Software Developer at Tata Consultancy Services
August 1, 2021 - August 1, 2023
Developed Python-based services supporting enterprise data workflows and reporting platforms used by 10K+ active users. Led migration of 5TB+ of legacy datasets into AWS-based data lake and warehouse environments, improving data accessibility and analytics performance by 45%. Built and optimized complex SQL queries and backend integrations, automating 80% of reporting workflows and reducing processing time by 60%. Implemented CI/CD pipelines for data and backend services using Jenkins and Git, reducing release errors by 70%. Partnered with cross-functional engineering and analytics teams to improve data availability, system reliability, and production support operations.

Education

Master in Computer Science at University of Alabama
August 1, 2023 - April 1, 2025
Bachelor in Computer Science at Jawaharlal Nehru University
July 1, 2017 - July 1, 2021
Master in Computer Science at University of Alabama at Birmingham
August 1, 2023 - May 1, 2025
Bachelor in Computer Science at Jawaharlal Nehru University
August 1, 2017 - May 1, 2021

Qualifications

AWS Certified Developer - Associate
January 11, 2030 - November 27, 2025
Certified Kubernetes Administrator (CKA)
January 11, 2030 - November 27, 2025
Oracle Certified Professional Java Programmer (OCPJP)
January 11, 2030 - November 27, 2025
ServiceNow Application Developer
January 11, 2030 - November 27, 2025
Google Cloud Platform (GCP) Certifications
January 11, 2030 - November 27, 2025
AWS Certified Developer – Associate
January 11, 2030 - January 7, 2026
Certified Kubernetes Administrator (CKA)
January 11, 2030 - January 7, 2026
Certified ServiceNow Application Developer
January 11, 2030 - January 7, 2026
Oracle Certified Java Programmer (OCPJP)
January 11, 2030 - January 7, 2026

Industry Experience

Computers & Electronics, Software & Internet, Professional Services, Media & Entertainment, Education