I am a data engineering professional with over 10 years of experience designing, building, and maintaining data platforms across AWS, GCP, and Azure. I specialize in Gen AI data pipelines, multi-tenant data applications, and production-grade data workflows that power analytics, ML models, and intelligent assistants. I am currently based in Albany, NY, and I partner with ML engineers, data scientists, and business stakeholders to deliver scalable solutions, automate data pipelines with CI/CD, and implement Retrieval-Augmented Generation using tools like Bedrock, SageMaker JumpStart, and Vertex AI. I thrive on solving complex data challenges and turning data into actionable insights.

Sandhya Nunemunthala

Available to hire

Experience Level

Expert

Language

English
Fluent

Work Experience

Senior Data Engineer at JP Morgan Chase & Co
September 1, 2021 - Present
Led the development of a Data Lake and Data Warehouse for Analytics; designed and built a machine learning platform and data pipelines to validate ML models; automated daily ETL dashboards to monitor model performance; created an analytical feature repository enabling ML engineers and data scientists to reuse features, reducing feature recreation time.
Data Engineer at RCG
August 1, 2021 - October 15, 2025
Enabled data sources (Data Lake and Data Warehouse) for analytics; designed, developed, and deployed a Data Lake to store real-time and batch data from applications and databases using Confluent Kafka and AWS S3; built a Redshift-based Data Warehouse; implemented batch pipelines using SNS/Lambda/Step Functions/Glue/EMR; created self-service pipelines for internal data teams and visualization using QlikView, Kibana, and Tableau.
Data Engineer at S&P Global
December 1, 2017 - October 15, 2025
Designed and deployed Data Lake, Data Mart, and Data Warehouse on Google Cloud Platform; built batch and real-time data pipelines using Dataflow, Pub/Sub, and Cloud Composer; migrated on-prem Hadoop workloads to Google Cloud; created real-time consumers to read from Kafka and load into Cloud Storage; developed streaming pipelines using Spark on Dataproc; built DAGs in Airflow.
Data Analyst at IBM
December 1, 2015 - October 15, 2025
Participated in analysis, design, development, testing, and implementation of financial systems using Oracle, Developer and PL/SQL; created external table scripts; wrote UNIX shell scripts; developed packages, triggers, and PL/SQL modules; collaborated with SMEs to translate business rules into code; contributed to a proof of concept (POC) on image classification and ML-related tasks.
Senior Data Engineer at JP Morgan Chase & Co
September 1, 2021 - November 7, 2025
Led design and implementation of data lake and data warehouse for analytics, enabling ML feature engineering and model validation dashboards. Built and maintained end-to-end Gen AI pipelines and retrieval-augmented generation (RAG) workflows, including semantic search with Kendra and Bedrock-hosted LLMs. Created vector stores and embedding pipelines using SageMaker + FAISS to support efficient semantic search across structured and unstructured data. Automated daily ETL with Apache Airflow and Crontab; deployed infrastructure with Terraform; collaborated with ML engineers and data scientists to deliver AI-powered chatbots and context-aware workflows.
Data Engineer at RCG
August 1, 2021 - August 1, 2021
Designed and deployed a real-time and batch data platform, creating a Data Lake with Confluent Kafka and AWS S3, and a Data Warehouse on Redshift. Built self-service pipelines using SNS, Lambda, Glue, EMR, Athena, and SageMaker, enabling rapid data readiness for ML workloads. Implemented secure model access with IAM, API Gateway, and CloudWatch; containerized inference endpoints with ECS/Fargate; orchestrated batch jobs via Step Functions and Airflow.
Data Engineer at S&P Global
December 1, 2017 - December 1, 2017
Designed, developed, and deployed Data Lake, Data Mart, and Data Warehouse on Google Cloud Platform (GCP). Implemented batch and real-time data pipelines using Dataflow, Pub/Sub, Dataproc, and Airflow, migrating legacy Hadoop ecosystems to GCP. Built streaming pipelines to ingest data from Kafka/Confluent and IoT sources; developed DAGs and plugins for Airflow; enabled analytics and reporting with BigQuery.
Data Analyst at IBM
December 1, 2015 - December 1, 2015
Participated in analysis, design, development, testing, and implementation of various financial systems using Oracle, Developer and PL/SQL. Created external table scripts for ETL, wrote UNIX shell scripts, and developed packages, procedures, and triggers to support business requirements; collaborated with SMEs to translate business rules into data processing logic.

Education

Master of Computer Applications at Osmania University
January 11, 2030 - June 1, 2014
Bachelor of Science (Mathematics, Physics, Chemistry) at Kakatiya University
January 11, 2030 - June 1, 2010

Qualifications

AWS Certified Solutions Architect - Professional
January 11, 2030 - October 15, 2025
Google Cloud Certified Professional Data Engineer
January 11, 2030 - October 15, 2025

Industry Experience

Financial Services, Software & Internet, Professional Services