I am a results-driven Senior Data Engineer with around 10 years of experience in designing, building, and deploying production-grade data platforms, AI/ML systems, and Generative AI infrastructure across AWS and Azure. I specialize in end-to-end data engineering, lakehouse architecture, feature engineering, MLOps, and RAG systems, with hands-on expertise in LLM fine-tuning, vector databases, and scalable data ingestion. I translate business needs into robust data solutions that power analytics, model training, and enterprise AI at scale. As a collaborative leader, I mentor engineers and work closely with data science, governance, and business teams to deliver trusted AI capabilities in regulated industries such as banking, insurance, and healthcare. I’m focused on productionizing AI/ML platforms with strong governance, security, cost optimization, and observability to enable faster, safer decision-making.

Koushik RDY

Available to hire

Experience Level

Expert

Language

English
Fluent

Work Experience

Senior Data Engineer at Fifth Third Bank (Remote)
June 1, 2024 - Present
Leading development of the bank's Generative AI platform and MLOps infrastructure on AWS. Architected and deployed a production-grade RAG system using LangChain, Amazon Bedrock (Claude 2), and Pinecone for AI-powered customer support and internal knowledge management across structured and unstructured data. Built end-to-end MLOps pipelines with MLflow for experiment tracking, model versioning, and registry; automated training and inference workflows on EKS/Kubernetes via Airflow DAGs and GitHub Actions CI/CD. Implemented secure embedding pipelines with recursive chunking, versioned embeddings, and governance-backed re-indexing workflows to balance accuracy, latency, and cost. Developed a Snowflake-based observability platform tracking prompts, retrieval hits, latency, token usage, feedback, and knowledge content analytics. Built Text-to-SQL agents and REST inference endpoints integrated with event-driven orchestration to enable AI-driven analytics for 100+ analysts.
Senior Data Engineer at Nationwide Insurance (Remote)
March 1, 2022 - May 1, 2024
Architected the migration of Nationwide's on-prem Hadoop-based data warehouse to a modern AI/ML platform on Microsoft Azure. Led a GenAI POC for claims operations using LangChain and Azure OpenAI, with PII controls and row-level security to comply with regulatory standards. Modernized enterprise ML analytics by migrating legacy Hadoop/MapReduce/Hive workloads to Azure Databricks, Snowflake, and a lakehouse architecture, achieving performance improvements of up to 5x. Designed scalable cloud data platforms using ADLS Gen2, Databricks, Azure Synapse, and Snowflake; built 50+ configurable ML feature ingestion pipelines via Azure Data Factory, Airflow, REST APIs, SFTP, Snowpipe, and CDC. Established governance with Atlan and implemented dbt-based modeling, feature stores, and reusable ML data products. Built MLOps pipelines with MLflow on EKS/Kubernetes, with CI/CD via GitHub Actions; implemented data quality controls using Great Expectations and Monte Carlo; delivered FinOps savings of over $500K annually. Impl
Data Engineer at Change Healthcare
November 1, 2019 - February 1, 2022
Developed and maintained 30+ AWS Glue ETL jobs (Python/PySpark) to build ML-ready data sets from claims, eligibility, enrollment, provider, and utilization domains. Built a Redshift data warehouse with optimized distribution keys, sort keys, WLM, materialized views, stored procedures, and UDFs; implemented lakehouse patterns with S3/Parquet/Delta Lake; established data zones (raw, staging, curated) on S3 with partitioning and schema evolution controls. Automated orchestration using Step Functions, Airflow, Lambda, SNS, and SQS, and built 50+ ML feature ingestion pipelines. Applied HIPAA-aligned security controls, managed data governance with Alation, and set up observability via CloudWatch/Monte Carlo. Created Tableau dashboards for claims, enrollment, and provider analytics; implemented Redshift Spectrum for historical data; ensured data quality and reconciliation with legacy systems; and supported regulatory reporting.
Data Analyst at WalkingTree Technologies
April 1, 2018 - October 1, 2019
Analyzed client data to deliver BI/analytics-ready data sets; built 50+ complex SQL queries and dashboards in Tableau/Power BI; automated ETL using Python; created end-to-end data pipelines for multi-source data (CRM, billing, product, marketing); performed EDA and cohort, churn, and segmentation analyses; built Customer 360 data sets; implemented data quality checks; contributed to data storytelling and executive-level insights; mentored junior analysts; supported UAT and production issues.
Data Analyst at iPivot
September 1, 2016 - March 1, 2018
Focused on extracting, analyzing, and reporting operational and sales data to support internal business strategy and data-driven decision-making across departments. Built interactive dashboards and KPI scorecards using Power BI, Tableau, and SSRS; automated reporting workflows; developed data models and dashboards to enable self-service analytics; delivered executive-level data storytelling and insights.

Education

Bachelor of Technology at Osmania University, Hyderabad, India
January 1, 2012 - January 1, 2016

Industry Experience

Financial Services, Healthcare, Professional Services