Saify KS

Available to hire

Hi, I’m Saify KS, a Senior Data Engineer with 9+ years of experience building scalable data platforms across fintech, health tech, and SaaS domains. I design and implement ETL/ELT pipelines, real-time streaming, and big data architectures on AWS, Azure, and GCP, continually optimizing for reliability and business value.

I’m passionate about data governance, security, and cross-functional collaboration. I build self-service, ML-ready data platforms that accelerate innovation, enable predictive analytics, and empower decision-making for product and business stakeholders.

Experience Level

Expert (14 skills listed)
Intermediate (1 skill listed)

Language

English
Fluent

Work Experience

Lead Data Engineer at Etleap
August 1, 2022 - November 24, 2025
Architected and scaled an enterprise-grade AWS Lakehouse (S3, Glue, Redshift, EMR), enabling 100+ TB/day ingestion and improving query performance for 1,000+ users. Developed real-time streaming pipelines with Kafka + Spark, reducing event processing latency from 15s to 3s and powering mission-critical analytics and ML workflows. Migrated 200+ legacy ETL jobs to dbt + Airflow, cutting infrastructure costs by 30% while increasing reliability by 60%. Built self-service data pipelines that let analysts ingest and transform data with minimal engineering support. Partnered with product and data science teams to deliver ML-ready features, and implemented data governance using Collibra and Great Expectations to ensure data quality and compliance.
Senior Data Engineer at RTS Labs
November 1, 2018 - July 1, 2022
Built ETL/ELT pipelines for clinical, claims, and patient data, ensuring HIPAA, HL7, and FHIR compliance. Delivered Snowflake & Azure Synapse Lakehouse solutions enabling predictive health analytics and BI dashboards. Developed real-time Kafka + Spark pipelines for clinical and financial data, reducing clinical alert latency and supporting patient monitoring systems. Implemented monitoring with Prometheus, Grafana, ELK Stack, and OpenTelemetry. Migrated 200+ legacy ETL jobs to dbt + Airflow and built self-service data pipelines. Partnered with product and data science teams to deliver ML-ready features for fraud detection and customer analytics. Implemented data governance frameworks and automated data quality and lineage reporting, improving compliance and data accuracy. Led cross-functional teams across engineering, data science, and business stakeholders to improve BI capabilities.
Data Engineer at CareRev
March 1, 2016 - November 1, 2018
Built Python + SQL ETL pipelines to ingest and integrate data from SaaS applications, CRM, and operational systems for analytics. Designed optimized data models (Star Schema, 3NF), improving dashboard query speeds by up to 10x. Automated reporting workflows with Airflow + Python, reducing manual reporting workload by 75%. Supported migration from SQL Server to AWS Redshift, laying the foundation for cloud-native analytics. Collaborated with clinicians, product managers, data scientists, and business stakeholders to develop self-service BI dashboards for faster decision-making. Implemented monitoring and observability to ensure data quality.

Education

Bachelor of Science in Information Systems at New Jersey Institute of Technology (NJIT)

Qualifications

Bachelor's Degree in Computer Science
HIPAA Compliance
HL7 & FHIR Integration Training
PCI-DSS Compliance Awareness
SOX Compliance Awareness

Industry Experience

Healthcare, Financial Services, Software & Internet, Life Sciences, Professional Services, Other