I am a results-driven data engineer and business analyst with 6+ years delivering cloud-native data solutions and actionable insights across AWS, Azure, and GCP. I design scalable ETL/ELT pipelines and real-time streaming workflows using PySpark, Airflow, Databricks, Kafka, and Flink to translate complex data into meaningful analytics for clinical, operational, and financial stakeholders. I'm proficient in SQL, Python, DBT, and Snowflake, with a track record of modernizing legacy ETL, supporting ML/NLP workflows (SageMaker, LangChain), and implementing HIPAA/GDPR-compliant governance. I build executive dashboards with Power BI, Tableau, and Looker, collaborate in Agile teams to align data products with business goals, and mentor teams to raise delivery and quality standards.

Rajita Madhikarmimadhikarmi

I am a results-driven data engineer and business analyst with 6+ years delivering cloud-native data solutions and actionable insights across AWS, Azure, and GCP. I design scalable ETL/ELT pipelines and real-time streaming workflows using PySpark, Airflow, Databricks, Kafka, and Flink to translate complex data into meaningful analytics for clinical, operational, and financial stakeholders. I'm proficient in SQL, Python, DBT, and Snowflake, with a track record of modernizing legacy ETL, supporting ML/NLP workflows (SageMaker, LangChain), and implementing HIPAA/GDPR-compliant governance. I build executive dashboards with Power BI, Tableau, and Looker, collaborate in Agile teams to align data products with business goals, and mentor teams to raise delivery and quality standards.

Available to hire

I am a results-driven data engineer and business analyst with 6+ years delivering cloud-native data solutions and actionable insights across AWS, Azure, and GCP. I design scalable ETL/ELT pipelines and real-time streaming workflows using PySpark, Airflow, Databricks, Kafka, and Flink to translate complex data into meaningful analytics for clinical, operational, and financial stakeholders.

I’m proficient in SQL, Python, DBT, and Snowflake, with a track record of modernizing legacy ETL, supporting ML/NLP workflows (SageMaker, LangChain), and implementing HIPAA/GDPR-compliant governance. I build executive dashboards with Power BI, Tableau, and Looker, collaborate in Agile teams to align data products with business goals, and mentor teams to raise delivery and quality standards.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
See more

Language

English
Fluent

Work Experience

Data Engineer at Johnson & Johnson
September 1, 2023 - November 5, 2025
Designed and managed scalable ETL pipelines in Azure Data Factory and Databricks, processing 5TB+ of healthcare data daily across Medicare, Medicaid, and commercial providers. Applied Medallion architecture (bronze/silver/gold) within Azure Databricks to improve data quality, reusability, and analytics consistency. Built and optimized Postgres and MySQL integrations into Azure Synapse and Azure Data Lake, ensuring data integrity and faster query performance. Developed event-driven pipelines with AWS Glue, Step Functions, and ECS for real-time ingestion and analytics for clinical and operational dashboards. Implemented observability practices with CloudWatch, Grafana, and OpenTelemetry, reducing pipeline downtime by 40% and improving SLA adherence. Containerized and deployed API microservices in Docker and Kubernetes, supporting secure, scalable data access across distributed healthcare systems. Created automated testing frameworks for pipelines and APIs using Python and CI/CD tools, re
Business Analyst / Data Engineer at Diverge Health
August 1, 2023 - August 1, 2023
Designed and maintained scalable ELT pipelines with Python, dbt, and Snowflake, converting complex healthcare datasets into analytics-ready models. Built and optimized dbt transformations for patient segmentation, readmission risk, and cost stratification, directly improving predictive analytics for care providers. Integrated Postgres and MySQL data sources into centralized Snowflake models, ensuring referential integrity and optimized query execution. Orchestrated event-driven pipelines using AWS Glue, Step Functions, and Dagster for near real-time reporting. Implemented observability practices with AWS CloudWatch and pipeline dashboards, cutting SLA breaches by 30%. Developed reverse ETL workflows with Hightouch to sync Snowflake insights into Salesforce and marketing platforms, improving patient outreach by 30%. Created dashboards in Tableau, Power BI, and Sigma for self-service access by clinicians and payers. Automated testing frameworks with Great Expectations and dbt tests, main
Business Analyst – Data & Trading Analytics at Citadel
June 1, 2021 - June 1, 2021
Designed and deployed scalable ETL/ELT pipelines on AWS Glue and Step Functions, reducing data ingestion latency for market datasets by 35%. Engineered, tuned, and maintained Postgres and MySQL databases, applying indexing, partitioning, and replication strategies to improve query performance by 40%. Developed and containerized microservices with AWS ECS and Python, integrating market feeds and risk data into trading and analytics applications. Partnered with front-end teams to integrate REST and GraphQL APIs into dashboards, enabling traders to access real-time risk and pricing insights. Implemented observability frameworks with AWS CloudWatch and dashboards to improve monitoring and reduce incident MTTR by 25%. Built automated testing frameworks for pipelines and APIs, embedding validation into CI/CD pipelines with GitHub Actions. Designed data models to support AI/LLM-powered applications, ensuring compatibility with embeddings, vector databases, and NLP preprocessing requirements.

Education

Masters of Science in Business Analytics at Clark University
January 11, 2030 - November 5, 2025

Qualifications

Azure Data Engineer Associate
January 11, 2030 - November 5, 2025
AWS Certified Data Engineer - Associate
January 11, 2030 - November 5, 2025

Industry Experience

Healthcare, Financial Services, Professional Services, Software & Internet, Life Sciences