I am a Senior Cloud Data Engineer with 9+ years of industry experience delivering large-scale data engineering solutions across AWS, Azure, and GCP. I specialize in ETL/ELT pipelines, CDC processing, metadata-driven ingestion frameworks, data modeling, data warehousing, and real-time streaming architectures. I design, build, and optimize production-grade data platforms, automate CI/CD pipelines, enforce data quality and security standards, and collaborate with cross-functional stakeholders in financial services, SaaS, and analytics-driven environments.

Anuj Singh

I am a Senior Cloud Data Engineer with 9+ years of industry experience delivering large-scale data engineering solutions across AWS, Azure, and GCP. I specialize in ETL/ELT pipelines, CDC processing, metadata-driven ingestion frameworks, data modeling, data warehousing, and real-time streaming architectures. I design, build, and optimize production-grade data platforms, automate CI/CD pipelines, enforce data quality and security standards, and collaborate with cross-functional stakeholders in financial services, SaaS, and analytics-driven environments.

Available to hire

I am a Senior Cloud Data Engineer with 9+ years of industry experience delivering large-scale data engineering solutions across AWS, Azure, and GCP. I specialize in ETL/ELT pipelines, CDC processing, metadata-driven ingestion frameworks, data modeling, data warehousing, and real-time streaming architectures.

I design, build, and optimize production-grade data platforms, automate CI/CD pipelines, enforce data quality and security standards, and collaborate with cross-functional stakeholders in financial services, SaaS, and analytics-driven environments.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more

Work Experience

Senior Data Engineer (Contract) at DataArt
September 1, 2024 - Present
Architected and delivered a reusable Python SDK for standardized ETL pipelines, reducing development time by 35% across multiple teams. Designed and implemented metadata-driven ETL pipelines using JSON configurations, supporting mappings, transformations, and data quality rules. Deployed cloud infrastructure and data pipelines across Dev, QA, and UAT using Terraform and Jenkins-based CI/CD pipelines. Implemented event-driven orchestration using AWS Step Functions, Glue, and Lambda, improving pipeline reliability and auditability. Built automated end-to-end testing frameworks simulating production workloads, reducing post-release defects by 30%.
Senior Data & AI Engineer at Arvato
May 1, 2024 - August 1, 2024
Led development of an enterprise LLM-powered chatbot using OpenAI, LangChain, and LiteLLM, serving 500+ internal users. Implemented vector embeddings and semantic search, improving response relevance by 40%. Designed RESTful APIs with Flask, enabling secure project-level model configuration via Azure Key Vault. Integrated GPT-4 multimodal capabilities to generate structured metadata from documents and images.
Senior Data Consultant at Deloitte
December 1, 2022 - March 1, 2024
Migrated large-scale banking datasets from on-prem systems to Azure Data Lake, reducing data processing latency by 30%.

Education

Add your educational history here.

Qualifications

Add your qualifications or awards here.

Industry Experience

Financial Services, Software & Internet, Professional Services