I'm a Python-centric Data Engineer with hands-on experience delivering production-grade ETL pipelines. I design and ship end-to-end data workflows using Python, PostgreSQL/Supabase, and NLP enrichment to turn raw data into clean, query-ready insights. Currently, I lead a cross-functional engineering team building a multi-stage ETL pipeline that processes thousands of SME records daily—from raw scraping and NLP signal extraction to data quality checks and a scored data layer that powers real-time dashboards and downstream systems. I prioritize data integrity, pipeline observability, and clean schema design to enable reliable, scalable data delivery.

Salahuddin Ansari

I'm a Python-centric Data Engineer with hands-on experience delivering production-grade ETL pipelines. I design and ship end-to-end data workflows using Python, PostgreSQL/Supabase, and NLP enrichment to turn raw data into clean, query-ready insights. Currently, I lead a cross-functional engineering team building a multi-stage ETL pipeline that processes thousands of SME records daily—from raw scraping and NLP signal extraction to data quality checks and a scored data layer that powers real-time dashboards and downstream systems. I prioritize data integrity, pipeline observability, and clean schema design to enable reliable, scalable data delivery.

Available to hire

I’m a Python-centric Data Engineer with hands-on experience delivering production-grade ETL pipelines. I design and ship end-to-end data workflows using Python, PostgreSQL/Supabase, and NLP enrichment to turn raw data into clean, query-ready insights.

Currently, I lead a cross-functional engineering team building a multi-stage ETL pipeline that processes thousands of SME records daily—from raw scraping and NLP signal extraction to data quality checks and a scored data layer that powers real-time dashboards and downstream systems. I prioritize data integrity, pipeline observability, and clean schema design to enable reliable, scalable data delivery.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

AI Automation Engineer at GarunaCDX
March 1, 2026 - Present
Led a 7-person team building ProcessForge — an automated SME Opportunity Detection System: web scraping (Naukri/LinkedIn), NLP pain-signal extraction, 0–100 company scoring, and AI-powered outreach generation; implemented data integrity checkpoints and automated enrichment workflows; delivered real-time MIS dashboards enabling signal-triggered outreach with minimal manual intervention.
Founder's Associate — Automation & Operations Lead at Printifile Innovations Pvt Ltd
August 1, 2025 - Present
Built end-to-end candidate routing for RemoteWorks India: LinkedIn scraping → Google Forms intake → Google Sheets master tracker → Zapier email automation → Telegram broadcasting; developed B2B cost-quoting tooling and live dashboards for Uber vendor operations; improved efficiency and reduced overhead.
Logistics & Operations Coordinator at Event & Brand Activation Operations (High-Profile Events)
January 1, 2024 - December 31, 2026
Contract role handling backstage logistics, credentialing, and volunteer coordination for Coldplay and Lollapalooza; established real-time access-control and credentialing workflows; managed multi-system coordination for international brand activations.
Python Developer & Team Lead (Data Science Intern) at GarunaCDX
March 1, 2026 - Present
Directing a 7-person cross-functional team building ProcessForge — a fully automated Python + Supabase platform for SME opportunity detection. Includes web scraping, NLP extraction, multi-stage data enrichment, company scoring (0–100), and AI-generated outreach. Implemented async, event-driven pipeline stages with clean data handoffs and stage-gate integrity checks. Managed Supabase schema design, API key configuration, Python 3 compatibility, and repository structure, owning the full infra footprint. Delivered real-time MIS dashboards reporting live scoring data to operational users.
Founder’s Associate - Developer & Infrastructure Lead at Printifile Innovations Pvt Ltd
August 1, 2025 - Present
Automated Quoting System (Python/JS) reducing material waste by 20% and eliminating manual pricing errors. Multi-system operational coordination for Uber Waiting Lounge at T2 Mumbai Airport, including procurement, vendor management, live dashboards, and pan-India stakeholder reporting. Automated candidate pipeline (LinkedIn scraping → Google Forms → Sheets → Zapier → Telegram) routing 150+ candidates with 40% overhead reduction. Directed expansions across 5 regional markets with data-driven procurement tracking achieving 12% overhead reduction.
Frontend & Platform Developer at Printifile Innovations
April 1, 2025 - July 1, 2025
Custom Web Application Development: built two production web applications using HTML5, CSS3, and JavaScript — deployed on Netlify with responsive UI, SEO optimization, analytics, and conversion-focused UX. E-commerce optimization: storefront SEO, analytics, and checkout flow improvements; experience applicable to Shopify and headless e-commerce.
Logistics & Systems Coordinator at High-Profile Events (Contract)
January 1, 2024 - December 31, 2026
Real-time multi-system management for large-scale events: credentials, access control, and scheduling for Coldplay and Lollapalooza; coordinated 35+ staff with zero workflow failures, demonstrating reliability under production conditions.
Python Data Engineer at GarunaCDX
March 1, 2026 - Present
Leading a 7-person team building ProcessForge, a multi-stage Python + Supabase ETL pipeline: raw scraping of Naukri/LinkedIn → NLP signal extraction & entity tagging → data quality checks & deduplication → 0–100 company scoring → load of scored records into PostgreSQL with a real-time dashboard. Implemented schema-level integrity rules and stage-gate validations; designed modular, independently schedulable pipeline stages; built real-time dashboards for live querying by downstream systems.
Founder's Associate (Data & Operations Lead) at Printifile Innovations Pvt Ltd
August 1, 2025 - Present
Engineered production Python automation tool for commercial print layout calculations and a data pipeline for candidate sourcing: LinkedIn scraping → Google Sheets ingestion → Zapier ETL → scheduled email dispatch; achieved 20% material waste reduction and 40% recruiter overhead reduction. Maintained live reporting pipelines and dashboards for Pan-India Uber stakeholders; managed procurement data and vendor records to support decision-making. Oversaw supply chain data and cost optimization across markets.
Logistics Coordinator at High-Profile Events (Freelance)
January 1, 2024 - January 1, 2026
Real-time credentialing and logistics data management for large-scale events under pressure, tracking 35+ staff assignments, access levels, and schedule changes. Demonstrated data discipline and operational precision essential to data engineering in high-stakes environments.

Education

BSc Computer Science at Pillai College of Arts, Commerce and Science
January 11, 2030 - January 1, 2027
BSc Computer Science at Pillai College of Arts, Commerce and Science
January 11, 2030 - January 1, 2027
BSc Computer Science at Pillai College of Arts, Commerce and Science
January 11, 2030 - January 1, 2027

Qualifications

Python for Data Science IBM
January 11, 2030 - June 16, 2026
Tableau Desktop Specialist
January 11, 2030 - January 1, 2025
SQL & DB Basics SoloLearn
January 11, 2030 - June 16, 2026
Web Dev Fundamentals Great Learning
January 11, 2030 - June 16, 2026
Infosys Springboard CodeSummit 2024 Offline Qualifier
January 11, 2030 - January 1, 2024
Python for Data Science
January 11, 2030 - June 16, 2026
Tableau Desktop Specialist
January 1, 2025 - June 16, 2026
SQL & DB Basics
January 11, 2030 - June 16, 2026
Code Summit 2024 Offline Qualifier
January 1, 2024 - June 16, 2026
Python for Data Science
January 11, 2030 - June 16, 2026
Tableau Desktop Specialist
January 1, 2025 - June 16, 2026
SQL & DB Basics
January 11, 2030 - June 16, 2026
Java Core
January 11, 2030 - June 16, 2026
CodeSummit 2024 Offline Qualifier
January 1, 2024 - June 16, 2026

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Education, Retail