Senior AI Data Engineer with 10+ years of experience architecting distributed data systems for blockchain marketplaces, high-growth SaaS analytics platforms, and cloud-native AI infrastructures. Expert in Spark, Kafka, Snowflake, Databricks, and ML feature engineering, with proven success scaling real-time event platforms, tokenized asset ecosystems, and multi-tenant SaaS data architectures.

David Johnson

Senior AI Data Engineer with 10+ years of experience architecting distributed data systems for blockchain marketplaces, high-growth SaaS analytics platforms, and cloud-native AI infrastructures. Expert in Spark, Kafka, Snowflake, Databricks, and ML feature engineering, with proven success scaling real-time event platforms, tokenized asset ecosystems, and multi-tenant SaaS data architectures.

Available to hire

Senior AI Data Engineer with 10+ years of experience architecting distributed data systems for blockchain marketplaces, high-growth SaaS analytics platforms, and cloud-native AI infrastructures. Expert in Spark, Kafka, Snowflake, Databricks, and ML feature engineering, with proven success scaling real-time event platforms, tokenized asset ecosystems, and multi-tenant SaaS data architectures.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

Senior AI Data Engineer at Courtyard.io
May 1, 2022 - Present
Architected the core data backbone powering a YCW22 tokenized collectibles marketplace, engineering blockchain-synchronized pipelines supporting on-chain trading of graded cards vaulted with third-party security partners. Designed real-time ingestion frameworks for lab vending machines, curated mystery Pack Drops, and 90% buyback pricing systems, enabling sub-minute visibility into transaction liquidity, redemption events, and secondary market activity. Built hybrid Web2/Web3 reconciliation pipelines integrating blockchain event logs, custodial vault inventories, and fiat payment processors, ensuring accurate token-to-physical asset mapping across thousands of vaulted collectables.
Senior Data Engineer at Mixpanel
March 1, 2019 - April 1, 2022
Scaled distributed event ingestion systems processing billions of behavioral events daily, powering product analytics features including retention analysis, funnel breakdowns, cohort segmentation, and real-time dashboards. Engineered low-latency event modeling frameworks enabling dynamic user segmentation across web and mobile product telemetry streams.
Data Engineer at Druva
July 1, 2016 - February 1, 2019
Designed cloud-native data pipelines powering SaaS backup and data protection platforms including Phoenix, InSync, and CloudRanger. Built telemetry ingestion systems capturing backup performance metrics, recovery SLAs, and cloud-workload protection signals across AWS and hybrid enterprise environments. Built compliant data marts supporting enterprise audit reporting, retention policy validation, and cloud-workload risk scoring.

Education

Master of Science / Computer Science at California State University, East Bay
August 1, 2010 - May 1, 2016

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Computers & Electronics, Media & Entertainment