Yigang (James) Fu

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Work Experience

Sr. Data Engineer at Amira Learning
August 1, 2025 - August 1, 2025
Architect and develop new ETL pipelines and data warehouse for AI applications and customer report data processing and delivery for thousands of school districts and millions of students daily. Migrated daily ETL pipelines to AWS PySpark, improving performance from 16-18 hours to 2-3 hours. Design and build monitoring and alerting system for ETL pipeline health and key metrics tracking.
Software Engineer at Lyell Immunopharma
December 1, 2023 - December 1, 2023
Build cloud-based data processing infrastructure for event-driven real-time data processing and batch data processing. Design, develop, and deploy ETL pipelines for processing data for cross-functional teams of clinical, manufactural, and biological researches. Develop monitoring and alerting system for pipeline operation and data quality.
Sr. Data Engineer at Nauto
July 1, 2021 - July 1, 2021
Led data technology implementation of company data infrastructure processing fleet driving data for AI-powered fleet management services. Led migration of entire Java ETL pipelines to Databricks PySpark in AWS. Migrated entire data warehouse to Delta Lake. Led development of ETL pipelines for new products of manager insights performance analysis and reporting, driver behavior alert and coaching. Redesigned major ETL pipelines to improve both performances and reliabilities.
Sr. Data Engineer at Unity Technologies
March 1, 2019 - March 1, 2019
Lead development and deployment of ETL pipeline for redesign of Ad and recommendation systems which is company's largest dataset. Prepared data for customer facing Ads reporting system. Lead development of cross- teams data validation for monetization division. Development of machine learning decision engine pipeline.
Principal Software Engineer at Yume
January 1, 2017 - January 1, 2017
Designed and built big data infrastructure ecosystems for online video advertising on multiple devices. Created and maintained both in-house Cloudera Hadoop clusters and AWS EMR cluster for data science team. Architected and build end-to-end automated data workflows and ETL pipelines to create and manage multiple data sets. Architected, built, and launched new data models that provide intuitive ad-hoc analytics and reporting for product and business teams. Led design, development, testing, and deployment of data algorithm and processing for multiple big data projects such as video advertising inventory forecasting, targeting, cross-device, and fraud detection. Worked closely with data scientists to implement data science algorithms.
Staff Data Engineer at Chegg
July 1, 2013 - July 1, 2013
Designed and built robust, configurable, high-performance, and fault-tolerant big data ETL platform for automating data workflows and processing of heterogeneous data sources for Enterprise Data Warehouse. Migrated legacy data processing system to the new ETL infrastructure framework to improve SLA and performance. Architected data warehouse system integrating the new technologies. Led migration of big data analytics platform from Aster to Redshift. Built BI/Analytics/Reporting applications for multiple business lines; integrated fraud detection with real-time e-commerce data processing.
Staff Platform/Data Engineer at Digital Chocolate
February 1, 2011 - February 1, 2011
Created and administered Vertica database cluster (10+ TB on Amazon AWS) for BI analytics of online game data. Designed database schemas and complex queries for reporting and anomaly detection. Developed ETL processors. Maintained Ruby on Rails web servers; migrated Event Tracking server from Ruby/MySQL to Java/MongoDB. Led data pipeline design across ads and gaming platforms; built fraud detection and ad tracking systems.
Tech Leader at TeleNav
March 1, 2007 - March 1, 2007
Led engineering teams to design, develop, deploy, and maintain web/WAP services, customer services, user management, content management, real-time traffic GPS map services, and internationalization. Led redesign and development of online and mobile e-commerce shopping, billing, payment, shipping, and inventory management systems. Increased online store revenue by 400% over one year. Drove quality improvement for software development, testing, and deployment. Fixed historic code issues by using Aspect-Oriented Programming paradigm.
Contract Software Engineer at eBay
July 1, 2003 - July 1, 2003
Managed two releases (trains), and software change management for entire eBay international operations. Analyzed load, performance, and memory usages for eBay International systems. Managed code merges from and to multiple concurrent branches, and submitted daily QA builds. Diagnosed, triaged, assigned, fixed or suggested fixes, and monitored software bugs in both C++ and Java components for entire international systems. Diagnosed and rectified a critical, long-standing bug, enhanced overall system reliability. Designed and developed J2EE based eBay calendar reminder feature for eBay sites.
Technical Lead/Sr. Software Engineer at Loudcloud/Opsware
January 1, 2003 - January 1, 2003
Led software team of 2-6 developers. Designed, developed, and deployed highly publicized J2EE-based Loudcloud corporation Internet systems. Developed tools to customize and integrate Interwoven content management system. Full life-cycle product design and development of Loudcloud J2EE-based eServices Directory system. Full life-cycle design and development of monitoring system for myLoudcloud portal. Designed document center and security system. Designed Oracle database schemas for the projects.

Education

Ph.D. Chemistry (Crystallography) at University of Kentucky
January 11, 2030 - November 21, 2025
B.S. Chemical Engineering at Nanjing University of Technology, PR China
January 11, 2030 - November 21, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Healthcare, Education, Gaming, Media & Entertainment