Andrew Tran

Available to hire

I am a Senior Data Engineer with 7+ years of experience designing, building, and operating scalable data platforms and pipelines in production environments. I specialize in cloud-based lakehouse and streaming architectures, large-scale data processing, and distributed systems.

I have strong hands-on experience with SQL, NoSQL, Python, and Java, along with big data technologies such as Spark, Kafka, and Hadoop. I also have practical experience building AI-powered data solutions, including RAG pipelines, LangChain, and vector databases to enable semantic search, analytics, and data discovery.

Experience Level

Expert

Language

English
Advanced

Work Experience

Senior Data Engineer at Flodesk
July 1, 2024 - Present
Designed a scalable data platform that handles high-throughput streaming and batch data; implemented complex, cost-effective data pipelines that adapt to changing business requirements; led data platform workstreams with stakeholders; implemented a RAG pipeline, Iceberg on AWS, and lakehouse strategies.
Senior Data Engineer at IAG Insurance Company
February 1, 2023 - July 1, 2024
Migrated an on-prem Greenplum data platform to Google Cloud, modernizing legacy batch pipelines into a cloud-native ELT architecture while supporting insurance analytics, financial reporting, and BAU operations.
Senior Data Engineer at FPT Software
May 1, 2022 - February 1, 2023
Built a lakehouse platform on Databricks to ingest and process high-volume e-invoice events from messaging systems, producing trusted datasets for analytics and operational reporting.
Senior Data Engineer at Car Rental Company
February 1, 2021 - April 1, 2022
Designed and implemented a data warehouse, data pipeline, and data mart to support analytics and BI; built ELT with Fivetran and dbt; modeled data with a star schema; enabled analytics in Looker.
Data Engineer at MobiFone IT Center
January 1, 2020 - February 1, 2021
CDR Search: planned and built a big data architecture and pipelines to process up to 30M users' call detail records over two years using Hadoop, Spark, and Airflow.
Developer at MobiFone IT Center
May 1, 2019 - June 1, 2020
AIOT Platform: developed and deployed an IoT platform; processed device data in InfluxDB; used Kafka and KSQL for real-time anomaly detection; containerized services with Docker and Kubernetes.
Developer at MobiFone IT Center
February 1, 2018 - May 1, 2019
Identifying phone numbers on the internet in real time: built a real-time system for high-throughput ingestion and lookup for 30M subscribers; used Kafka, Java/Golang, HBase/MySQL, and HAProxy.

Education

Information Technology Program: Computer Science at Post and Telecommunications Institute of Technology (PTIT)

Qualifications

Databricks Certified Associate Developer for Apache Spark 3.0
Databricks Certified Data Engineer Associate

Industry Experience

Software & Internet, Telecommunications, Professional Services, Media & Entertainment