I am a highly skilled Data Engineer with extensive experience in designing, building, and optimizing data pipelines, data warehouses, and data lakes across diverse industries such as finance, healthcare, and retail. I specialize in implementing end-to-end data solutions using modern cloud technologies like AWS, Azure, and Google Cloud Platform to handle large-scale, complex datasets efficiently. Throughout my career, I have developed expertise in ETL/ELT pipeline creation with Apache Spark, Databricks, AWS Glue, and more, ensuring data quality, performance, and reliability. I am proficient in programming languages including Python, SQL, Scala, and Java, and have strong experience with distributed data processing and orchestration tools. I enjoy solving complex data engineering challenges to enable data-driven decision making.

Bobby Ray

Available to hire

Experience Level

Expert

Work Experience

Senior Data Engineer at Intuit
July 1, 2024 - Present
Designed and deployed scalable ETL pipelines using AWS Glue to improve data processing efficiency for multi-terabyte datasets. Optimized Redshift data warehouse with partitioning and indexing to reduce query times by 30%. Developed real-time data streaming solutions with AWS Kinesis for low-latency financial transaction processing. Built interactive Tableau dashboards for insights into customer spending behavior. Automated cloud infrastructure with Terraform for scalability and resource optimization. Engineered distributed data processing workflows via AWS EMR achieving a 40% performance boost. Implemented Python-based data validation frameworks to ensure pipeline accuracy. Led API integrations for real-time data ingestion and orchestrated workflows with Apache Airflow. Ensured compliance with PCI-DSS and GDPR for secure data handling in financial applications.
Senior Data Engineer at Meta
June 30, 2024 - August 26, 2025
Designed and automated ETL pipelines using Azure Data Factory to streamline integration across cloud and on-premises systems. Developed and optimized data warehouses in Azure Synapse, improving query performance by 25%. Integrated Apache Spark and PySpark workflows in Databricks to increase processing efficiency. Automated CI/CD pipelines with Azure DevOps and containerized applications using Docker and Kubernetes for resilient deployments. Created interactive Power BI dashboards for real-time insights. Implemented role-based access control for secure data policies and led migration projects from legacy systems to Azure, reducing costs by 30%. Troubleshot pipelines and enhanced data accuracy using Python validation scripts.
Data Engineer at UnitedHealth Group
March 31, 2020 - August 26, 2025
Designed and implemented Spark Streaming for real-time and batch data processing, integrating Kafka with databases like Oracle, MySQL, and DB2 using Sqoop for advanced analysis. Enhanced ETL transformations using Python, Scala, and Spark SQL to improve data quality. Developed ETL frameworks with IBM DataStage and SSIS, streamlining integration and transformation workflows. Built ETL pipelines with SSIS packages and stored procedures while leveraging SSAS and SSRS for daily reporting. Managed large-scale ETL processes loading data to HDFS/S3 and executed structural modifications with MapReduce and Hive. Ensured data integrity through ACID-compliant ETL transactions and automated SQL script deployments via CI/CD pipelines. Led data movement strategies involving data lakes, Hadoop, and NoSQL databases such as HBase and Cassandra. Used PowerShell and Linux/Unix for system performance tuning and automation.

Education

Bachelor's in Computer Science at Western Washington University
January 1, 2011 - January 1, 2015

Industry Experience

Financial Services, Healthcare, Retail, Software & Internet
