Senior Data Engineer with hands-on experience building scalable data pipelines, cloud data platforms, and analytics-ready datasets. I specialize in turning raw data into reliable insights that support reporting, decision-making, and business growth.

Arlind Greba

Senior Data Engineer with hands-on experience building scalable data pipelines, cloud data platforms, and analytics-ready datasets. I specialize in turning raw data into reliable insights that support reporting, decision-making, and business growth.

Available to hire

Senior Data Engineer with hands-on experience building scalable data pipelines, cloud data platforms, and analytics-ready datasets. I specialize in turning raw data into reliable insights that support reporting, decision-making, and business growth.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Work Experience

Senior Data Engineer at Ernst & Young (EY)
January 1, 2024 - Present
Led enterprise data warehousing and lakehouse initiatives by building a SQL Server/SSIS-based warehouse, consolidating data from multiple sources, and delivering enterprise BI models in Power BI. Designed scalable Azure data pipelines (ADF, Databricks notebooks in Python/Scala, Synapse) to transform and load large datasets into a data warehouse. Created Data Lakes in Microsoft Fabric Lakehouse with bronze/silver/gold zones; processed data with Fabric Spark (PySpark/SQL) for real-time analytics; orchestrated ingestion via Fabric Pipelines. Optimized lakehouse performance; integrated data models with Power BI and applied Row-Level Security. Implemented data governance using Purview and AWS Glue Data Catalog. Automated CI/CD with Azure DevOps for ADF, Databricks notebooks, and Snowflake scripts.
ETL BI Engineer at Macy's, Inc
October 1, 2020 - December 31, 2023
Developed batch and real-time ETL pipelines in AWS Glue and Redshift for financial analytics, processing millions of records daily. Built scalable Data Lakes on Azure Data Lake Storage and AWS S3, using PySpark and Databricks to process large datasets. Designed dimensional models and star schemas in Snowflake and Redshift to support BI dashboards. Transformed data with Python (Pandas/NumPy) and automated DBT transforms with CI/CD. Enforced data governance and access controls in Snowflake with RLS. Created Power BI dashboards to support operations, sales, and finance; integrated cross-cloud data via GCP BigQuery.
BI Engineer at Sony Corp
June 1, 2018 - September 30, 2020
Created and deployed ETL workflows using SSIS to move data from transactional systems into a SQL Server Data Warehouse; built and maintained a data lake in AWS S3; worked with SSAS OLAP cubes for analytics; automated data validation tasks with Python and PowerShell; supported Tableau dashboards; conducted data profiling and cleansing; implemented data security measures including Row-Level Security (RLS) in Power BI and encryption for sensitive datasets.

Education

Add your educational history here.

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Media & Entertainment, Financial Services, Professional Services