I am a results-driven Senior Data Engineer with over 6 years of experience designing and optimizing data lakehouse and warehouse solutions for critical domains such as legal research, aviation compliance, ERP/CRM systems, and US government chemical taxation. I specialize in building scalable ETL/ELT pipelines, real-time streaming systems, and cloud-native data platforms that reduce costs, improve data quality, and accelerate decision-making. I have a proven track record of enabling diverse teams including attorneys, compliance officers, executives, and government agencies to work with accurate, secure, and audit-ready data. Throughout my career, I have collaborated closely with legal, compliance, and research teams to prepare AI-ready datasets and establish strong data governance frameworks that comply with industry standards in privacy and security. I am passionate about leveraging modern cloud technologies and AI/ML integration to build efficient data solutions that drive business insights and operational excellence.

Mohammad Al-Sarayreh

I am a results-driven Senior Data Engineer with over 6 years of experience designing and optimizing data lakehouse and warehouse solutions for critical domains such as legal research, aviation compliance, ERP/CRM systems, and US government chemical taxation. I specialize in building scalable ETL/ELT pipelines, real-time streaming systems, and cloud-native data platforms that reduce costs, improve data quality, and accelerate decision-making. I have a proven track record of enabling diverse teams including attorneys, compliance officers, executives, and government agencies to work with accurate, secure, and audit-ready data. Throughout my career, I have collaborated closely with legal, compliance, and research teams to prepare AI-ready datasets and establish strong data governance frameworks that comply with industry standards in privacy and security. I am passionate about leveraging modern cloud technologies and AI/ML integration to build efficient data solutions that drive business insights and operational excellence.

Available to hire

I am a results-driven Senior Data Engineer with over 6 years of experience designing and optimizing data lakehouse and warehouse solutions for critical domains such as legal research, aviation compliance, ERP/CRM systems, and US government chemical taxation. I specialize in building scalable ETL/ELT pipelines, real-time streaming systems, and cloud-native data platforms that reduce costs, improve data quality, and accelerate decision-making. I have a proven track record of enabling diverse teams including attorneys, compliance officers, executives, and government agencies to work with accurate, secure, and audit-ready data.

Throughout my career, I have collaborated closely with legal, compliance, and research teams to prepare AI-ready datasets and establish strong data governance frameworks that comply with industry standards in privacy and security. I am passionate about leveraging modern cloud technologies and AI/ML integration to build efficient data solutions that drive business insights and operational excellence.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Work Experience

Senior Data Engineer at LexisNexis
October 1, 2022 - Present
Designed and deployed ETL/ELT pipelines using Python, Apache Spark, and Apache Airflow processing over 100 TB of legal and regulatory data monthly. Built and maintained data lakehouse architecture with Silver layers and a Gold-layer data warehouse on Amazon Redshift for legal and compliance teams. Developed data acquisition workflows sourcing court documents and regulatory updates through web scraping, APIs, databases, and files. Collaborated with legal research and compliance teams to prepare AI-ready datasets, improving classification accuracy in legal case retrieval by reducing data preparation costs by 70%. Established data governance and validation frameworks to adhere to legal privacy and confidentiality standards and strengthened security through encryption, IAM controls, and audit monitoring.
Data Engineer at Capella Solutions
September 30, 2022 - August 26, 2025
Contributed to the development of AI agents for intelligent automation in industrial and government sectors. Led data engineering for Valence, a data warehouse tracking civil aircraft parts and maintenance workflows ensuring aviation safety compliance. Designed and implemented pipelines supporting US government chemical taxation with audit-ready analytics on chemical imports and exports. Automated multi-source data ingestion including APIs, databases, IoT data, and files, reducing manual reporting workloads by 50%. Optimized real-time voice and text pipelines for AI analytics supporting compliance and operational use cases. Built aviation and chemical taxation-specific data governance and compliance frameworks to ensure audit readiness and regulatory compliance.
Software Engineer at Capella Solutions
September 30, 2020 - August 26, 2025
Led ERP and CRM data migrations ensuring seamless transition with zero downtime. Developed SQL queries and aggregation pipelines powering internal dashboards for executives to track KPIs, sales, and operations in real-time. Optimized backend SQL queries reducing report execution times from 5s to 1s for internal and customer-facing applications. Created data cleaning and preprocessing frameworks to enhance analytics accuracy and decision-making. Scaled ingestion pipelines to handle growing transaction and customer data volumes ensuring system reliability during expansion. Collaborated with DevOps teams to implement CI/CD pipelines using Docker, Kubernetes, and Jenkins, cutting deployment cycles by 50% and accelerating release speed for critical business tools.
Senior Data Engineer at LexisNexis
October 1, 2022 - Present
Designed and deployed ETL/ELT pipelines processing over 100 TB of legal and regulatory data monthly using Python, Apache Spark, and Apache Airflow. Built and maintained a data lakehouse architecture with Silver layers for standardized legal data and optimized a legal data warehouse in Amazon Redshift for business-ready models used by attorneys and compliance teams. Developed data acquisition workflows via web scraping, APIs, and databases to cover court documents and regulatory updates. Partnered with legal and compliance teams to prepare AI-ready datasets improving classification accuracy in legal case retrieval systems. Established data governance and validation frameworks meeting legal standards for privacy, confidentiality, and jurisdictional rules, enhancing security with encryption, IAM-based controls, and audit monitoring.
Data Engineer at Capella Solutions
September 30, 2022 - August 26, 2025
Contributed to AI agent development enhancing decision support systems in industrial and government domains. Led data engineering for Valence, building a data warehouse tracking civil aircraft parts, maintenance, and repair workflows to ensure compliance with aviation safety. Designed and implemented pipelines for US government chemical taxation systems, delivering audit-ready analytics that improved federal tax compliance and reporting accuracy. Automated multi-source data ingestion for AI-driven applications reducing manual reporting workloads by 50%. Optimized real-time voice and text pipelines supporting compliance monitoring. Built data governance frameworks aligned with aviation and chemical taxation regulations to ensure audit readiness.
Software Engineer at Capella Solutions
September 30, 2020 - August 26, 2025
Led ERP and CRM data migrations ensuring smooth data transition with zero downtime. Built SQL queries and aggregation pipelines powering internal dashboards for real-time KPI tracking by executives and business teams. Improved backend SQL query performance, reducing report execution times significantly. Developed data cleaning and preprocessing frameworks to enhance analytics dashboard accuracy. Scaled ingestion pipelines to handle growing transactional and customer data. Collaborated with DevOps to implement CI/CD pipelines using Docker, Kubernetes, and Jenkins, reducing deployment cycles by 50% and accelerating release speed.

Education

M.Sc. Computer Science at Mut’ah University
January 1, 2021 - January 1, 2023
B.Sc. Computer Science at Mut’ah University
January 1, 2016 - January 1, 2020
M.Sc. Computer Science at Mut’ah University
January 1, 2021 - January 1, 2023
B.Sc. Computer Science at Mut’ah University
January 1, 2016 - January 1, 2020

Qualifications

Add your qualifications or awards here.

Industry Experience

Government, Transportation & Logistics, Financial Services, Software & Internet, Professional Services