Hi, my name is

Amr Hagag

Data Engineer

I'm a passionate and detail-oriented Data Engineer dedicated to transforming raw data into meaningful insights that drive informed decisions. With a strong foundation in Python, SQL, and data pipelines, I specialize in designing efficient ETL workflows, building scalable data architectures, and ensuring data reliability across systems.

Amr Hagag - Data Engineer

About Me

I'm Amr Hagag, a Junior Data Engineer and DEPI Graduate with hands-on experience in building ETL pipelines, designing data warehouses, and implementing cloud-based data workflows. I have strong skills in Python, SQL, SSIS, Apache Airflow, Apache Spark, and Azure, and I've applied these tools in projects involving fraud detection, sales analytics, and data management systems. I am passionate about turning raw data into reliable, actionable insights and I'm looking to contribute to innovative teams in finance, fintech, or e-commerce.

Languages:

Python
C++

Tools & Technologies:

SQL Server
MySQL
SSIS
Apache Airflow
Apache Spark
Azure
Databricks

Data Engineering:

ETL Pipelines
Data Warehousing (Star Schema / Dimensional Modeling)
Cloud-based Data Workflows
Data Quality & Validation

Data Processing & Visualization:

Data Cleaning & Transformation
Data Preprocessing
Reporting & Basic BI (SQL Queries, Excel)

Experience

Jun 2025 – Dec 2025

Data Engineer Trainee

DEPI (Digital Egypt Pioneers Initiative)

Built ETL pipelines and managed data workflows using Python, SQL Server, SSIS. Designed scalable workflows with Apache Airflow, Spark, and Azure. Automated daily processes, improving data quality and reporting efficiency. Applied data validation rules across pipelines supporting ML-based fraud detection.

PythonSQL ServerSSISApache AirflowApache SparkAzureETLData Validation

Jul 2024

ICPC Regional Contest Participant

Menoufia University

Solved algorithmic problems in C++; ranked 104th on Day 2 of the Regional Contest.

C++AlgorithmsProblem SolvingData Structures

Featured Projects

Payment Security – Smart Fraud Detection & Analysis

Payment Security – Smart Fraud Detection & Analysis

Cleaned and preprocessed bank transaction dataset. Designed SQL Server database with Customers, Accounts, Transactions, Merchants, Devices, and Locations. Built ETL pipelines and a star-schema Data Warehouse. Developed Python scripts to generate synthetic transactions for testing. Implemented cloud workflows in Azure & Databricks supporting ML predictions.

PythonSQL ServerETLAzureDatabricksData WarehouseML
Sales Data Mart – SSIS Project

Sales Data Mart – SSIS Project

ETLed data from AdventureWorks2022 into Sales Data Mart. Applied transformations, validations, and optimized loads for efficiency.

SSISSQL ServerETLData WarehousingAdventureWorks
Smart E-Commerce Sales Management System

Smart E-Commerce Sales Management System

Designed SQL transactional DB and built Python preprocessing for data cleaning & validation. Improved query performance and reporting efficiency.

PythonSQLDatabase DesignData PreprocessingPerformance Optimization

Get In Touch

Let's Connect

I'm always interested in hearing about new opportunities in data engineering, whether that's a full-time role, contract work, or just a chat about data technology. Feel free to reach out if you'd like to connect!

© 2025 Amr Hagag. Built with Next.js and Tailwind CSS.

Built with v0