  • Washington, D.C.
ibrahimakerkouch/README.md

👋 Hi, I'm Brahim

Welcome to my GitHub! I’m a data professional specializing in data engineering, ETL pipelines, and data quality. I enjoy solving complex data problems and building reliable, scalable data solutions. My work focuses on data validation, anomaly detection, root-cause analysis, and dashboard reporting to keep data accurate and trustworthy across systems.

What You’ll Find Here

⚙️ End-to-end ETL and data integration projects
🔍 Data quality, duplicate detection, and issue detection workflows
🧬 Real-world datasets and pipeline testing setups
📈 Dashboards and visual analytics for KPIs, patterns, and trends
🧱 Documentation and best practices for scalable data solutions

Pinned

  1. Patient-Registry-ETL Public

    Automated ETL pipeline for patient registry data using PySpark and Airflow: extracts, transforms, and loads health condition, treatment, and patient records into Delta tables on S3 for analytics an…

    Python

  2. OpenTriviaDB-Analytics Public

    Automated ETL pipeline that ingests trivia data from the Open Trivia Database API into MySQL and delivers interactive analytics through a Power BI dashboard.

    Python

  3. Donor-Mailing-Data-PreProcessing Public

    Python ETL pipeline for cleaning, standardizing, and preparing donor data for mailing campaigns and BCC Software upload.

    Python

  4. ClinicalTrials.gov-Data-Pipeline Public

    End-to-end ETL pipeline for ClinicalTrials.gov cancer trials data with PostgreSQL storage and Power BI dashboard.

    Python

  5. TheMovieDB-Integration Public

    Automated Python workflow for collecting and organizing movie data from TheMovieDB into MongoDB.

    Python

  6. Excel-Interactive-Dashboard Public

    A dynamic Excel dashboard featuring PivotTables, charts, and slicers to analyze sales performance by country, product, and month. Includes a “New” sheet to demonstrate refresh functionality.