Available for opportunities

George
Debrah

Data Engineer & Analyst with a passion for turning complex data into actionable intelligence. Building optimized pipelines, dashboards, and ML models that move the needle.

Berlin, Germany · Open to relocation
3+
Years experience
91%
ML model precision
50%
Reporting time cut
75.8
mAP thesis model
01 —

Experience

Oct 2024 – Oct 2025
Data Engineer
Phrontlyne Technologies · Accra
  • Analyzed large-scale mobile performance data using Python & SQL for data-driven decision-making.
  • Designed REST API pipelines reducing data retrieval time by 1.8 seconds.
  • Built Power BI dashboards contributing to a 15% increase in daily active users.
  • Collaborated with engineers to reduce app crash rates by 30%.
  • Implemented and customized ERP systems for the Finance and HR departments across four insurance companies, configuring finance modules to ensure IFRS 17 compliance and streamline actuarial reporting processes.
Jun 2024 – Sep 2024
Data Analyst
Pensive Global Consult · Accra
  • Performed data cleaning and exploratory analysis in Python & SQL for client projects.
  • Designed and trained an ML anomaly detection model achieving 91% precision.
  • Developed Power BI dashboards cutting reporting time by 50%.
  • Improved project success rates by 25% via data-driven process design.
Sep 2023 – Sep 2024
Software Engineer
MyStoreAid · Accra
  • Developed cloud-backed mobile components with React Native & AWS for scalable architecture.
  • Optimized AWS IAM, EC2, S3 configs, reducing security vulnerabilities by 50%.
  • Implemented automated test suites cutting post-release bugs by 60%.
Jun 2022 – Aug 2023
Data Analyst
TheBoardroom Africa · Mauritius
  • Analyzed multi-country datasets to identify trends supporting strategic decision-making.
  • Power BI & Excel dashboards that accelerated target achievement by 20%.
  • Built Python forecasting & ML models improving prediction accuracy by 35%.
02 —

Projects

Portfolio Project · 2025
Berlin Traffic & Mobility Dashboard
Data Visualisation JavaScript Chart.js Open-Meteo API Simulation

An interactive traffic and mobility control-room dashboard for Berlin, demonstrating data engineering and visualisation skills. Features four analytical views — Overview, Public Transit, Road Network, and Analytics — with live weather data pulled from the Open-Meteo API. Traffic and transit data uses realistic Berlin peak-hour simulation profiles.

4
Dashboard views
Live
Weather data
10+
Chart types
View Dashboard →
Thesis Research · 2022–2023
Intelligent Traffic System
YOLOv5 Bee Colony Optimization Computer Vision Python

Trained a YOLOv5 object detection model on 113,888 annotated traffic light images to intelligently detect and classify traffic conditions in Ghana. Paired the vision model with a Bee Colony Optimization algorithm to dynamically optimize traffic signal phasing, reducing average vehicle wait times in simulation by 32%.

75.8%
mAP achieved
−32%
Vehicle wait time
+28%
Traffic flow efficiency
03 —

Research

How can ML and AI be applied not just to understand trade and logistics systems — but to actively optimize them, removing friction, predicting failure, and making supply chains smarter in real time?

My research interest lies at the intersection of machine learning, combinatorial optimization, and operational intelligence — applied to the trade and logistics industry, where inefficiency is costly, decisions are time-critical, and data is abundant but underutilized.

Research inquiries: georgenarteydebrah@gmail.com

Research Vision

Global trade and logistics networks are among the most complex optimization environments in existence — thousands of interdependent decisions made under uncertainty, across time zones, regulatory boundaries, and fluctuating demand. Yet most systems still rely on static rules and reactive planning. My research asks: what becomes possible when ML and AI move from describing logistics problems to actively solving them?

My undergraduate thesis — combining YOLOv5 object detection with Bee Colony Optimization to cut vehicle wait times by 32% in a dynamic traffic environment — is the clearest expression of this conviction. Traffic management and freight routing share the same mathematical DNA: both are real-time, multi-agent, combinatorially explosive problems where learned models and adaptive algorithms together outperform any fixed strategy. The thesis was a proof of concept. Logistics is the application at scale.

My long-term goal is to develop and deploy ML/AI systems inside real logistics and trade operations — in industry R&D settings where research translates directly into measurable throughput, cost reduction, and resilience improvements.

Existing Research: Intelligent Traffic Optimization

My undergraduate thesis at Ashesi University produced a fully implemented intelligent traffic management system for Ghanaian urban intersections — one of the first studies to combine deep learning-based vehicle detection with a nature-inspired optimization algorithm in a sub-Saharan African road context.

I trained a YOLOv5 object detection model on 113,888 annotated traffic light images, achieving 75.8% mean average precision (mAP) on validation data. The detection model fed real-time vehicle counts into a Bee Colony Optimization (BCO) algorithm that dynamically adjusted signal phase timings across intersections — replacing fixed, pre-timed cycles with an adaptive system that responds to actual traffic load. The BCO solver achieved a lower optimal fitness value than baseline static schedules, corresponding to a 32% reduction in average vehicle wait time and a 28% improvement in overall traffic flow efficiency in simulation, validated against real-world traffic datasets from Ghanaian intersections.

This work established three principles that now shape my broader research agenda: that perception and optimization must be co-designed rather than bolted together; that nature-inspired search algorithms are particularly well-suited to dynamic, high-dimensional scheduling problems; and that domain-specific datasets — not generic benchmarks — are essential for building systems that actually work in context. These principles translate directly into how I approach ML/AI applications for trade and logistics.

Key outcomes
  • 75.8% mAP on 113,888-image annotated traffic detection dataset
  • 32% reduction in average vehicle wait time vs. static-timed baselines
  • 28% improvement in traffic flow efficiency validated on real Ghanaian intersection data

ML-Driven Route & Network Optimization

Classical vehicle routing and freight planning algorithms — TSP variants, column generation, branch-and-price — are powerful but brittle: they optimize for the world as modeled, not the world as it unfolds. Disruptions, port delays, carrier failures, and demand spikes break static plans immediately. My research explores how reinforcement learning and predictive ML can be integrated with optimization solvers to produce plans that are not just optimal at dispatch time, but dynamically re-optimized as conditions change.

Directly extending my thesis work, I am interested in how population-based metaheuristics — which proved highly effective for adaptive traffic signal phasing — can be adapted for multi-depot, multi-commodity freight routing under stochastic demand and real-time disruption signals. The goal is a planning system that learns from historical shipment data to anticipate bottlenecks before they materialize.

Future directions
  • Hybrid RL + metaheuristic solvers for last-mile delivery under live traffic conditions
  • Disruption-aware re-routing triggered by predictive delay models
  • Multi-objective optimization balancing cost, carbon footprint, and delivery SLA simultaneously

Predictive Analytics for Supply Chain Resilience

Supply chains fail quietly before they fail visibly — demand signals drift, supplier lead times shift, and inventory buffers erode gradually until a stockout or shipment crisis makes the problem undeniable. My experience building an ML anomaly detection model achieving 91% precision at Pensive Global Consult, and constructing forecasting pipelines at TheBoardroom Africa that improved prediction accuracy by 35%, established a strong foundation for applying these methods at the supply chain level.

I am interested in early-warning ML systems that continuously monitor supply chain health — tracking lead time variance, supplier reliability scores, port congestion indicators, and macroeconomic signals — and surface actionable alerts before disruptions propagate downstream. The critical research challenge is building models that remain calibrated as the distribution of disruption events shifts, which is precisely the adaptive learning problem my thesis work began to address.

Future directions
  • Supplier risk scoring using NLP on trade data, news feeds, and shipping records
  • Demand forecasting under structural breaks caused by geopolitical or climate events
  • Graph neural networks for modeling cascading failure across multi-tier supplier networks

Intelligent Data Pipelines for Trade Operations

Logistics organizations generate enormous volumes of operational data — from warehouse management systems, IoT sensors, ERP platforms, customs documentation, and carrier APIs — that sit in silos, arrive late, or require manual reconciliation before they can be used. My production experience building optimized data pipelines at Phrontlyne Technologies, reducing retrieval latency by 1.8 seconds and reporting time by 50%, and deploying ERP systems configured for IFRS 17 compliance, gave me direct insight into how data infrastructure quality determines the ceiling of any analytical or AI capability built on top of it.

I am interested in the design of real-time data integration architectures for trade operations — systems that ingest, clean, and route operational signals fast enough for ML models to act on them, not just analyze them after the fact. The research question is not just "what can we predict?" but "can we deliver that prediction in time for it to change a decision?"

04 —

Skills

A full-stack data practitioner — equally comfortable wrangling raw data, building production pipelines, and communicating insights through compelling visualizations.

Languages & Libraries
Python SQL Pandas NumPy Scikit-learn Matplotlib Seaborn React Native
Databases & Infrastructure
MySQL PostgreSQL Apache Spark Google BigQuery AWS (EC2, S3, IAM) Azure Git
Visualisation & BI
Power BI Tableau Excel
05 —

Education

MSc Data Analytics
Berlin School of Business and Innovation
March 2025 – Present · Berlin
BSc Computer Science
Ashesi University
Graduated June 2023 · Berekuso, Ghana
Google Data Analytics Certificate
Google / Coursera
August 2022
Data Analytics Consulting Program
KPMG
June 2021
06 —

Community

Volunteer
Benaks Humanitarian Organization · Sep 2022 – Present

Let's build
something great
together.

Currently based in Berlin and open to data engineering, analytics, ML engineering roles, and research collaborations. Let's connect.