AfriCDSA Hackathon 2025

Leveraging AI & Data Science for Africa's Future

Innovate. Compete. Transform.

00 Days
00 Hours
00 Minutes
00 Seconds

About the Hackathon

Join us for the AfriCDSA Hackathon 2025, where top innovators, developers, and data scientists come together to tackle real-world challenges using cutting-edge AI and data-driven solutions. This is your chance to showcase your skills, collaborate with like-minded professionals, and contribute to impactful projects that drive change.

Key Details:

  • Event Date: April 25, 2025 – May 2, 2025
  • Venue: Virtual & In-Person (Hybrid Format)
  • Theme: Leveraging AI & Data Science for Africa's Future
  • Prizes: Cash rewards, mentorship opportunities, and industry exposure
  • Registration Deadline: April 10, 2025
Hackathon Participants

Project Challenges

Participants will work on real-world challenges across four key domains. Each project will be judged based on innovation, execution, impact, and scalability.

Agriculture Challenge

Develop a machine learning model to identify diseases on cocoa leaves. This challenge focuses on creating efficient models that can operate on edge devices to help African farmers detect cocoa diseases in real-time.

Problem Statement

Cocoa is a key economic driver for millions of smallholder farmers in Africa, particularly in countries like Ghana and Côte d'Ivoire, which together account for over 60% of global cocoa production. Cocoa Swollen Shoot Virus Disease (CSSVD) and other cocoa diseases threaten not only individual livelihoods but also the broader agricultural economy.

Challenge Objectives

  • Develop machine learning models to accurately predict all diseases present in images of cocoa
  • Create models that can generalize well, even when encountering new diseases not seen in the training set
  • Ensure models operate efficiently on edge devices such as entry-level smartphones

Impact

By developing machine learning models capable of accurately identifying multiple cocoa diseases, this challenge will empower farmers with real-time, AI-driven diagnostics accessible through their mobile phones. The ability to detect new and emerging diseases beyond those seen in the training set will improve early intervention strategies, minimizing crop losses and reducing reliance on reactive pesticide use.

Energy Challenge

Forecast climate and operational effects on load generation for micro-hydropower plants in off-grid communities. Help optimize energy distribution in remote areas of Pakistan.

Problem Statement

In the remote Kalam region of Pakistan, micro-hydropower plants (MHPs) are the backbone of energy generation for rural communities. But nature doesn't always play along. Water flow fluctuates, maintenance disrupts supply, and the weather throws in its own surprises.

Challenge Objectives

  • Build a model that accurately predicts energy load generation (in kWh)
  • Utilize MHP data including voltage, current, power factors, and energy metrics
  • Incorporate climate indicators like temperature, dew point, wind speed, and precipitation

Impact

Predicting energy needs means less waste, fewer blackouts, and smarter resource management—all crucial for keeping the lights on in off-grid communities. Understanding how climate affects energy demand, and what that means for sustainable power planning, is a critical insight for sustainable power projects all over the world.

Finance Challenge

Create a machine learning algorithm to classify purchases into 13 different categories, helping improve financial management and categorization systems.

Problem Statement

Data for this challenge has been collected over 11 months. If a user is the first user to purchase at a merchant, the app asks the user to manually classify the merchant. The next user to purchase at that merchant has the opportunity to confirm the suggestion or enter a new categorization.

Challenge Objectives

  • Create a machine learning algorithm that classifies each purchase into one of 13 different categories
  • Work with verified transaction data from MPESA SMS receipts
  • Handle both verified and unverified transactions effectively

Dataset Information

  • ~400 purchases in training data
  • ~600 purchases in test data
  • ~10,000 unverified transactions
  • All transactions have been verified by Alvin as correctly classified

Healthcare Challenge

Predict waterborne disease outbreaks using climate and environmental data to enable timely interventions in vulnerable communities.

Problem Statement

In Third world countries, climate-sensitive waterborne diseases such as typhoid, amoebiasis, diarrhoea, schistosomiasis, and intestinal worms pose significant health risks, particularly for vulnerable populations like women and children. These diseases are exacerbated by poor water quality, inadequate sanitation, and changing climate patterns.

Challenge Objectives

  • Develop a machine learning model to predict outbreaks of climate-sensitive waterborne diseases
  • Utilize datasets comprising water sources, toilet quality, waste management, health facility data, and climate information from 2019 to 2023
  • Focus on a specific region in Tanzania

Impact

This predictive capability will enable governments and health organisations to implement timely, targeted interventions and optimize resource allocation, ultimately reducing disease incidence and enhancing public health resilience.

Data

Participants will have access to structured datasets and APIs to develop their solutions. Each challenge has its own dedicated dataset available for download.

Agriculture Challenge

Dataset for cocoa disease detection, including training and test images.

Energy Challenge

Micro-hydropower plant data and climate indicators for load forecasting.

Finance Challenge

Transaction data and merchant classification information.

Healthcare Challenge

Waterborne disease data, climate information, and health facility records.

Note: Datasets will be available to registered participants closer to the event start date. Please check back later.

Leaderboard

The leaderboard will display real-time scores based on evaluation metrics, including accuracy, efficiency, and impact. The top-performing teams will be ranked based on their submissions.

Rank Team Name Project Name Score
1 Team Alpha AI Health Bot 95/100
2 Data Wizards Smart Finance 92/100
3 Innovators EdTech AI 89/100

Group Members

Each team will consist of 3-5 members, including data scientists, software developers, and domain experts. Below are samples of the registered teams:

Team Alpha

  • John Wanjiku AI Engineer
  • Jane Smith Data Scientist
  • Mark Wafula Full-Stack Developer

Data Wizards

  • Sarah Lee ML Engineer
  • Robert Brown Data Analyst
  • Emily Clark Software Engineer

How to Participate

1

Register your team

Fill out the online registration form below.

2

Access problem statements

Get datasets and project guidelines.

3

Develop your solution

Work on your project during the hackathon.

4

Submit your project

Present your solution to the judges.