04/06/2026 | Press release | Distributed by Public on 04/07/2026 09:35
Timely and reliable poverty data are essential for effective policymaking. Yet in many countries, household consumption surveys, the backbone of monetary poverty measurement, are conducted infrequently due to cost and logistical constraints. As a result, policymakers often lack up-to-date information on household welfare, particularly during periods of rapid economic change.
To help address this gap, the World Bank launched the Poverty Prediction Challenge in November 2025, a global data science competition designed to advance methods for predicting household consumption and poverty across surveys. Hosted on the DrivenData platform, the Challenge invited participants from diverse analytical backgrounds to develop models that could estimate household expenditure and predict poverty rates using survey data.
The competition closed on February 4, 2026, with strong global engagement: 1,322 participants registered and more than 500 valid solutions were submitted.
The Poverty Prediction Challenge was designed as a supervised learning task. Participants were provided with test survey datasets and asked to:
Participants were free to use any modeling approach, including machine learning, classical econometric techniques, ensemble methods, or hybrid approaches. The competition was open to statisticians, data scientists, researchers, students, and practitioners from around the world.
The objective was not only to identify high-performing models, but also to explore how different approaches perform when using older survey data to estimate poverty in newer surveys, with particular attention to how well these methods work across different contexts.
Traditional household surveys remain the gold standard for measuring poverty. However, because of their periodic nature, they are less useful for governments seeking to respond quickly to shocks, economic downturns, or emerging vulnerabilities.
Improving methods to predict consumption and poverty between survey rounds can:
The Challenge contributed to the World Bank's broader Real-Time Monitoring agenda, which aims to combine traditional data sources with innovative methods to provide more timely and policy-relevant welfare indicators.
Following verification, the top-performing teams were:
|
1st Place - dwivedy045 |
A machine learning approach using structured survey data and careful validation across different surveys. |
|
2nd Place - Khartoum |
A combination of multiple models designed to improve prediction accuracy and adapt to differences across surveys. |
|
3rd Place - selman |
A model focused on ensuring predictions remained consistent across surveys and aligned with observed poverty patterns. |
Top-performing solutions relied on advanced data-driven methods and careful testing approaches to ensure their predictions worked well across different survey contexts.
The code for all winning solutions, including full model implementations and documentation, is publicly available in the official repository.
Several key insights emerge from the Challenge:
A forthcoming working paper will provide a detailed technical analysis. These lessons will inform future work on survey-to-survey imputation and the integration of predictive methods into official poverty monitoring systems.
The Challenge attracted a highly diverse pool of participants across countries and disciplines and featured a wide range of approaches.
All models were evaluated using predefined metrics that combined household-level prediction accuracy with performance in estimating poverty rates across multiple thresholds. Final rankings were determined through a rigorous verification process to ensure reproducibility and methodological soundness.