journal article

Multivariate random forest prediction of poverty and malnutrition prevalence

by Chris Browne,
David S. Matteson,
Linden McBride,
Leiqiu Hu,
Yanyan Liu,
Ying Sun,
Jiaming Wen and
Christopher B. Barrett
Open Access | CC BY-4.0
Citation
Browne, Chris; Matteson, David S.; McBride, Linden; Hu, Leiqiu; Liu, Yanyan; Sun, Ying; Wen, Jiaming; Barrett, Christopher B. 2021. Multivariate random forest prediction of poverty and malnutrition prevalence. PLoS ONE 16(9): e0255519 https://doi.org/10.1371/journal.pone.0255519

Advances in remote sensing and machine learning enable increasingly accurate, inexpensive, and timely estimation of poverty and malnutrition indicators to guide development and humanitarian agencies’ programming. However, state of the art models often rely on proprietary data and/or deep or transfer learning methods whose underlying mechanics may be challenging to interpret. We demonstrate how interpretable random forest models can produce estimates of a set of (potentially correlated) malnutrition and poverty prevalence measures using free, open access, regularly updated, georeferenced data. We demonstrate two use cases: contemporaneous prediction, which might be used for poverty mapping, geographic targeting, or monitoring and evaluation tasks, and a sequential nowcasting task that can inform early warning systems. Applied to data from 11 low and lower-middle income countries, we find predictive accuracy broadly comparable for both tasks to prior studies that use proprietary data and/or deep or transfer learning methods.