Predicting Economic Development using Geolocated Wikipedia Articles

05/05/2019
by   Evan Sheehan, et al.
0

Progress on the UN Sustainable Development Goals (SDGs) is hampered by a persistent lack of data regarding key social, environmental, and economic indicators, particularly in developing countries. For example, data on poverty --- the first of seventeen SDGs --- is both spatially sparse and infrequently collected in Sub-Saharan Africa due to the high cost of surveys. Here we propose a novel method for estimating socioeconomic indicators using open-source, geolocated textual information from Wikipedia articles. We demonstrate that modern NLP techniques can be used to predict community-level asset wealth and education outcomes using nearby geolocated Wikipedia articles. When paired with nightlights satellite imagery, our method outperforms all previously published benchmarks for this prediction task, indicating the potential of Wikipedia to inform both research in the social sciences and future policy decisions.

READ FULL TEXT
research
09/19/2018

Learning to Interpret Satellite Images Using Wikipedia

Despite recent progress in computer vision, fine-grained interpretation ...
research
11/08/2021

SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning

Progress toward the United Nations Sustainable Development Goals (SDGs) ...
research
12/06/2017

On monitoring development using high resolution satellite images

We develop a machine learning based tool for accurate prediction of deve...
research
05/07/2019

Learning to Interpret Satellite Images in Global Scale Using Wikipedia

Despite recent progress in computer vision, finegrained interpretation o...
research
05/03/2022

Learning Economic Indicators by Aggregating Multi-Level Geospatial Information

High-resolution daytime satellite imagery has become a promising source ...
research
08/10/2022

Weak Supervision in Analysis of News: Application to Economic Policy Uncertainty

The need for timely data analysis for economic decisions has prompted mo...
research
03/25/2016

"Did I Say Something Wrong?" A Word-Level Analysis of Wikipedia Articles for Deletion Discussions

This thesis focuses on gaining linguistic insights into textual discussi...

Please sign up or login with your details

Forgot password? Click here to reset