ACAPS crisis indicators

There are several types of data used by responders in a crisis. Most crisismapping has focused to date on the data generated during and immediately after a crisis: the tweets, messages, reports, alerts and other social media traffic that happens in response to crisis situations and needs.

But a country doesn’t suddenly change socially because a crisis happens.  It doesn’t leap, ready-formed, into a new incarnation where the only thing that’s ever happened is the current earthquake, flood, famine etc.  Countries have histories.  Events happen, societies form, agencies and other countries, if needed, help them to develop and become resilient. And all this activity creates data.

One of the first things that a response agency does, as it goes into a crisis, is create a profile of that country.  How many people are there?  What’s it society like?  What are its existing needs?  Which crises have happened before (and how did people respond to them)?  Which NGOs are already active there?  Where is everything (towns, hospitals, clean water sources etc)?

I’ve been studying the crisis indicators from 3 organisations (UNOCHA, WWHGD, ACAPS), and the development indicators from many more (UNSTATS, UNDP, WHO, World Bank etc).  And the big questions that I’ve been asking are: what do responders going into a crisis need, where can we find that information, and how can we make that faster and easier to do.

IMHO, ACAPS has one of the best indicator lists available. Which is handy because I’m about to co-lead a Standby Task Force deployment to help make obtaining and organizing their indicator data faster and easier.  The numbers that they’re looking for are:

Country size in square kilometres
Average population growth rate
Climate
Corruption Perceptions Index
Country size in comparison to the world
Distribution of poverty (areas, vulnerable groups)
GDP
Gross national income per capita
Human Development Index
Labour force per occupation
Land cover, elevation, soils and geology
Literacy rate
Malnutrition prevalence, height for age (Percentage of children under 5)
Maternal Mortality rate as deaths per 100,000 live births
Mobile cellular subscriptions (per 100 people)
Natural Disasters Risk Index
Percentage of population living below poverty line – urban
Percentage of population living below poverty line – rural
Percentage of the population – rural
Percentage of the population – urban
Population density (people per square km)
Poverty rate
Seasonal migration patterns, by region and reasons
Socio cultural characteristics of the population – ethnic group, language, religion etc
under-5 Mortality Rate as deaths per 1,000 people
Unemployment (Percentage of total labour force)
Vulnerability and Crisis Index

An SBTF team is spending the next week looking for these indicators for a specific country (DR Congo) so we can learn about how difficult that is, alternative sources of information, indicator provenance etc. whilst other teams look for demographics, crisis timelines, existing surveys and ways to generalize and automate parts of this process.  I’s going to be quite an exciting week.

References:

Where do the crisis indicator numbers come from?

I’ve been tracking the provenance of some GIS datasets lately, working on the licenses and attributions that have to be attached to them when they’re released as open data. I’ve also been working on automatically generating development indicator sets for a country – the numbers that help responders understand the state of a country *before* a crisis happens (social problems don’t go away just because an earthquake happens).

A lot of crisismapping comes down to persistence, capability and trust. Trust is a biggie: as someone (Gisli Olafssen?) said, a disaster is not the time to be handing out business cards. We need to trust the people we’re working with, the systems we’re using (within sensible limits and the occasional online equivalent of kicking the case in the right place), and we need to (again within sensible limits) trust the data we’re using. Trust in the data that happens during a crisis has been talked about a lot recently (see conversations about verification, spoofing etc). It hasn’t been discussed so much when we’re talking about crisis indicators.

First, it’s not enough just to have a number, even if it’s only used as a rough rule-of-thumb for how bad a prior situation was. We also need to know how much we can trust that number, which means knowing where the number came from: who collected the data, how big a survey it was, how accurate the numbers are likely to be.

So, in recent work, we’ve started by looking for sources. Most databanks have a list of sources somewhere in the dataset description (or for individual datapoints, in the dataset footnotes); most maps have these somewhere in the margins. If they don’t, they should, because they can tell us a lot about how much we can trust the numbers.

Take, for example the MMR, the maternal mortality rate (annual number of deaths per 100,000 live births) for the Democratic Republic of Congo in the last few years. There are several versions of this number, and here’s a quick search for them and their listed sources:

Source Value Year Quoted source(s) for the number
CIA World Factbook”s DR Congo page 540 2010 No source quoted.But this is the same number as the World Bank indicator SH.STA.MMRT, so it”s probably from there.
World Bank Indicator SH.STA.MMRT 540 2010 WDI and GDF 2010. The data are estimated with a regression model using information on fertility, birth attendants, and HIV prevalence. Trends in Maternal Mortality: 1990-2010. Estimates Developed by WHO, UNICEF, UNFPA and the World Bank.
Trends in Maternal Mortality: 1990-2010 540 2010 Explains in great detail how the number was calculated, and which datasources were used for it. Gives an uncertainty range for the number (300-1100), number of deaths (15000), lifetime risk (30) and PM, the percentage of maternal deaths in deaths of women of reproductive age (18.4%). Also explains that the MMR has been rounded to the nearest 10.
Data.un.org: UNICEF State of the World”s Children 2010 report 670 2008 UN_WHO, UNICEF, UNFPA and World Bank . This is probably the same source used in the World Bank figures, but there”s no reference to follow on the data webpage.Periodically, the United Nations Inter-agency Group (WHO, UNICEF, UNFPA and the World Bank) produces internationally comparable sets of maternal mortality data that account for the well-documented problems of under-reporting and misclassification of maternal deaths, including also estimates for countries with no data. Please note that owing to an evolving methodology, these values are not comparable with previously reported maternal mortality ratio “adjusted” values.Data.un.org: UNICEF State of the World”s Children 2010 report: MMR Reported 550 2006-2010 UN_Nationally representative sources, including household surveys and vital registrationThe maternal mortality data in the column headed “reported” refer to data reported by national authorities.
Data.un.org: Millenium Development Goals 670 2008 Trends in Maternal Mortality: 1990-2008. WHO/UNICEF/UNFPA/WBUNICEF State of the World”s Children 2012 report (on unicef.org): Reported rate 550 2006-2010 The maternal mortality data in the column headed “reported” refer to data reported by national authorities.UNICEF State of the World”s Children report 2012 (on unicef.org): Adjusted rate 670 2006-2010 The data in the column headed “adjusted” refer to the 2008 United Nations inter-agency maternal mortality estimates that were released in late 2010. Periodically, the United Nations Inter-agency Group (WHO, UNICEF, UNFPA and the World Bank) produces internationally comparable sets of maternal mortality data that account for the well-documented problems of under-reporting and misclassification of maternal deaths, including also estimates for countries with no data. Please note that owing to an evolving methodology, these values are not comparable with previously reported maternal mortality ratio “adjusted” values.Comparable time series on maternal mortality ratios for the years 1990, 1995, 2000, 2005 and 2008 are available at www.childinfo.org.
Data.un.org Gender Info: MMR (estimate), female 15-49 yr 990 2000 WHO_Reproductive Health Indicators Database_Jul2007 (International estimate)
Data.un.org Gender Info: MMR (low estimate), female 15-49 yr 250 2000 WHO_Reproductive Health Indicators Database_Jul2007 (International estimate)
Data.un.org Gender Info: MMR (high estimate), female 15-49 yr 1800 2000 WHO_Reproductive Health Indicators Database_Jul2007 (International estimate)
WHO Reproductive Health Indicators Database Database not found. Might have been superceded by WHO”s Global Health Observatory.WHO Global Health Repository global burden of disease death estimates by sex, maternal conditions (GBD code W042) 34.7 2008 There”s a set of references in the spreadsheet”s Notes page, but no specific reference given for this number.

We learn several things from this.

  • The numbers are estimates. It’s not always clear that this is so.
  • Even on the same datasite, numbers for the same indicator can differ and aren’t always updated.
  • You can sometimes use the numbers to guess at their source (e.g. the CIA figure) – but you have to be careful about generalizing this.

Maternal mortality turned out to have a very strong single source (the trends in maternal mortality reports). But for many development indicators, the numbers from different sources very rarely match, and are sometimes very wildly different. So if we want a number (or even a couple of numbers) to use, we need to do some detective work. More on this soon.