Resources and Data

Commonly used datasets:

Basics on IR research design, regression analysis 

There are some resources by Xun Pang, who heads Tsinghua’s new Tsinghua International Relations Data & Computing Lab (DCL).

The list of states with COW abbreviations and ID numbers

This dataset records the fluctuating composition of the state system since 1816. It also identifies countries corresponding to the standard Correlates of War country codes.

The new list of wars that will be included in the COW war databases is available. Non-state War dataset (v4.0), Intra-state War dataset (v4.0), Inter-state War dataset (v4.0), and Extra-state War dataset (v4.0) are now available.

This data set records all instances of when one state threatened, displayed, or used force against another. Version 4.1 covers the 1816-2010 period.

This dataset records the geographic locations of MIDs in latitude/longitude coordinates. Version 1.1 covers the 1816-2001 period.

Power is considered by many to be a central concept in explaining conflict, and six indicators – military expenditure, military personnel, energy consumption, iron, and steel production, urban population, and total population – are included in this dataset. It serves as the basis for the most widely used indicator of national capability, CINC (Composite Indicator of National Capability) and covers the period 1816-2012.

This dataset aims to provide detailed information about religious adherence worldwide since 1945. This data set is hosted by Zeev Maoz, University of California-Davis, and Errol A. Henderson, Pennsylvania State University.

This data set records all formal alliances among states between 1816 and 2012, including mutual defense pacts, non-aggression treaties, and ententes. This data set is hosted by Douglas Gibler, University of Alabama.

This data set records all peaceful and violent changes of territory from 1816-2014. This data set is hosted by Paul Diehl, University of Illinois and Jaroslav Tir, Colorado-Boulder.

The Direct Contiguity dataset registers the land and sea borders of all states since the Congress of Vienna, and covers 1816-2016. This data set is hosted by Paul Hensel, University of North Texas.

The Colonial/Dependency Contiguity dataset registers contiguity relationships between the colonies/dependencies of states (by land and by sea up to 400 miles) from 1816-2016.

Although the number of intergovernmental organizations (IGOs) grew dramatically during the late 20th century, they have been part of the world scene for much longer. This dataset tracks the status and membership of such organizations from 1815-2005. Access information about this data here. This data set is hosted by Timothy Nordstrom, University of Mississippi, Jon Pevehouse, University of Wisconsin and Megan Shannon, Colorado-Boulder.

The Diplomatic Exchange dataset tracks diplomatic representation at the level of chargé d’affaires, minister, and ambassador between states from 1817-2005. This data set is hosted by Reşat Bayer, Koç University.

This dataset tracks total national trade and bilateral trade flows between states from 1870-2014. This data set is hosted by Katherine Barbieri, University of South Carolina, and Omar Keshk, Ohio State University.

National Material Capabilities (v5.0)

Power is considered by many to be a central concept in explaining conflict, and six indicators – military expenditure, military personnel, energy consumption, iron and steel production, urban population, and total population – are included in this dataset. It serves as the basis for the most widely used indicator of national capability, CINC (Composite Indicator of National Capability) and covers the period 1816-2012.

International Crisis Behavior (ICB) data sets (1918-2013)

The International Crisis Behavior (ICB) Version 11 datasets are now available! With the addition of 15 crises between 2008 and 2013, the ICB data now covers 1918 to 2013. This consists of 470 crises and 1,036 crisis actors.


China Biographical Database Project (CBDB)

The China Biographical Database is a freely accessible relational database with biographical information about approximately 370,000 individuals as of April 2017, primarily from the 7th through 19th centuries. With both online and offline versions, the data is meant to be useful for statistical, social network, and spatial analysis as well as serving as a kind of biographical reference. The image below shows the spatial distribution of a cross dynastic subset of 190,000 people in CBDB by basic affiliations (籍貫).


The Chinese General Social Survey (CGSS)

The Chinese General Social Survey (CGSS) launched in 2003, is the earliest national representative continuous survey project run by an academic institution in China mainland. CGSS is aimed to systematically monitor the changing relationship between social structure and quality of life in both urban and rural China. Social structure refers to dimensions of social group and organization as well as networks of social relationships. Quality of life is the objective and subjective aspects of the people well-being both at the individual and aggregate levels.


UCDP/PRIO Armed Conflict Datasets

A conflict-year dataset with information on armed conflicts where at least one party is the government of a state in the time period 1946-2016. This would be the most widely used datasets in the study of civil war.


The Empirical Studies of Conflict Project (ESOC) 

The ESOC publishes and compiles high quality sub-national data on civil war from Cold War conflict in Vietnam to contemporary conflicts in Afghanistan, Iraq; from Latin America to the Middle East.


My Data:


Software:

R is a free and designed for statistics but it also works for a variety of other purposes. download R software here and R studio provide a more user-friendly interface.

There are many free R tutorials online, Datacamp is one example.


Stata is another data analysis software that is easy to use but it is not free.


Tableau is an easy to use data visualization/GIS tool, this public version is free.

geocode


Public Opinion:

Pew Research Centre (Pew) conducts a wide range of surveys on U.S., and international issues. Their data is directly downloadable from the site.


American National Election Studies (ANES) provide some of the most comprehensive and high-quality survey data on US citizens.


Asian Barometer Survey (ABS) is very useful survey data in East and Southeast Asia. Just send them an application file and they will give you the data!