The GDELT Project

GDELT GKG GeoJSON Files Available

For those who want to create their own interactive GDELT maps in CartoDB like the three we've unveiled thus far (realtime news map, realtime happiest and saddest news animation, and food/water security, transportation, and health/disease map), or want to import GDELT into spatial analytics toolkits, we've made the underlying GDELT GKG GeoJSON files available for download.  The files below are updated every 15 minutes and reflect a rolling hour/24 hour window as described more below.  There are three files, the Hourly URL GeoJSON, the Minimized Non-US Hourly URL GeoJSON, and the 24 Hour Animation GeoJSON.  Read the details about each below.

If you have a CartoDB "John Snow" or above account, make sure to select "Sync every hour" when you import these files and they will live-update for you every hour, yielding the same constantly-updating maps that we've been creating!  If you are importing into your own system, just refetch each of these URLs once every hour to get the latest data.  For those with sufficient disk space, you could download the Hourly URL GeoJSON file every hour and append to a master table to create a master longitudinal time series map table.  In CartoDB you might do this by having a synced table and once an hour making a CartoDB SQL API call to copy the table into a master table (though you will run out of disk quite quickly unless you have an enterprise account).

 

Hourly URL GeoJSON

[Download GeoJSON]
This GeoJSON file contains all coverage that GDELT has monitored in the last hour.  It updates every 15 minutes and reflects all coverage seen in the preceding 60 minutes on a rolling basis.  Each row represents one location found in a given article.  If an article mentions multiple locations it will be listed multiple times, once for each location.  If a location is mentioned in multiple articles, it will be listed multiple times, once for each article.  The goal of this GeoJSON file is to enable the creation of clickable map layers that allow interactive spatial browsing of GDELT's geographic information.  NOTE that when loaded into a database platform like CartoDB it will freqently consume between 100 and 300MB, so requires a large amount of database space.

The fields contained in the file are as follows:

 

Minimized Non-US Hourly URL GeoJSON

[Download GeoJSON]
This file is identical to the Hourly URL GeoJSON with the exception that it excludes all locations in the United States and that it is capped at a maximum of 40MB and/or 40,000 entries, whichever is smaller.  This file is designed to offer a minimized version of the file, suitable for importing into free CartoDB accounts and other mapping or analytic platforms that have relatively small limits on data size.

 

24 Hour Animation GeoJSON

[Download GeoJSON]
This GeoJSON file contains a clustered version of all coverage that GDELT has monitored in the last 24 hours, collapsed by location+timestamp instead of by article.  It updates every 15 minutes and reflects all coverage seen in the preceding 24 hours on a rolling basis.  Unlike the Hourly GeoJSON files, in this file each row represents a single location in a single 15 minute interval instead of a single mention of a location in a single article.  Here, all mentions of a given location within a 15 minute interval are collapsed into a single row.  For the names and themes fields, all names and themes associated with any mention of the location in the given time interval are concatenated together, as is the case with the domains field.  For the other fields they are averages or summations of all mentions.  The name field is excluded from this file to conserve disk space.  The goal of this GeoJSON file is to enable the creation of animation map layers that show change over time, rather than supporting interactive clickable maps.  NOTE that when loaded into a database platform like CartoDB it will freqently consume between 250 and 450MB, so requires a large amount of database space.

The fields contained in the file are as follows: