The GDELT Project

Visual Global Knowledge Graph (VGKG) April 2016 Snapshot Dataset

Following in the footsteps of our February snapshot, we're releasing an April 2016 snapshot of the Visual Global Knowledge Graph (VGKG) by popular demand! This snapshot is in CSV format, one Article/Image per row, with the following columns (in order of appearance):

There are 36,769,236 rows, including the header row, totaling 233GB (for this extract only the newer images for which the full JSON output is available were included). Given the size of the full dataset, it has been broken into 6 parts and each has been GZIP'd. The first row of Part 1 is the CSV header row with the column names. To load into a database, simply download and gunzip all six parts and then concatenate back together into a single file.