The GDELT Project

New Television News Inventory Files

We're excited to announce the new Television News inventory files, which record the complete list of public shows archived by the Internet Archive's Television News Archive and processed as part of the Television Explorer and Television News Ngrams Dataset.

There are two new daily inventory files, one recording the list of all shows monitored that day and the other recording the overall statistics and dominate show for each 30 minute slot on each station that day (making it ideal for analyzing the ngrams dataset).

SHOWLIST

For each day there is a "showlist" file that contains a list of every show monitored that day, even if it was just a few minutes long (and thus may not appear in the "timelist" inventory file for that day).

The files are stored at "http://data.gdeltproject.org/gdeltv3/iatv/inventory/YYYYMMDD.showlist.txt" where YYYYMMDD should be replaced with the day of interest, from the start date of 20090604 through 24 hours from present. Files are updated every 30 minutes as new shows are processed. Thus the showlist inventory file for June 1, 2019 is "http://data.gdeltproject.org/gdeltv3/iatv/inventory/20190601.showlist.txt".

Each row represents one show monitored that day. Note that shows that span across two days (such a show that begins at 11PM one day and stretches one minute into the following day) will be recorded on both days, with their respective counts reflecting how much of the show's programming occurred on each day.

Each row has the following fields (there is no header row):

The dataset is also available in Google's BigQuery:

TIMELIST

For each day there is also a "timelist" file that breaks the day into 30 minute blocks for each station and records the aggregate statistics for each block, along with the primary show airing on that station in that block on that day.

The files are stored at "http://data.gdeltproject.org/gdeltv3/iatv/inventory/YYYYMMDD.timelist.txt" where YYYYMMDD should be replaced with the day of interest, from the start date of 20090604 through 24 hours from present. Files are updated every 30 minutes as new shows are processed. Thus the timelist inventory file for June 1, 2019 is "http://data.gdeltproject.org/gdeltv3/iatv/inventory/20190601.timelist.txt".

Each row represents a station/day/30-minute slot combination. In other words, for a given day there will be up to 48 slots for each station (24 hours at 30 minutes resolution). Note that shows that span across two days (such a show that begins at 11PM one day and stretches one minute into the following day) will be recorded on both days across their respective slots. The semantics of the timelist inventory file is identical to that used for the ngrams dataset.

Each row has the following fields (there is no header row) (note that the ordering of some fields is different than the showlist inventory files):

The dataset is also available in Google's BigQuery: