The GDELT Project

Announcing The New LowerThird Television News Chyron Search API!

We're tremendously excited to announce the new Television News Chyron Search API today! Like the Television News API, it lets you run keyword and phrase searches across two and a half years of television news chyrons from BBC News, CNN, MSNBC and Fox News, generating timelines, station breakdowns, word clouds and clip summaries, complete with links to the Internet Archive's website where you can view the matching clips themselves. Chyrons offer a powerful correlate to the spoken word transcripts of closed captioning: they add contextual detail and editorialization, and can even serve as a proxy for the airtime of various guests.

Since August 2017, the Internet Archive's Television News Archive has extracted the chyrons of CNN, MSNBC, Fox News and BBC News by OCR'ing a small bounding box at the bottom of the screen once per second. Last month we began reprocessing the Archive's raw OCR data into a new research chyron dataset. Taking the raw per-second OCR output as-is, we clean and reshape it using language modeling and edit distance clustering to yield a maximally comprehensive, research-grade dataset that surfaces the "best" available onscreen chyron text for each minute.
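
As a rough illustration of the edit distance clustering idea, the Python sketch below groups one minute's per-second OCR readings by Levenshtein distance and keeps the most central reading of the largest group. This is not GDELT's actual pipeline: the function names, the 25% distance threshold, the greedy clustering strategy and the sample chyron text are all assumptions made for the example, and the language modeling step is left out entirely.

```python
from collections import defaultdict


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def cluster_readings(readings, max_ratio=0.25):
    """Greedily group readings that lie close to a cluster's first member.

    max_ratio is a hypothetical threshold: two readings are grouped when
    their edit distance is at most 25% of the longer string's length.
    """
    clusters = []
    for text in readings:
        for cluster in clusters:
            seed = cluster[0]
            limit = max_ratio * max(len(seed), len(text), 1)
            if levenshtein(seed, text) <= limit:
                cluster.append(text)
                break
        else:
            clusters.append([text])
    return clusters


def best_reading(readings):
    """Return the medoid (smallest total distance) of the largest cluster."""
    largest = max(cluster_readings(readings), key=len)
    return min(largest, key=lambda t: sum(levenshtein(t, o) for o in largest))


def best_per_minute(samples):
    """Map (second_offset, ocr_text) pairs to {minute: best text}."""
    by_minute = defaultdict(list)
    for second, text in samples:
        text = text.strip()
        if text:  # skip seconds where the OCR returned nothing
            by_minute[second // 60].append(text)
    return {minute: best_reading(r) for minute, r in sorted(by_minute.items())}


# Invented sample data: noisy per-second readings of two chyrons.
samples = [
    (0, "BREAKING NEWS: SENATE VOTES"),
    (1, "BREAKING NEWS: SENATE V0TES"),
    (2, "BREAKING NEWS: SENATE VOTES"),
    (3, "BREAK1NG NEWS: SENATE VOTES"),
    (4, "WEATHER"),
    (60, "MARKETS CLOSE HIGHER"),
    (61, "MARKETS CL0SE HIGHER"),
]
print(best_per_minute(samples))
```

Choosing the medoid of the largest cluster favors the reading that agrees most closely with its neighbors, so an isolated OCR slip in a single second is unlikely to be selected.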

To date, these chyrons have primarily been examined manually by journalists looking to compare the coverage of major events across the four stations. Typically, the side-by-side comparison viewer is used to step through the chyrons minute by minute and flag periods of particular difference.

The new LowerThird Television News Chyron Search API allows you to run full-text searches over these chyrons using an interface almost identical to that of the Television News API.
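
To give a feel for what a request might look like, here is a minimal Python sketch that assembles a search URL without sending it. The endpoint path and the parameter names (query, mode, format), along with the "timelinevol" mode value, are assumptions modeled on the Television News API rather than details confirmed here, so check them against the full documentation before relying on them. The helper name build_chyron_url is likewise hypothetical.

```python
from urllib.parse import quote, urlencode

# Assumed endpoint, patterned after the Television News API; verify against
# the full documentation before use.
BASE_URL = "https://api.gdeltproject.org/api/v2/lowerthird/lowerthird"


def build_chyron_url(query, mode="timelinevol", fmt="json"):
    """Build a chyron search URL (hypothetical helper; no request is sent).

    query: keyword or quoted phrase to search for in the chyron text.
    mode:  assumed output mode name, borrowed from the Television News API.
    fmt:   assumed output format name.
    """
    params = {"query": query, "mode": mode, "format": fmt}
    # quote_via=quote encodes spaces as %20 rather than '+'.
    return BASE_URL + "?" + urlencode(params, quote_via=quote)


# Phrase search: the double quotes ask for the exact phrase.
print(build_chyron_url('"tax cuts"'))
```

Building the URL separately from fetching it makes it easy to paste the result into a browser and inspect the response before automating anything.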

Note that the current chyrons contain a very high level of OCR error and the chyron feed for each station can experience outages. Results are therefore incomplete, but they still offer a useful holistic summary of chyron appearances.

QUICK START EXAMPLES

Here are some simple examples to get you started.

FULL DOCUMENTATION

The GDELT LOWERTHIRD 2.0 API is accessed via a simple URL with the following parameters. Under each parameter is the list of operators that can be used as the value of that parameter.