The GDELT Project

Television News Ngram 2.0 Dataset: Quadgrams & 5-Grams Now Available!

Last week we unveiled the new Television News Ngram 2.0 Dataset of word frequency ngrams in unigram, bigram and trigram formats at 10 minute resolution for 14 stations, some stretching back more than a decade.

Today we're excited to announce that we've added quadgrams and 5-grams to this list to allow for more advanced kinds of contextualized linguistic analysis!

Learn More.