The GDELT Project

  • The GDELT Project Blog
  • Website

Television News Ngram 2.0 Dataset: Quadgrams & 5-Grams Now Available!

 June 14, 2020

Last week we unveiled the new Television News Ngram 2.0 Dataset of word frequency ngrams in unigram, bigram and trigram formats at 10 minute resolution for 14 stations, some stretching back more than a decade.

Today we're excited to announce that we've added quadgrams and 5-grams to this list to allow for more advanced kinds of contextualized linguistic analysis!

Learn More.

Post navigation

← Graph Hawkes Neural Network For Forecasting On Temporal Knowledge Graphs
University de Montemorelos: Application of Data Science to Discover Violence-Related Issues in Iraq →

Archives

The Official GDELT Project Blog