The GDELT Project

  • The GDELT Project Blog
  • Website

A Quarter-Billion Global Similarity Graph Document Embeddings Now Available Back to 2020

 August 23, 2021

We are excited today to announce that the Global Similarity Graph (GSG) Document Embeddings dataset has been extended back to January 1, 2020 and now covers more than a quarter-billion articles, each represented as a 512-dimension Universal Sentence Encoder v4 document-level embedding!

Learn More.

Post navigation

← Which Administration Officials Are Telling The Afghanistan Withdrawal Story?
More Experiments Using Video AI To Scan Television News For Specific Books →

Archives

The Official GDELT Project Blog