The GDELT Project

  • The GDELT Project Blog
  • Website

Global Embedded Metadata Graph (GEMG) Reaches 550GB Of JSON-LD

 May 29, 2021

For those interested in exploring how Schema.org and JSON-LD are being used across the world's news websites, the Global Embedded Metadata Graph (GEMG) has reached 550GB of JSON-LD across 740 million articles dating back to July 2018!

Computing that total took just a single SQL query in BigQuery:

SELECT sum(length(entry)) FROM `gdelt-bq.gdeltv2.gemg`, UNNEST(jsonld) entry

Learn More.

Post navigation

← How Have Cryptocurrencies Been Covered On Television News?
Global Frontpage Graph (GFG) Reaches 280 Billion Records →

Archives

The Official GDELT Project Blog