The GDELT Project

Global Embedded Metadata Graph (GEMG) Reaches 550GB Of JSON-LD

For those interested in exploring how Schema.org and JSON-LD are being used across the world's news websites, the Global Embedded Metadata Graph (GEMG) has reached 550GB of JSON-LD across 740 million articles dating back to July 2018!

Computing that total took just a single SQL query in BigQuery:

SELECT sum(length(entry)) FROM `gdelt-bq.gdeltv2.gemg`, UNNEST(jsonld) entry

Learn More.