For those interested in exploring how Schema.org and JSON-LD are being used across the world's news websites, the Global Embedded Metadata Graph (GEMG) has reached 550GB of JSON-LD across 740 million articles dating back to July 2018!
Computing that total took just a single SQL query in BigQuery:
SELECT SUM(LENGTH(entry)) AS total_length FROM `gdelt-bq.gdeltv2.gemg`, UNNEST(jsonld) AS entry
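To make the mechanics of that query explicit: the comma join against UNNEST(jsonld) produces one output row per JSON-LD block in each article's repeated jsonld field, LENGTH measures each block, and SUM adds the results. The following is only a minimal Python sketch of that logic, not a way to query the real dataset. The sample rows and the helper name `total_jsonld_length` are hypothetical and exist purely for illustration.

```python
# Hypothetical stand-in for a few GEMG rows. Each row carries a repeated
# `jsonld` field (a list of JSON-LD strings); the values here are made up.
rows = [
    {"jsonld": ['{"@type":"NewsArticle"}', '{"@type":"Person"}']},
    {"jsonld": []},  # an article with no JSON-LD yields no unnested rows
    {"jsonld": ['{"@type":"WebPage"}']},
]


def total_jsonld_length(rows):
    """Mirror SUM(LENGTH(entry)) over `table, UNNEST(jsonld) entry`.

    The nested loop plays the role of the comma join with UNNEST:
    every element of every row's array becomes one `entry`.
    """
    return sum(len(entry) for row in rows for entry in (row["jsonld"] or []))


total = total_jsonld_length(rows)
print(total)
```

One caveat worth keeping in mind: assuming the jsonld entries are STRING values, BigQuery's LENGTH counts characters rather than bytes (just as `len` does for a Python string), so BYTE_LENGTH would be the function to use if an exact byte count is needed.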