Announcing The Global Similarity Graph Document Embeddings Using The Universal Sentence Encoder

Today we are tremendously excited to announce the debut of the Global Similarity Graph Document Embeddings, a realtime database of…

Mapping The Media: A Geographic Lookup Of GDELT's Sources 2015-2021

Three years ago we showed how a single SQL query in BigQuery could process the entire GKG 2.0 and compile…

Global Similarity Graph: Visualizing Language Overlap

Using the new Global Similarity Graph, a single SQL query can visualize language overlap using BigQuery + Gephi. Note that since…

Global Similarity Graph: Visualizing How Similar News Websites Are

Using the new Global Similarity Graph, a single SQL query can visualize story overlap between news outlets using BigQuery + Gephi…

Global Similarity Graph: Finding "Articles Like This"

Using the new Global Similarity Graph, it is trivial to find articles similar to a given URL. Using the BigQuery…

Announcing The Global Similarity Graph

We are tremendously excited to announce today the debut of the GDELT Global Similarity Graph (GSG), which computes the pairwise…

Identifying Breaking News Stories Across The World With Google’s Timeseries Insights API

GDELT today encompasses more than 8.4 trillion datapoints spanning global events and narratives in 152 languages across text, television, radio…

Global Embedded Metadata Graph (GEMG) Reaches 550GB Of JSON-LD

For those interested in exploring how Schema.org and JSON-LD are being used across the world's news websites, the Global Embedded…

Excluding Advertisements From VGEG Video AI Analysis Of Television News

Last April we showed how to analyze top visual trends by day across television news using the Visual Global Entity…

Announcing The Global Numeric Graph

We are tremendously excited to announce today the debut of the GDELT Global Numeric Graph (GNG), which compiles appearances of…

A TF-IDF Chronology Of The Global Geographic Graph Of The Top Terms Associated With Italy Jan-Apr 2020

How might we apply a TF-IDF analysis to the contextual field of the Global Geographic Graph of English language online…

The Nuanced Networking Complexities Of Building Global-Scale Infrastructure

One of the greatest complexities in building global-scale geographically distributed systems that move vast amounts of data around the world…

Visualizing The Global Covid-19 "Infodemic": Using Maps, Charts And Timelines To Visualize The Global Media Narrative

Through the Media-Data Research Consortium (M-DRC)'s Google Cloud COVID-19 Research Grant “Quantifying the COVID-19 Public Health Media Narrative Through TV…

Updated Dr.'s Chronology From Television News OCR: 2020-2021

We've updated our chronology of Dr.'s from the onscreen OCR'd text of television news that we first released early last year. The latest…

Using The Global Quotation Graph To Track Public Statements About Mis/Disinformation

How might we use the Global Quotation Graph to track global public commentary about mis/disinformation? In short, to compile a list…

Research At Archives Scale: How The Cloud Makes Petascale Analysis Trivial

It is remarkable that in an era in which petabytes have become commoditized, petascale analyses still drive fear into the…

Trump's Tweets Capture Media Attention Again In Aftermath Of Capitol Storming

How often are Trump's tweets embedded or linked to in worldwide online news coverage? The timeline below shows the total number…

A Behind-The-Scenes Look At Our Petascale Video Processing Architecture For Cloud Video AI-Powered Annotation

How do we process all of the video we analyze each day through Google's Cloud Video API? What does a…

A Deeper Dive Into Uncaptioned Television News Airtime

Diving deeper into the question of what is driving the high levels of uncaptioned airtime in the 2009-2013 era, we…

Advertising Versus Uncaptioned Television News Airtime Using The Advertising Airtime Dataset 2009-2020

UPDATE: A new version of this analysis is available. Earlier today we offered a first glimpse at some of the…

An Early Look At Television News Advertising Airtime Trends 2009-2020

Update (12/27/2020): A deep dive explanation on these trends is now available. Using our massive new Television News Advertising Inventory…

Announcing The Television News Advertising Inventory Files (AIF) Captioning Time Dataset

We are tremendously excited to announce today the debut of the new Advertising Inventory Files (AIF) dataset for television news…

Understanding Television News Through Onscreen Text OCR

While much of the work on television news has focused on speech recognition and caption search, through the Visual Global…

A Glimpse Behind The Scenes At How We Perform Mass Ingest And Transformation Workflows In the Cloud

How do we orchestrate our mass ingest and transformation workflows when performing some of our largest analyses? Once data is…