Continue Reading

Excluding Advertisements From VGEG Video AI Analysis Of Television News

Last April we showed how to analyze top visual trends by day across television news using the Visual Global Entity…

Continue Reading

Announcing The Global Numeric Graph

We are tremendously excited to announce today the debut of the GDELT Global Numeric Graph (GNG), which compiles appearances of…

Continue Reading

A TF-IDF Chronology Of The Global Geographic Graph Of The Top Terms Associated With Italy Jan-Apr 2020

How might we apply a TF-IDF analysis to the contextual field of the Global Geographic Graph of English language online…

Continue Reading

The Nuanced Networking Complexities Of Building Global-Scale Infrastructure

One of the greatest complexities in building global-scale geographically distributed systems that move vast amounts of data around the world…

Continue Reading

Visualizing The Global Covid-19 "Infodemic": Using Maps, Charts And Timelines To Visualize The Global Media Narrative

Through the Media-Data Research Consortium (M-DRC)'s Google Cloud COVID-19 Research Grant “Quantifying the COVID-19 Public Health Media Narrative Through TV…

Continue Reading

Updated Dr.'s Chronology From Television News OCR: 2020-2021

We've updated our chronology of Dr.'s from the onscreen OCR'd text of television news that we first released early last year. The latest…

Continue Reading

Using The Global Quotation Graph To Track Public Statements About Mis/Disinformation

How might we use the Global Quotation Graph to track global public commentary about mis/disinformation? In short, to compile a list…

Continue Reading

Research At Archives Scale: How The Cloud Makes Petascale Analysis Trivial

It is remarkable that in an era in which petabytes have become commoditized, petascale analyses still drive fear into the…

Continue Reading

Trump's Tweets Capture Media Attention Again In Aftermath Of Capitol Storming

How often are Trump's tweets embedded or linked to in worldwide online news coverage? The timeline below shows the total number…

Continue Reading

A Behind-The-Scenes Look At Our Petascale Video Processing Architecture For Cloud Video AI-Powered Annotation

How do we process all of the video we analyze each day through Google's Cloud Video API? What does a…

Continue Reading

A Deeper Dive Into Uncaptioned Television News Airtime

Diving deeper into the question of what is driving the high levels of uncaptioned airtime in the 2009-2013 era, we…

Continue Reading

Advertising Versus Uncaptioned Television News Airtime Using The Advertising Airtime Dataset 2009-2020

UPDATE: A new version of this analysis is available. Earlier today we offered a first glimpse at some of the…

Continue Reading

An Early Look At Television News Advertising Airtime Trends 2009-2020

Update (12/27/2020): A deep dive explanation on these trends is now available. Using our massive new Television News Advertising Inventory…

Continue Reading

Announcing The Television News Advertising Inventory Files (AIF) Captioning Time Dataset

We are tremendously excited to announce today the debut of the new Advertising Inventory Files (AIF) dataset for television news…

Continue Reading

Understanding Television News Through Onscreen Text OCR

While much of the work on television news has focused on speech recognition and caption search, through the Visual Global…

Continue Reading

A Glimpse Behind The Scenes At How We Perform Mass Ingest And Transformation Workflows In the Cloud

How do we orchestrate our mass ingest and transformation workflows when performing some of our largest analyses? Once data is…

Continue Reading

Identifying Invalid JSON-LD Through The Global Embedded Metadata Graph

How might we use the Global Embedded Metadata Graph to identify invalid JSON-LD code in news articles? The following query…

Continue Reading

Using The Global Embedded Metadata Graph To Explore Trends In JSON-LD

How can we use the new Global Embedded Metadata Graph to explore some of the trends in metadata usage in a day…

Continue Reading

Announcing The Global Embedded Metadata Graph

We are enormously excited today to announce the debut of the Global Embedded Metadata Graph (GEMG), which records the hidden…

Continue Reading

Comparing The 2009-2020 Readability Scores Of CNN, MSNBC And Fox News

Yesterday we unveiled a massive new dataset of television news readability scores for 23 stations spanning portions of the past…

Continue Reading

The Methodological Challenges Of Interpreting Readability Scores For Spoken Word Television News Transcripts

Yesterday we announced the new Television News Readability Scores Dataset using data from the Internet Archive's Television News Archive. One…

Continue Reading

Television News Readability Scores Dataset 2009-2020

UPDATE: See this deep dive on the influence of captioner shifts and the difference between transcripts and captioning in results….

Continue Reading

Evaluating The "Readability" Of Online COVID-19 News Coverage

Using our massive new dataset of readability scores of worldwide English language news coverage of 2020, how might we evaluate…

Continue Reading

Readability Scores Dataset For Worldwide Online News Coverage in 2020

As we seek new ways of assessing and understanding how news conveys the world each day, one thread of our…