Continue Reading

Using Algorithmic Thumbnail Generation To Yield Better Single-Image Thumbnail Representations

Over the past several days we've explored applying ffmpeg's built-in representative frame detection "thumbnail" filter, returning to our earliest experiments…

Continue Reading

ChatGPT's o1-Preview "Reasoning Model" Fairs Little Better Than 4o On Our Thumbnail Experiments

Yesterday we explored the ability of ChatGPT's 4o foundational LLM to act as a brainstorming partner in the development of…

Continue Reading

ChatGPT's Unhelpful Take On Visually Summarizing Television News Broadcasts Through Thumbnails

Continuing our series on selecting "representative" frames from television news broadcasts to visually summarize them, thus far we've been experimenting…

Continue Reading

Visually Summarizing Television News Broadcasts Through Thumbnails: Algorithmic Vs Time-Based Experiments: Part 2

Continuing our experiments from yesterday creating representative thumbnails of a television news broadcast, let's look at how the two approaches…

Continue Reading

Scaling New Heights: Transformative Cross-GPU Sampling for Training Billion-Edge Graphs

A new paper by researchers at Wuhan University, NVIDIA and the University of Macau: Efficient training of Graph Neural Networks…

Continue Reading

Visually Summarizing Television News Broadcasts Through Thumbnails: Algorithmic Vs Time-Based Experiments

The Visual Explorer visually summarizes television news broadcasts through fixed 1/4fps thumbnail grids – an approach developed through extensive human…

Continue Reading

Comparing Television News Coverage Of Trump Vs Kamala Vs Biden

How are the two present and one former presidential candidates being covered on television news? The timeline below compares their…

Continue Reading

A Digital Twin Glimpse At The Internet Archive's TV News Archive: 27 Billion Seconds Over 10.9 Million Broadcasts From 327 Channels In 50+ Countries & 150+ Languages Over A Quarter-Century

Using our new BigQuery + Bigtable GCS digital twin, we can look across the entire Internet Archive's TV News Archive…

Continue Reading

Behind The Scenes: Network Intelligence Topology Mapping By GCP Service

Last year we explored how GCP's Network Intelligence Network Topology mapping can be used to understand the network flows across…

Continue Reading

WashPost: No Amount Of Evidence Will Convince Republicans Of Trump's 2020 Guilt

The Washington Post's Philip Bump examines media coverage of Donald Trump. Read The Full Article.

Continue Reading

TE-LSTM: A Prediction Model for Temperature Based on Multivariate Time Series Data

In the era of big data, prediction has become a fundamental capability. Current prediction methods primarily focus on sequence elements;…

Continue Reading

Compiling A List Of All Non-News Broadcasts In The TV News Archive From Business Channels Over The Past Decade: Part 2

Last month we compiled a list of distinct show names marked as non-news from each of the three business news…

Continue Reading

WashPost: Mark Robinson Offers Up The 2024 Version Of The I-Was-Hacked Defense

The Washington Post's Philip Bump examines media coverage of Mark Robinson. Read The Full Article.

Continue Reading

Using Our BigQuery + Bigtable + GCS Digital Twin To Track Historical Backfilling Progress

With our new BigQuery + Bigtable digital twin over our GCS archive, we can trivially compile ongoing inventories of our…

Continue Reading

Experiments With CCExtractor Using Our BigQuery + Bigtable + GCS Digital Twin

In December 2020 we unveiled a massive new initiative in collaboration with the Internet Archive's TV News Archive to catalog…

Continue Reading

Using Our BigQuery + Bigtable + GCS Digital Twin To Make Date-Based Random Samples For Content Analysis & Testing

A key concept in "content analysis" methodologies over large temporally diverse archives is the notion of time-based random samples: creating…

Continue Reading

Using Our BigQuery + Bigtable + GCS Digital Twin To Identify Missing Channels

One of the most powerful aspects of our BigQuery-analyzable Bigtable-based GCS digital twin is the capability it makes possible to…

Continue Reading

CJR: Can Kamala Harris Use The Debate To Keep Her Media Momentum?

Meghnad Bose and Dhrumil Mehta examine media coverage of Kamala Harris in a piece for Columbia Journalism Review (CJR). Read…

Continue Reading

How Much Attention Have Presidential Year Debates Gotten On TV News Over The Years? Hint: 2012 Was The Recent Peak

How much attention do presidential year debates get on television news? The timeline below shows total mentions of "debate" across…

Continue Reading

How Are Business Television News Channels Covering Bitcoin & Crypto?

The timeline below shows the percentage of daily airtime (in 15 sec blocks) across Bloomberg, CNBC and Fox Business over…

Continue Reading

Leveraging Bigtable For Highly Scalable Digital Twin Architectures

As we continue to load our entire historical GCS archive into our Bigtable digital twin, BigTable's remarkable scalability has allowed…

Continue Reading

The Covid-Era Focus On "Experts" Has Faded

During the pandemic, mentions of "experts" were everywhere as news media emphasized the credentials and expertise of those they interviewed….

Continue Reading

Scaling In The Cloud: Storing Billions Of Files Totaling Petabytes In GCS

One of the most remarkable aspects of working at "cloud scale" is the sheer scalability of the modern public cloud….

Continue Reading

OCR'ing Television News: Comparing GCP Cloud Vision API, Paligemma, Tesseract, Gemini 1.5 Pro, Gemini 1.5 Flash & GPT 4o

Television news in a number of countries contains copious onscreen text scattered across multiple locations on the screen, in multiple…