Author: Kalev Leetaru
Behind The Scenes: Some Initial Archive-Scale Closed Captioning Statistics
Only a portion of the TV News Archive's broadcasts contain broadcaster-provided closed captioning, but by virtue of being largely human-transcribed…
PolitiFact: Early Reports of Four Survivors Show “This DC Plane Crash Story Isn’t Adding Up.”
A PolitiFact look at the plane crash in DC. Read The Full Article.
At-Scale OCR Of Television News: 18.8 Billion Seconds Of Global Television News OCR'd For $71K Vs $47M
We are tremendously excited to announce today that in collaboration with the Internet Archive's Television News Archive, we have completed…
NBC News: How 'Dr. Phil' suddenly became so outspoken about immigration
An analysis of Dr. Phil's statements on immigration. Read The Full Article.
Snopes: Misleading Rumor Claims Trump Admin Found 75K-80K 'Missing' Migrant Children
Snopes uses the TV News Archive in its fact check of claims about migrant children. Read The Full Article.
PolitiFact: Republican leaders want to put conditions on California wildfire aid. Is there precedent for that?
A PolitiFact analysis on California wildfire aid. Read The Full Article.
Spatiotemporal Evolution of Hedging Effects in Asia-Pacific Countries Amid Sino-US Competition: Insights From Massive Event Data
Facing the pressure of Sino-US strategic competition, countries in the Asia-Pacific region often adopt hedging strategies to minimize risk and…
Behind The Scenes: Identifying Mismatches Between Expected And Real Video File Durations & Single Version Of The Truth (SVOT)
One of the most complex and time-consuming aspects of working with vast historical archives is diagnosing and addressing the myriad…
At-Scale OCR Of Television News Experiments: OCR Of Interlaced Video Using GCP's Cloud Vision
Amongst the TV News Archive's quarter-century of global broadcasts are interlaced broadcasts, which produce the tell-tale jagged ghosting seen below…
At-Scale OCR Of Television News Experiments: Optimizing The Still Frame File Storage Format
Analyzing petascale video archives poses unique computational challenges, from the underlying processor and accelerator requirements to simply moving that much…
The New York Times: Five Presidents and a Funeral
The New York Times' Maureen Dowd. Read The Full Article.
From LSMs To LMMs For ASR: Evaluating Gemini's Performance At Transcribing An Evening News Broadcast
As we continue to evaluate the rapid progress of large model ASR systems, from lightly to heavily generative LSMs to…
Comparing GCP's Chirp & Chirp 2 ASR Models: Dropping Entire Passages
Yesterday we examined how GCP's new Chirp 2 ASR model hallucinates speech during non-verbal musical interludes in news broadcasts, resulting…
Comparing GCP's Chirp & Chirp 2 ASR Models: Hallucinating Speech During Music
Over the past six months we have continued to compare GCP's Chirp and Chirp 2 ASR models, each time finding…
Audience-Specific Podcasts: Customizing Our Daily "Top Stories" Biosurveillance Podcast Concept For Experts, Policymakers & The American Public
Yesterday we demonstrated feeding a daily roundup of global disease outbreak news headlines from around the world into a "thinking"…
A Daily "Top Stories" Global Disease Outbreak Podcast Concept Using GCP's Gemini 2.0 Thinking + Text-to-Speech API
What might it look like to feed a daily roundup of global disease outbreak news headlines in all the world's…
Using GCP's Chirp + Gemini 1.5 Pro + Speech-To-Text API To Summarize A Day Of Russian TV News Into A 3 Minute "Top Stories" Podcast
What might it look like to use GCP's Speech-to-Text API's Chirp LSM model to machine transcribe a full day of…
A Daily "Top Stories" Global Investment News Podcast Concept Using GCP's Gemini 2.0 Thinking + Text-to-Speech API
What might it look like to feed a daily roundup of global investment news headlines in all the world's languages…
A Daily "Top Stories About NVIDIA" News Podcast Concept Using GCP's Gemini 2.0 Thinking + Text-to-Speech API
What might it look like to feed a daily roundup of news headlines about NVIDIA from across the world in…
AFP: US Conservatives Baselessly Tie New Orleans Attacker to Illegal Immigration
AFP Fact Check about the New Orleans attack. Read The Full Article.
At-Scale OCR Of Television News Experiments: OCR'ing 10 Billion Seconds Of Global TV News For Just $47.5K Vs $26.9M
In collaboration with the Internet Archive's Television News Archive, we have successfully OCR'd 4.2 million television news broadcasts from around…
Behind The Scenes: Identifying Failed Recordings: Using Large Multimodal Models Like ChatGPT & Gemini: Part 3
Continuing our series examining whether Large Multimodal Models (LMMs) like ChatGPT and Gemini might be able to help us identify…
Behind The Scenes: Identifying Failed Recordings: Using Large Multimodal Models Like ChatGPT & Gemini: Part 2
Earlier this week we demonstrated the limitations of using Large Multimodal Models (LMMs) like ChatGPT and Gemini to detect corrupted…
Behind The Scenes: Identifying Failed Recordings: Using Large Multimodal Models Like ChatGPT & Gemini
As we continue our efforts to scan the TV News Archive for failed recordings, how might Large Multimodal Models (LMMs)…