Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Sample Circa-2018 Amharic Broadcast – Part 2: Onscreen Documents

Continuing yesterday's examination of OCRing an Amharic-language broadcast, towards the end of that broadcast is a fascinating example of a…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Sample Circa-2018 Amharic Broadcast – Part 1

Continuing our OCR experiments applying GCP's Cloud Vision API to global television news broadcasts using montaging, below is an excerpt…

Continue Reading

At-Scale OCR Of Television News Experiments: What Have We Learned So Far?

In collaboration with the Internet Archive's TV News Archive, we are working to OCR the Archive's entire 7-million-hour quarter-century archive…

Continue Reading

AFP: Teary-Eyed Trudeau Video is Years Old, Unrelated to Trump Tariffs

AFP Fact Check on Trudeau. Read The Full Article.

Continue Reading

CNN: Hegseth Has A History of Supporting Controversial Policies Involving The Military

A CNN deep dive on Pete Hegseth. Read The Full Article.

Continue Reading

Digital Innovation Towards The Sustainable Development Goals: A Mass Media Analysis

Innovation and technology are essential to reduce the environmental impact of human activities and face the derived environmental and social…

Continue Reading

Behind The Scenes: The Perils Of AI-Powered Autonomous Agents In The Real World

We continue to explore the landscape of AI-powered autonomous agents. Despite their immense hype and ubiquitous mediagenic demos on social…

Continue Reading

GCP Tips & Tricks: Using The Cloud Monitoring API To Track AI API Usage In Realtime

Yesterday we discussed how we massively optimized our archive-scale OCR throughput by splitting montaging and OCR workloads. When working at the…

Continue Reading

Behind The Scenes: Splitting Workloads: OCR Montage Generation Vs API Calls

As we continue to scale up our work OCR'ing a quarter century of global television news broadcasts, one of the…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Tunisian Broadcast

Below is an example of our Cloud Vision montage pipeline's transcription of a Tunisian broadcast from 2014, showing how it…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Turkish TRT 1 Broadcast

Continuing our Cloud Vision television news OCR montaging experiments, below is a circa-2012 example from Turkish broadcaster TRT 1, showing…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Thai Broadcast

Continuing our Cloud Vision television news OCR montaging experiments, below is a circa-2012 example of a Thai television news broadcast, showing…

Continue Reading

At-Scale OCR Of Television News Experiments: Remarkable Results From A Circa-2012 320×240 Pixel Syrian Broadcast

Continuing our Cloud Vision television news OCR montaging experiments, below is a truly remarkable circa-2012 example from Syrian television. The…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Sample Quarter-Century-Old Chinese Language Broadcast

Thus far, we've demonstrated the results of applying GCP Cloud Vision to OCR a grid montage of 1fps frames from…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Sample Circa-2010 SD-Resolution Arabic-Language Broadcast

Earlier this week we demonstrated the results from applying GCP Cloud Vision to a grid montage of 1fps still frames…

Continue Reading

At-Scale OCR Of Television News Experiments: Two Interesting Examples Of The OCR & Inconsistencies Of OCR At Scale

Earlier this week we explored the ability of GCP's Cloud Vision API to robustly OCR a grid of 224 video…

Continue Reading

Snopes: Video Does Not Show Biden Wandering Off into Amazon Rainforest

Snopes uses the TV News Archive to verify that a clip seen on social media is authenticate, but used out…

Continue Reading

At-Scale OCR Of Television News Experiments: Results From A Sample Circa-2010 SD-Resolution English-Language Broadcast

Last week we explored how combining large numbers of video frames into a single grid-based montage allows us to vastly…

Continue Reading

Why Large Multimodal Models (LMM) Like ChatGPT Are Unsuitable For Production OCR

While Large Multimodal Models (LMMs) are increasingly positioned as replacements for the full range of classical AI systems like OCR,…

Continue Reading

Tomorrow: Don't Inform, Inflame: AI's Limitations

For those here at Web Summit 2024 in Lisbon, head to Stage 15 tomorrow to see Kalev's stage talk "Tomorrow:…

Continue Reading

This Week: Web Summit 2024 In Lisbon

Join Kalev at Web Summit 2024 in Lisbon this week, which is drawing more than 70,000 attendees from over 160…

Continue Reading

At-Scale OCR Of Television News Experiments: More Grid Layout Experiments

As we ramp up our work to expand OCR to the complete Internet Archive Television News Archive, the key to…

Continue Reading

Comparing The New Visual Explorer Intelligent Thumbnail Generator Vs The Old Fixed Thumbnails

The current Visual Explorer represents each television news broadcast using a single frame extracted exactly 60 seconds into the broadcast….

Continue Reading

The Fascinating Findings Of Applying Advanced OCR To Television News: Transcribing Projected Text

As we continue ramping up our experiments applying GCP's Cloud Vision API OCR to television news, it never ceases to…