Google Developer Experts' DevFest: Experts Bootcamp

Kalev is back in Silicon Valley today for the Google Developer Experts' DevFest: Experts Bootcamp event at Google.

WashPost: Kevin McCarthy And His Caucus Contest Valuable Terrain: Attention

The Washington Post's Philip Bump examines media attention to the GOP caucus.

Iran's PressTV Covers Israeli Societal Unrest

Iran has been devoting airtime to covering societal unrest in Israel over proposed and enacted judicial reforms, portraying itself as…

Nature: Understanding Urban Flood Resilience In Chinese Sponge Cities Via GDELT GKG Analyses

Media data analytics has become prominent in enhancing urban flood resilience. Most of the media articles were posted during the…

Transcribing 9.5M Minutes Of Global Television News Through Google's Chirp

The TV News Visual Explorer encompasses selections of television news from 108 channels spanning 50 countries and territories in 35 languages…

The Perils Of LLMs For Translation Tasks On Lower-Resource Languages: Estonian Noun Declension

One of the great promises of large language models (LLMs) is their ability to revolutionize translation and linguistic tasks. A…

Google's Chirp & Truly Multilingual Global Speech Transcription: An Example Of Three Languages In 60 Seconds

Google's new Universal Speech Model, Chirp, is a large speech model that offers state-of-the-art speech transcription…

Coming Soon: 6.6M Minutes Transcribed Through Google's Chirp

Our forthcoming Chirp ASR demo has reached 6.6 million minutes (110,000 hours) of global television news!

Temporal Relationship Between Daily Reports Of COVID-19 Infections And Related GDELT And Tweet Mentions

Social media platforms are valuable data sources in the study of public reactions to events such as natural disasters and…

The Rich Multilingual World Of Global Television News: New Possibilities For Understanding Codeswitching In News

For those accustomed to the monolingual English-only broadcasts of the United States, it can be nearly impossible to imagine the…

More Multilingual Transcription With GCP's Chirp: An Arabic-English South Sudan Broadcast

While GCP's new Chirp ASR model does not officially support multilingual audio content, it has proven highly adept at correctly…

Coming Soon: 2M Minutes Transcribed Through Google's Chirp

Our forthcoming Chirp ASR demo has reached 2 million minutes of global television news!

Tucker Carlson Still Making Appearances On Russian Television

Despite having left Fox News months ago, Tucker Carlson is still appearing in excerpts on Russian television news, such as…

A Vision For The Future Of LLM Trust & Safety: From Consumer Toy To Behavioral Enterprise Guardrails

Amongst the myriad challenges involved in deploying LLMs in the enterprise, perhaps the least appreciated is the impact of consumer-centric…

Coming Soon: 1.1M Minutes Transcribed Through Google's Chirp

Stay tuned for a forthcoming major announcement regarding more than 1.1 million minutes of global television news transcribed through Google's…

Large-Token LLMs: Using GCP's New 32K PaLM Models To Summarize An Entire ABC Evening News TV News Broadcast In A Single Prompt

This past July we demonstrated the use of Anthropic's 100K-token Claude 2 LLM to summarize an entire evening news…

Large-Token LLMs: Leveraging GCP PaLM 2's 32K Token Limit To Summarize An Entire Episode Of Russia's 60 Minutes – Part 2

Earlier today we examined how GCP's new 32K token models summarize a Russian television news broadcast, finding heavy hallucination and…

Large-Token LLMs: Leveraging GCP PaLM 2's 32K Token Limit To Summarize An Entire Episode Of Russia's 60 Minutes

This past July we explored how large-token LLMs like Anthropic's Claude 2 can be used to summarize large texts. With…

WashPost: The Media-Elite Narrative About Hunter Biden Sits On The Right

The Washington Post's Philip Bump explores how the media is covering the Hunter Biden story.

Creating A Daily Disease Summarizer Using The DOC 2.0 API + PaLM/ChatGPT

Yesterday we explored different workflows for using the DOC 2.0 API to track global disease outbreaks using LLMs to summarize…

How Tucker Carlson Is Being Covered On TV News

The timeline below shows that Tucker Carlson has all but disappeared on Fox News and that even CNN and MSNBC…

DOC 2.0 API + PaLM/ChatGPT = Weekly Disease Summarization: Single Or Three-Part Prompts?

Following on our experiments with combining the DOC 2.0 API and LLMs like PaLM or ChatGPT to summarize global headlines,…

Discrimination, Bias & Western Utopia In LLM-Based Machine Translation

Yesterday we explored how LLM-based machine translation poses novel challenges to translation workflows, encoding a set of "Western values" that…

When Machine Translation Begins To Encode "Values": LLMs, Editorialization & Guardrails

For the long history of machine translation, from rules-based to SMT to NMT, translation systems were designed to be neutral…