Continue Reading

Embedding Models: Clustering COVID-19 Versus "Poxes"

Following on our "mpox" versus "monkeypox" experiment, let's use that same embedding visualization template to cluster a broader set of…

Continue Reading

Embedding Models: Revisiting Multilingual Embedding Through Visualization

Using our new embedding visualization template, let's revisit our multilingual embedding experiment and visualize how each embedding model clusters our…

Continue Reading

Embedding Models: Capitalization & Knowledge Cutoffs Part 2

Yesterday we introduced a Colab notebook template for visualizing embedding models and explored the impact of capitalization, word spacing and…

Continue Reading

A Template For Visually Comparing Embedding Models + Exploring Capitalization, Spacing & Knowledge Cutoffs

Embeddings are designed to look beyond the words on a page to the semantic concepts they represent, allowing a search…

Continue Reading

Visual Explorer: New Offset Referencing For Archive URL Alignment

Earlier this month we introduced direct ImageID referencing to the Visual Explorer, allowing you to specify a specific ImageID for…

Continue Reading

Multilingual Embedding For LLM External Memory & Semantic Search: Universal Sentence Encoder Family, LaBSE & Vertex AI Embeddings for Text

Embeddings have emerged in recent years as the go-to approach for semantic search. With the rise of Large Language Models…

Continue Reading

Generative AI: Translation APIs Versus LLM For Social Media Translation

Large Language Models can perform a wide array of tasks, including textual translation. Given the widespread availability of existing dedicated…

Continue Reading

The Predictability Of Global Events: How An LLM Predicted The Outcome Of Türkiye's Presidential Election

With the Turkish presidential election this past weekend, we wanted to test how well some of the major commercial Large…

Continue Reading

The Language Bias Of Large Language Models: Why Myanmar Is 1300% More Costly Than English In GPT-3

Large Language Models (LLM's) like OpenAI's ChatGPT and Google's Bard interpret language not as discrete words, but as word-parts known…

Continue Reading

Connecting The TV Explorer & Visual Explorer For Seamless CSPAN Search & Legislative Deep Linking

Earlier this week we unveiled a powerful new interface to CSPAN as part of our Visual Explorer Lenses initiative. When…

Continue Reading

Semantic Narrowing: Towards A Calculable Descriptive Statistic Associated With Press Freedom And Authoritarianism

This paper proposes a novel measure for operationalizing authoritarianism: the narrowing of semantic dispersion. This paper defines semantic dispersion as…

Continue Reading

Tomorrow: Connecting The TV Explorer & Visual Explorer

Stay tuned for a major new announcement about our work creating new interface metaphors connecting the TV Explorer and Visual…

Continue Reading

Language Models Can Improve Event Prediction By Few-Shot Abductive Reasoning

Large language models have shown astonishing performance on a wide range of reasoning tasks. In this paper, we investigate whether…

Continue Reading

CJR: How The Media Is Covering ChatGPT

The Two Center has an article in today's Columbia Journalism Review (CJR) exploring how the media has covered ChatGPT and…

Continue Reading

Tracking a Year of Tucker Carlson on Russian TV

Until his departure from Fox News last month, Tucker Carlson was a regular fixture on Russian television news, with clips…

Continue Reading

Today: Google IO Connect In Miami

Kalev will be at Google's I/O Connect event in Miami today – reach out if you're in town!

Continue Reading

FiveThirtyEight: The Rise, Fall And Potential Resurrection Of Ron DeSantis

FiveThirtyEight explores DeSantis' campaign trajectory, including an analysis of media coverage. Read The Full Article.

Continue Reading

Visual Explorer: New Broadcast Time Offset Referencing

Last week we introduced a new parameter to the Visual Explorer that allows referencing a specific thumbnail frame in a…

Continue Reading

Visual Explorer: Heads Of State Embedding Database For Searching Russian Television News

Last week we demonstrated searching an entire year of Belarusian, Russian and Ukrainian television news for appearances of current and…

Continue Reading

Hyperlinking Television: Connecting Our Nation's Legislation To The Legislative Process Via Deep Linking CSPAN

In collaboration with the Internet Archive's TV News Archive and the Media-Data Research Consortium, we are tremendously excited today to…

Continue Reading

Visual Explorer: Scripting Using The New Channel Inventory JSON File

Yesterday we announced the new Visual Explorer Channel Inventory JSON file for programmatic discovery. As a simple example of how…

Continue Reading

Visual Explorer: New Channel Inventory JSON File For Programmatic Discovery

To make it easier to programmatically interact with the Visual Explorer and its new Visual Explorer Lenses metaphor, we have…

Continue Reading

GCP Blog: Bringing The Power Of Large Models To Google Cloud's Speech API

We are incredibly excited to be cited in the Google Cloud Blog announcement today of Cloud Speech's USM Chirp! Read…

Continue Reading

WashPost: Ted Cruz Figures Out A Way To Guzzle New Bud Light Headlines

The Washington Post's Philip Bump includes an analysis of television news coverage of Bud Light and Dylan Mulvaney. Read The…