Extending The Face Detection Database To American TV News: 14 Years Of CNN, MSNBC & Fox News
Earlier this year we unveiled a massive new database of facial embeddings spanning one full year across seven Belarusian, Russian…
Extending The Face Detection Database To American TV News: Coming Tomorrow
Stay tuned for a major new announcement tomorrow about facial embeddings!
Visual Explorer: Ron DeSantis & Barack Obama Added To TV News Public Figures Embedding Database
This past May we unveiled the Visual Explorer TV News Public Figures Embedding Database that contains precomputed embeddings for major…
Entity Extraction: LLMs Versus Classical Neural Model + Live-Updating Knowledge Graph
Large language models are increasingly being positioned as a wholesale replacement for nearly all language analysis tasks, from Q&A and…
Generative AI: Using LLM's To Produce Audience-Tailored Translations In Place Of Classical NMT's One-Size-Fits-All
Last month we explored the ability of Large Language Models (LLMs) to produce higher-quality translations than traditional Neural Machine Translation…
WashPost: Documents Show How Conservative Doctors Influenced Abortion, Trans Rights
The Washington Post examines media coverage of abortion and trans rights. Read The Full Article.
WashPost: Republicans Keep Spilling Cold Water On Their Biden Bribery Allegations
The Washington Post's Philip Bump examines television news coverage of the Biden bribery allegations. Read The Full Article.
Generative AI: Using LLMs To Produce Culturally Recent Translations Vs Classical NMT – "Dropping" A Song
Last month we explored the ability of Large Language Models (LLMs) to produce higher-quality translations than traditional Neural Machine Translation…
WashPost: DeSantis's Campaign Launch Fizzled
The Washington Post's Philip Bump explores media coverage of DeSantis' campaign launch. Read The Full Article.
Fox News & Last Night's "Wannabe Dictator" Chyron
Last night, Fox News ran briefly ran the chyron "WANNABE DICTATOR SPEAKS AT THE WHITE HOUSE AFTER HAVING HIS POLITICAL…
Generative Search: The Curious Case Of Comet Cleaner's Active Ingredient
The use of LLMs to interpret and summarize search results (so-called "generative search") is widely touted as the future of…
Embedding Models: Multilingual Embedding Versus Machine Translation + English Embedding
There are at least 7,000 languages actively spoken today across the world, yet much of the focus of embedding models…
Embedding Models: Mitigating Knowledge Cutoffs Through Replacement Terms
Embedding models represent a snapshot in time of world knowledge. Like knowledge graphs, LLMs and all other forms of machine…
Embedding Models Vs Classical Sentiment Analysis For Tone & Framing Search
Yesterday we examined the significant limitations of using embeddings to search by tone and framing, demonstrating that embeddings are not…
Weaponized: Russian Propaganda Outlets Promote Presidential Candidate Robert F. Kennedy Jr.
This fascinating analysis by Caroline Orr Bueno examines media coverage of Russian outlets about Robert F. Kennedy Jr. Read The…
Embedding Models: The Impact Of Tone And Framing On Embedding Search
Continuing our embedding series, how strongly are tone and framing captured in embedding representations and are they sufficient to bias…
Embedding Models: The Unique Challenges Of News & The Impact Of Buried Facts On Embeddings As External LLM Memory
Embedding models form the basis of most current approaches to overcoming the input length and knowledge aging of large language…
Embedding Models: Using LLMs To Create Synthetic Comparison Data & The Impact Of Textual Length Part 4
Repeating our analysis from earlier today, this time we'll use an LLM to generate passages in three lengths (tweet, long-form…
Embedding Models: Using LLMs To Create Synthetic Comparison Data & The Impact Of Textual Length Part 3
As we continue our embedding series, we've demonstrated that the length of the input text can have an impact on…
Embedding Models: The Impact Of Textual Length On Embedding Similarity Part 2
As embeddings play an increasingly central role in semantic search and as LLM external memory, one challenge is that despite…
Embedding Models: The Impact Of Length On Embedding Similarity
Embeddings are highly sensitive to input length, where highly similar texts can yield very different embeddings depending on their size….
Embedding Models: Clustering COVID-19 Versus "Poxes"
Following on our "mpox" versus "monkeypox" experiment, let's use that same embedding visualization template to cluster a broader set of…
Embedding Models: Revisiting Multilingual Embedding Through Visualization
Using our new embedding visualization template, let's revisit our multilingual embedding experiment and visualize how each embedding model clusters our…
Embedding Models: Capitalization & Knowledge Cutoffs Part 2
Yesterday we introduced a Colab notebook template for visualizing embedding models and explored the impact of capitalization, word spacing and…