The Dangers As Companies Outsource Customer Relations To LLMs: A Case Study Of A Well-Known Pizza Chain

Today's Wall Street Journal explores how a growing number of fast food chains are using AI-powered speech recognition to take…

Fox News Dominates "Gold IRA" Mentions On TV News

The Washington Post today examines the "gold IRA" industry in the conservative news ecosystem. Fox News is replete with such…

How Evolving Guardrails & RLHF Are Creating False Confidence In LLM Safety & Bias Issues

One of the more interesting aspects of safety and bias issues in current generation LLMs is the rate at which…

How "Sorry For The Inconvenience" Rose In Our Lexicon

When SWB collapsed earlier this year, its website displayed a message that has become a symbol of modern life: "Our…

How TV News Covered The Odisha Train Crash Last Month

The train accident in India's Odisha state last month killed more than 260 people and injured more than 1,000. Yet,…

How Two LLMs See The Biden, Trump, Putin And Zelenskyy Administrations: The Biases Of Web-Scale Training Data

As LLMs are increasingly positioned as an alternative to traditional search engines, what are the perspectives on various presidential administrations…

The Unintended Consequences & Harms Of Multimodal LLM Debiasing: Detection Vs Generation

Multimodal LLMs represent uncharted new territory in the push to "debias" and "globalize" computer vision models. Past generations of object…

Automated Image Captioning: Experiments With Google's New Imagen on Vertex AI Generative AI Service

With the release into general availability of Imagen on Vertex AI, Google's new image-based generative AI service, let's explore how…

When Ukraine Praised Russia For "Delivering" Weapons: LLMs & The Severe Risks Of Geopolitical Hallucination & Conflation

One of the more informative findings from yesterday's ambiguous image captioning experiment was the degree to which, when asked to…

LLMs & An Ambiguous Photo Caption From The Ukrainian Front Lines: Do LLMs Actually Reason Or Merely Pattern Match Their Training Data?

As we continue to explore the ability of LLMs to summarize and distill the chaotic conflicting cacophony that is the…

WashPost: James Comer Has A James Comer Problem

The Washington Post's Philip Bump includes an examination of media coverage of James Comer. Read The Full Article.

Large-Token LLMs: Leveraging Anthropic's Claude 2's 100K Token Limit To Summarize An Entire Episode Of Russia's 60 Minutes

Earlier today we demonstrated the extraordinary power of Anthropic's Claude 2's 100,000 token limit to summarize and topically annotate into…

Large-Token LLMs: Leveraging Anthropic's Claude 2's 100K Token Limit To Summarize An Entire ABC Evening News TV News Broadcast In A Single Prompt

One of the greatest limitations of current LLMs, besides hallucination and instability, is their severe input limits: most common production…

Experiments With Anthropic's Claude 2 For Summarization, Event & Relation Extraction, NER & Q&A

Continuing our explorations of how various commercial LLMs perform on the chaotic conflicting cacophony that is global news, what does…

Experiments With Google's PaLM 2 LLMs For Embedding, Summarization, Event & Relation Extraction, NER & Q&A: Bison & Gecko

What does it look like to summarize a television evening news broadcast using Google's PaLM 2 large language model (LLM)?…

Multimodal LLMs: What Bard Teaches Us About Consumer Versus Enterprise Applications & Imagery

Last week we explored some of the significant limitations of state-of-the-art multimodal research LLMs when applied to…

Using Embeddings To Rank Clinical, "Creative" & "Inspired Fiction" Summaries Of An Evening News Broadcast

Earlier this month we demonstrated the use of embeddings to combat hallucination in LLM summarization and as a form of…

AGI & LLM Reasoning: Why Benchmarks Should Require 3-5 Runs To Reduce Anthropomorphization & False AGI Claims

Earlier today we examined how a SOTA multimodal LLM describes a variety of news and other images, testing how well…

The Limitations Of Multimodal Large Language Models: Automated Image Description, Captioning & Reasoning

Multimodal Large Language Models (LLMs) are touted as the future of automated reasoning, with the ability to look across imagery…

ChatGPT's Puzzling Inability To Solve A Simple Physics Riddle & The Brittleness Of Reasoning

Interviewers, especially at technology companies, have historically often turned to thought puzzles to understand how a candidate reasons under uncertainty…

WashPost: Murdoch Is Realizing That He's Stuck With The Monster He Created

An article in today's Washington Post by Philip Bump includes an analysis of media coverage using the TV Explorer. Read…

Complex & Ambiguous Phrasing, Re-Reading & LLMs' Inability To Conceptually Reason

Conceptually, LLM reasoning is a linear process, "confined to token-level, left-to-right decision-making processes during inference." This means that when confronted…

Fox News Covered Mulvaney Story 3x More Than CNN & MSNBC Combined

The timeline below shows mentions of Dylan Mulvaney across CNN, MSNBC and Fox News since April of this year, showing…

FiveThirtyEight: Which 2024 Candidates Had The Best – And Worst – Campaign Launches?

An analysis in FiveThirtyEight today includes a mention of television news coverage of Tim Scott's campaign. Read The Full Article.