Continue Reading

Our Journey Towards User-Facing Vector Search: Evaluating Elasticsearch's ANN Vector Search RAM Costs

As we continue our journey towards offering realtime user-facing semantic search over our growing collection of embedding datasets, we are…

Continue Reading

GCP Tips & Tricks: Observations On A Decade Of Running Elasticsearch On GCP: Part 3 – Future Storage Options

We've run Elasticsearch clusters on GCP for almost a decade across many different iterations of hardware and cluster configurations. Earlier…

Continue Reading

The Hidden Dangers Of Generative Coding (CodeGen) Guardrails

One of the most intriguing findings from our generative code modernization (codegen) experiment earlier today was the degree to which…

Continue Reading

A Look Back At Mapping 2013's "The Global Conversation" And Ahead To The Future

One decade ago we mapped "The Global Conversation" for the December 2013 print edition of Foreign Policy Magazine that coincided…

Continue Reading

Experiments In Summarizing Global Media Tenor: Views Towards China – Part 1

How might we use LLMs to summarize at-scale media tone and portrayals of countries? Let's look at a few different…

Continue Reading

Tracing Global Media Tone & Anxiety Towards China & Ukraine 2020-Present

Using the GKG, what might we be able to learn about global media tone towards China and Ukraine? Let's first…

Continue Reading

The Perils Of LLMs For Translation Tasks On Lower-Resource Languages: Estonian Noun Declension

One of the great promises of large language models (LLMs) is their ability to revolutionize translation and linguistic tasks. A…

Continue Reading

The Erdogan Heart Attack Rumor On Social Vs Mainstream Media: The Perils Of Prioritizing Speed Over Verification

One of the most-touted aspects of social media when it comes to realtime global warning is its claimed ability to…

Continue Reading

GEN4: GCE Networking: VPC Networks, Firewalls, IAP Tunneling & PGA

Earlier this year we explored how Google Compute Engine (GCE)'s VPC networks and especially GCP's "Private Google Access" (PGA) allows…

Continue Reading

GEN4: Building A Complete Near-Realtime Live Stream Video Analytics Platform In The Cloud In Just A Few Lines Of Code

Given the growing use of live streaming video across the world, from speeches by heads of state to news programming,…

Continue Reading

GDELT Opening Keynotes: Watching, Visualizing And Forecasting The World In Realtime

For organizations and conferences looking for aspirational "grand challenge" opening keynotes and workshops to inspire their audiences, the next in…

Continue Reading

Web Archives As Digital History: Methodologies, Workflows And Technological Needs

Within GDELT's vast archives lie decades of global human history. A library of open datasets spanning more than 8 trillion…

Continue Reading

GDELT Keynotes & Workshops: In-Person & Virtual

For organizations interested in everything from inspirational opening keynotes on the incredible new insights we gain into the functioning of…

Continue Reading

Using The Global Similarity Graph To Bootstrap Categorization Models Using Web NGrams 3.0

A common question from organizations building document classifiers on top of the Web NGrams 3.0 dataset is how to accelerate…

Continue Reading

Using Chyrons To Understand Diversity In Television News

When television news channels turn to outside experts to interview on the major stories of the moment, who do they…

Continue Reading

Mapping News: The Hidden Geography Of The World's News Media

Ever since Culturomics 2.0 showcased the immense insights and hidden predictive power of the geography of the world's news media,…

Continue Reading

"You Are Here": Helping The Public Navigate Their Informational Choices

Over the years we've explored how the GKG's outlink graph can be used to construct "you are here" maps of…

Continue Reading

Creating A New Generation Of Recommender Services: From Cohorts & Classification To Context & Trustworthiness

As we continue to explore how to help the world's citizenry access trustworthy information, one key area of research we…

Continue Reading

Planetary Scale Knowledge Graphs: Tractable Approaches To Mining Massive Graphs

Earlier this week we touched on some of the incredible and unparalleled new classes of research questions that are now…

Continue Reading

Behind The Scenes: Lessons Learned From Ingesting Large Datasets Into The Cloud

Given GDELT's immense global scale and footprint that extends across nearly every GCP data center worldwide, we are frequently asked…

Continue Reading

Inaugural Google Innovators Hive 2022 Talk: Using Google Cloud & GCP AI APIs To Watch, Visualize And Forecast The World In Realtime

It was a tremendous honor to speak last week at the inaugural Google Innovators Hive 2022 event, presenting "Using Google…

Continue Reading

Inaugural Google Innovators Hive 2022 Talk Today: Using Google Cloud & GCP AI APIs To Watch, Visualize And Forecast The World In Realtime

Kalev will be presenting today at the inaugural Google Innovators Hive 2022 event, presenting "Using Google Cloud & GCP AI…

Continue Reading

Kalev Speaking At Inaugural Google Innovators Hive 2022: Using Google Cloud & GCP AI APIs To Watch, Visualize And Forecast The World In Realtime

Kalev will be speaking this week at the inaugural Google Innovators Hive 2022 event, presenting "Using Google Cloud & GCP…

Continue Reading

Ukraine & Custom Event Coding And Narrative Assessment

We've received a lot of requests for guidance on how to apply custom event coding schemas to track media reporting…