The GDELT Project

Improving Multilingual Entity Identification And Disambiguation

As we look for ways to help the NLP community move towards truly multilingual approaches, one area of particular interest for many news-related applications is more robust multilingual entity identification and disambiguation, especially in unusual contexts.

We'd love to see more research in this space using the new 150-language Web News NGrams 3.0 dataset!