The GDELT Project

Common Crawl And Unlocking Web Archives For Research

The world’s web archives hold tens of petabytes of data charting the evolution of our digital world, yet little of this historical record is available for academic research. What might archives learn from Common Crawl’s open data model as a way forward for big data research?

Read The Full Article.