Massive Yahoo News Feed Dataset Released

Yahoo has released a massive News Feed dataset.

Here's an excerpt from the announcement:

The Yahoo News Feed dataset is a collection based on a sample of anonymized user interactions on the news feeds of several Yahoo properties, including the Yahoo homepage, Yahoo News, Yahoo Sports, Yahoo Finance, Yahoo Movies, and Yahoo Real Estate. The dataset stands at a massive ~110B lines (1.5TB bzipped) of user-news item interaction data, collected by recording the user-news item interaction of about 20M users from February 2015 to May 2015.

Author: Charles W. Bailey, Jr.