Visible to the public GNMID14: A Collection of 110 Million Global Music Identification Matches

TitleGNMID14: A Collection of 110 Million Global Music Identification Matches
Publication TypeConference Paper
Year of Publication2016
AuthorsSummers, Cameron, Tronel, Greg, Cramer, Jason, Vartakavi, Aneesh, Popp, Phillip
Conference NameProceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-4069-4
Keywordscollection, Content Identification, dataset, fingerprint, genre, geotemporal, mood, music, pubcrawl170201, similarity, test

A new dataset is presented composed of music identification matches from Gracenote, a leading global music metadata company. Matches from January 1, 2014 to December 31, 2014 have been curated and made available as a public dataset called Gracenote Music Identification 2014, or GNMID14, at the following address: This collection is the first significant music identification dataset and one of the largest music related datasets available containing more than 110M matches in 224 countries for 3M unique tracks, and 509K unique artists. It features geotemporal information (i.e. country and match date), genre and mood metadata. In this paper, we characterize the dataset and demonstrate its utility for Information Retrieval (IR) research.

Citation Keysummers_gnmid14:_2016