Proposed by: Andrew McCallum
Added on: 20 November 2016.
The Cora data contains bibliographic records of machine learning papers that have been manually clustered into groups that refer to the same publication.
Various versions of the Cora data set have been used in record linkage and entity resolution publications over the years.
The second column (field/attribute) contains the entity identifiers (publication identifiers); records that share an identifier refer to the same publication.
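As a rough illustration of how the entity identifiers in the second column can be used to recover the manual clusters, the sketch below groups records by that column. The comma delimiter, the helper name `group_by_entity`, and the sample rows are assumptions made for this example only; they are placeholders, not actual Cora records or the actual file layout.

```python
import csv
import io
from collections import defaultdict


def group_by_entity(rows, entity_col=1):
    """Map each entity identifier to the list of record rows that carry it.

    entity_col=1 is the second column (0-based index), as described above.
    """
    clusters = defaultdict(list)
    for row in rows:
        if len(row) <= entity_col:
            continue  # skip blank or malformed rows
        clusters[row[entity_col]].append(row)
    return dict(clusters)


# Placeholder rows for illustration only, NOT actual Cora records.
# The comma delimiter is also an assumption about the file format.
sample = io.StringIO(
    "r1,pubA,first citation string\n"
    "r2,pubA,second citation string\n"
    "r3,pubB,third citation string\n"
)

clusters = group_by_entity(csv.reader(sample))

# Number of records in each cluster (records referring to the same publication).
cluster_sizes = {entity: len(records) for entity, records in clusters.items()}
```

To read a real file, the `io.StringIO` object would be replaced by an opened file handle, with the delimiter adjusted to match the version of the data set in use.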