Title | ||
---|---|---|
The intelius nickname collection: quantitative analyses from billions of public records |
Abstract | ||
---|---|---|
Although first names and nicknames in the United States have been well documented, there has been almost no quantitative analysis on the usage and association of these names amongst themselves. In this paper we introduce the Intelius Nickname Collection, a quantitative compilation of millions of name-nickname associations based on information gathered from billions of public records. To the best of our knowledge, this is the largest collection of its kind, making it a natural resource for tasks such as coreference resolution, record linkage, named entity recognition, people and expert search, information extraction, demographic and sociological studies, etc. The collection will be made freely available. |
Year | Venue | Keywords |
---|---|---|
2012 | north american chapter of the association for computational linguistics | united states,expert search,intelius nickname collection,quantitative analysis,entity recognition,coreference resolution,quantitative compilation,largest collection,public record,name-nickname association,information extraction |
Field | DocType | Citations |
Data science,Record linkage,Coreference,Public records,Computer science,Natural resource,Information extraction,Named-entity recognition | Conference | 0 |
PageRank | References | Authors |
0.34 | 3 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Vitor R. Carvalho | 1 | 672 | 36.38 |
Yigit Kiran | 2 | 1 | 0.69 |
Andrew Borthwick | 3 | 94 | 17.07 |