Abstract | ||
---|---|---|
Author name disambiguation is an important problem in bibliometric analysis and tech mining. Many techniques have been proposed; however, most of them require long run times or additional information. This paper presents a new method based on semantic fingerprints that disambiguates author names without external data. A manually annotated dataset was built to evaluate the presented method. Separate experiments were conducted using co-author features, institution features, and text fingerprints. The first two methods achieved higher precision but low recall, whereas the text fingerprint method achieved higher recall with satisfactory precision. Based on these results, we integrated co-author features, institution features, and text fingerprints into semantic fingerprints for disambiguating author names, which achieved a better F-measure. |
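The record lists Simhash among its keywords, so a text fingerprint of the kind the abstract mentions can be illustrated with a generic Simhash sketch. This is not the authors' implementation: the tokenizer, the use of MD5 as the token hash, the 64-bit width, and the names `simhash` and `hamming_distance` are all assumptions made for illustration.

```python
import hashlib
import re
from collections import Counter

BITS = 64  # assumed fingerprint width; the abstract does not state one


def simhash(text: str) -> int:
    """Return a 64-bit Simhash fingerprint of `text` (illustrative sketch)."""
    # Assumed tokenizer: lowercase word tokens weighted by term frequency.
    weights = Counter(re.findall(r"\w+", text.lower()))
    totals = [0] * BITS
    for token, weight in weights.items():
        # Assumed token hash: first 8 bytes of MD5, read as a 64-bit integer.
        digest = hashlib.md5(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        for i in range(BITS):
            # Each token votes +weight or -weight on every bit position.
            totals[i] += weight if (h >> i) & 1 else -weight
    fingerprint = 0
    for i, total in enumerate(totals):
        if total > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")
```

Two records sharing an author name could then be treated as candidates for the same person when the Hamming distance between their text fingerprints falls below some threshold; the threshold itself would have to be tuned on annotated data and is not given in the abstract.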
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/s11192-017-2338-6 | Scientometrics |
Keywords | Field | DocType |
Name disambiguation, Simhash, Semantic fingerprint | Data mining, Information retrieval, Author name, Computer science, Fingerprint, Artificial intelligence, Natural language processing, Recall, Name disambiguation | Journal |
Volume | Issue | ISSN |
111 | 3 | 0138-9130 |
Citations | PageRank | References |
1 | 0.34 | 25 |
Authors |
---|
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hongqi Han | 1 | 1 | 0.68 |
Changqing Yao | 2 | 22 | 6.71 |
Yuan Fu | 3 | 2 | 1.41 |
Yongsheng Yu | 4 | 5 | 1.08 |
Yunliang Zhang | 5 | 1 | 0.68 |
Shuo Xu | 6 | 23 | 6.23 |