Abstract | ||
---|---|---|
Keyphrase extraction aims at automatically selecting small set of phrases in a document, that best describe its main ideas. There is great need for better methods of keyphrase extraction in the absence of labeled data, as currently unsupervised algorithms fail to achieve adequate performance, compared to their supervised counterparts. In this paper we suggest a widely applicable distant supervision framework based on auxiliary data from query logs. By propagating information from queries and subsequent consumption of content, weak labels are produced, transforming the problem into the easier supervised task. Evaluation on a large dataset shows the superiority of this approach over unsupervised alternatives. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1109/BigDataService49289.2020.00019 | 2020 IEEE Sixth International Conference on Big Data Computing Service and Applications (BigDataService) |
Keywords | DocType | ISBN |
Keyphrase Extraction, Document Analysis, Knowledge Extraction | Conference | 978-1-7281-7023-7 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Oren Sar Shalom | 1 | 20 | 7.74 |
Hezi Resheff | 2 | 0 | 0.34 |
Alex Zhicharevich | 3 | 0 | 0.34 |
Rami Cohen | 4 | 49 | 3.14 |