Abstract | ||
---|---|---|
Semi-supervised clustering, which aims to integrate side information to improve clustering performance, has received considerable attention in the research community. Generally, there are two kinds of side information: seeds (labelled data) and constraints (must-link, cannot-link). By integrating information provided by the user or a domain expert, semi-supervised clustering can produce the results users expect. In practice, clustering results depend on the side information provided, so different side information produces different results. In some cases, clustering performance may even decrease if the side information is not carefully chosen. This paper addresses the problem of selecting good constraints for semi-supervised clustering algorithms. For this purpose, we propose an active learning algorithm for the constraint collection task, which relies on the min-max algorithm and peak estimation based on a density score. Experiments conducted on real data sets from UCI show the effectiveness of our approach. |
Year | DOI | Venue |
---|---|---|
2022 | 10.23919/ICACT53585.2022.9728938 | 2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARTIFICIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY |
Keywords | DocType | ISSN |
Clustering, Semi-supervised clustering, Constraint, Density peak, Active Learning | Conference | 1738-9445 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
10 |
Name | Order | Citations | PageRank |
---|---|---|---|
Viet-Vu Vu | 1 | 0 | 1.69 |
Byeongnam Yoon | 2 | 0 | 1.01 |
Hong-Quan Do | 3 | 0 | 1.69 |
Hai-Minh Nguyen | 4 | 0 | 0.34 |
Tran-Chung Dao | 5 | 0 | 0.34 |
Cong-Mau Tran | 6 | 0 | 0.68 |
Doan-Vinh Tran | 7 | 0 | 1.01 |
Thi-Nhuong Phi | 8 | 0 | 0.34 |
Viet-Thang Vu | 9 | 0 | 0.68 |
Tien-Dung Duong | 10 | 0 | 0.68 |