Title | ||
---|---|---|
Cluster Prototypes and Fuzzy Memberships Jointly Leveraged Cross-Domain Maximum Entropy Clustering |
Abstract | ||
---|---|---|
The classical maximum entropy clustering (MEC) algorithm usually cannot achieve satisfactory results in the situations where the data is insufficient, incomplete, or distorted. To address this problem, inspired by transfer learning, the specific cluster prototypes and fuzzy memberships jointly leveraged (CPM-JL) framework for cross-domain MEC (CDMEC) is firstly devised in this paper, and then the corresponding algorithm referred to as CPM-JL-CDMEC and the dedicated validity index named fuzzy memberships-based cross-domain difference measurement (FM-CDDM) are concurrently proposed. In general, the contributions of this paper are fourfold: 1) benefiting from the delicate CPM-JL framework, CPM-JL-CDMEC features high-clustering effectiveness and robustness even in some complex data situations; 2) the reliability of FM-CDDM has been demonstrated to be close to well-established external criteria, e.g., normalized mutual information and rand index, and it does not require additional label information. Hence, using FM-CDDM as a dedicated validity index significantly enhances the applicability of CPM-JL-CDMEC under realistic scenarios; 3) the performance of CPM-JL-CDMEC is generally better than, at least equal to, that of MEC because CPM-JL-CDMEC can degenerate into the standard MEC algorithm after adopting the proper parameters, and which avoids the issue of negative transfer; and 4) in order to maximize privacy protection, CPM-JL-CDMEC employs the known cluster prototypes and their associated fuzzy memberships rather than the raw data in the source domain as prior knowledge. The experimental studies thoroughly evaluated and demonstrated these advantages on both synthetic and real-life transfer datasets. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/TCYB.2015.2399351 | Cybernetics, IEEE Transactions |
Keywords | Field | DocType |
cross-domain clustering,maximum entropy clustering (mec),privacy protection,transfer learning,validity index,optimization,linear programming,prototypes,clustering algorithms,convergence,indexes,entropy | Data mining,Transfer of learning,Robustness (computer science),Rand index,FLAME clustering,Linear programming,Artificial intelligence,Cluster analysis,Mathematical optimization,Fuzzy logic,Principle of maximum entropy,Machine learning,Mathematics | Journal |
Volume | Issue | ISSN |
PP | 99 | 2168-2267 |
Citations | PageRank | References |
9 | 0.42 | 43 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
pengjiang qian | 1 | 30 | 5.48 |
Yizhang Jiang | 2 | 382 | 27.24 |
Zhaohong Deng | 3 | 647 | 35.34 |
lingzhi hu | 4 | 11 | 0.80 |
shouwei sun | 5 | 12 | 0.80 |
Shitong Wang | 6 | 1485 | 109.13 |
Raymond F. Muzic Jr. | 7 | 28 | 4.48 |