| タイトル | Measuring Constraint-Set Utility for Partitional Clustering Algorithms |
| 著者(英) | Basu, Sugato; Davidson, Ian; Wagstaff, Kiri L. |
| 著者所属(英) | Jet Propulsion Lab., California Inst. of Tech. |
| 発行日 | 2006-09-18 |
| 言語 | eng |
| 内容記述 | Clustering with constraints is an active area of machine learning and data mining research. Previous empirical work has convincingly shown that adding constraints to clustering improves the performance of a variety of algorithms. However, in most of these experiments, results are averaged over different randomly chosen constraint sets from a given set of labels, thereby masking interesting properties of individual sets. We demonstrate that constraint sets vary significantly in how useful they are for constrained clustering; some constraint sets can actually decrease algorithm performance. We create two quantitative measures, informativeness and coherence, that can be used to identify useful constraint sets. We show that these measures can also help explain differences in performance for four particular constrained clustering algorithms. |
| NASA分類 | Numerical Analysis |
| 権利 | Copyright |
|