Ulrich Bodenhofer, Mario Drobics, Werner Winiwarter,
"Mining Clusters and Corresponding Interpretable Descriptions - A Three-Stage Approach"
, in Expert Systems, Vol. 19, Nummer 4, Seite(n) 224-234, 9-2002, ISSN: 0266-4720
Original Titel:
Mining Clusters and Corresponding Interpretable Descriptions - A Three-Stage Approach
Sprache des Titels:
Englisch
Original Kurzfassung:
This paper presents a three-stage approach to data mining which puts special emphasis on the visualization and interpretability of the results. In the first stage, the input data is represented by a self-organizing map in order to allow visualization and to reduce the amount of data while removing noise, outliers, and missing values. Then this preprocessed information is used to identify and display fuzzy clusters of similarity. Finally, descriptions close to natural language are computed for these clusters in order to provide the analyst with qualitative information. This is accomplished by generating fuzzy rules using an inductive learning method. The proposed approach is applied to three case studies, including image data and real-world data sets.
The results illustrate the robustness, intuitiveness and wide applicability of the method.
Sprache der Kurzfassung:
Englisch
Englische Kurzfassung:
This paper presents a three-stage approach to data mining which puts special emphasis on the visualization and interpretability of the results. In the first stage, the input data is represented by a self-organizing map in order to allow visualization and to reduce the amount of data while removing noise, outliers, and missing values. Then this preprocessed information is used to identify and display fuzzy clusters of similarity. Finally, descriptions close to natural language are computed for these clusters in order to provide the analyst with qualitative information. This is accomplished by generating fuzzy rules using an inductive learning method. The proposed approach is applied to three case studies, including image data and real-world data sets.
The results illustrate the robustness, intuitiveness and wide applicability of the method.