K-Means with their feature selection in RapidMiner
4 min readAug 31, 2021
Related Research
Design

Without Feature Selection
- accuracy: 65.52%

- split data: 0.01, accuracy: 66.22%

Unsupervised Feature Selection with k-means
- balance of simplicity: 1.0 = accuracy: 63.23%

- balance of simplicity: 0.5 = accuracy: 63.23%

- split data: 0.01, balance of simplicity: 1.0 = accuracy: 57.72%

- split data: 0.01, balance of simplicity: 0.5 = accuracy: 66.22%

Try to use different feature selection model
K-Means (H2O)

- notsplit 0.5

- notsplit 1.0

- split 0.5

- split 1.0

X-Means

- notsplit 0.5

- notsplit 1.0

- split 0.5

- split 1.0

K-NN

- split max 20

- split max 30

Naive Bayes

- split max 20

- split max 30

Decission Tree

- split max 20

- split max 30
