Package: wskm 1.4.40

He Zhao

wskm: Weighted k-Means Clustering

Entropy weighted k-means (ewkm) by Liping Jing, Michael K. Ng and Joshua Zhexue Huang (2007) <doi:10.1109/TKDE.2007.1048> is a weighted subspace clustering algorithm that is well suited to very high dimensional data. Weights are calculated as the importance of a variable with regard to cluster membership. The two-level variable weighting clustering algorithm tw-k-means (twkm) by Xiaojun Chen, Xiaofei Xu, Joshua Zhexue Huang and Yunming Ye (2013) <doi:10.1109/TKDE.2011.262> introduces two types of weights, the weights on individual variables and the weights on variable groups, and they are calculated during the clustering process. The feature group weighted k-means (fgkm) by Xiaojun Chen, Yunminng Ye, Xiaofei Xu and Joshua Zhexue Huang (2012) <doi:10.1016/j.patcog.2011.06.004> extends this concept by grouping features and weighting the group in addition to weighting individual features.

Authors:Graham Williams [aut], Joshua Z Huang [aut], Xiaojun Chen [aut], Qiang Wang [aut], Longfei Xiao [aut], He Zhao [cre]

wskm_1.4.40.tar.gz
wskm_1.4.40.tar.gz(r-4.5-noble)wskm_1.4.40.tar.gz(r-4.4-noble)
wskm_1.4.40.tgz(r-4.4-emscripten)wskm_1.4.40.tgz(r-4.3-emscripten)
wskm.pdf |wskm.html
wskm/json (API)

# Install 'wskm' in R:
install.packages('wskm', repos = 'https://cloud.r-project.org')

Bug tracker:https://github.com/simonyansenzhao/wskm/issues1 issues

Datasets:
  • fgkm.sample - Sample dataset to illustrate the fgkm algorithm.
  • twkm.sample - Sample dataset to test the twkm algorithm.

On CRAN:

Conda:

1.00 score 1 stars 459 downloads 3 exports 22 dependencies

Last updated 5 years agofrom:881bfb4787. Checks:3 OK. Indexed: yes.

TargetResultLatest binary
Doc / VignettesOKMar 23 2025
R-4.5-linux-x86_64OKMar 23 2025
R-4.4-linux-x86_64OKMar 23 2025

Exports:ewkmfgkmtwkm

Dependencies:classclusterdeldirDEoptimRdiptestflexmixfpcinterpjpegkernlablatticelatticeExtraMASSmclustmodeltoolsnnetpngprabclusRColorBrewerRcppRcppEigenrobustbase

Citation

To cite "wskm" in publications, please use:

Graham Williams, Joshua Z Huang, Xiaojun Chen, Qiang Wang and Longfei Xiao (2020). wskm: Weighted k-Means Clustering. R package version 1.4.40. https://CRAN.R-project.org/package=wskm

If ewkm() is used, please also cite:

Liping Jing, Michael K. Ng and Joshua Zhexue Huang (2007). An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data. IEEE Transactions on Knowledge and Data Engineering, 19(8), 1026--1041.

If fgkm() is used, please also cite:

Xiaojun Chen, Yunming Ye, Xiaofei Xu and Joshua Zhexue Huang (2012). A Feature Group Weighting Method for Subspace Clustering of High-Dimensional Data. Pattern Recognition, 45(1), 434--446.

If twkm() is used, please also cite:

Xiaojun Chen, Xiaofei Xu, Joshua Zhexue Huang and Yunming Ye (2013). TW-k-Means: Automated Two-Level Variable Weighting Clustering Algorithm for Multiview Data. IEEE Transactions on Knowledge and Data Engineering, 25(4), 932--944.

Corresponding BibTeX entries:

  @Manual{wskm2014hz,
    title = {{wskm}: Weighted $k$-Means Clustering},
    author = {Graham Williams and Joshua Zhexue Huang and Xiaojun Chen
      and Qiang Wang and Longfei Xiao},
    year = {2020},
    note = {R package version 1.4.40},
    url = {https://CRAN.R-project.org/package=wskm},
  }
  @Article{ewkm2007lpj,
    title = {An Entropy Weighting $k$-Means Algorithm for Subspace
      Clustering of High-Dimensional Sparse Data},
    author = {Liping Jing and Michael K. Ng and Joshua Zhexue Huang},
    journal = {{IEEE} Transactions on Knowledge and Data Engineering},
    volume = {19},
    number = {8},
    pages = {1026--1041},
    year = {2007},
    doi = {10.1109/TKDE.2007.1048},
  }
  @Article{fgkm2012xjc,
    title = {A Feature Group Weighting Method for Subspace Clustering
      of High-Dimensional Data},
    author = {Xiaojun Chen and Yunming Ye and Xiaofei Xu and Joshua
      Zhexue Huang},
    journal = {Pattern Recognition},
    volume = {45},
    number = {1},
    pages = {434--446},
    year = {2012},
    doi = {10.1016/j.patcog.2011.06.004},
  }
  @Article{twkm2013xjc,
    title = {TW-$k$-Means: Automated Two-Level Variable Weighting
      Clustering Algorithm for Multiview Data},
    author = {Xiaojun Chen and Xiaofei Xu and Joshua Zhexue Huang and
      Yunming Ye},
    journal = {IEEE Transactions on Knowledge and Data Engineering},
    volume = {25},
    number = {4},
    pages = {932--944},
    year = {2013},
    doi = {10.1109/TKDE.2011.262},
  }