Package: tmcn 0.2-13

Jian Li

tmcn: A Text Mining Toolkit for Chinese

A Text mining toolkit for Chinese, which includes facilities for Chinese string processing, Chinese NLP supporting, encoding detecting and converting. Moreover, it provides some functions to support 'tm' package in Chinese.

Authors:Jian Li

tmcn_0.2-13.tar.gz
tmcn_0.2-13.tar.gz(r-4.5-noble)tmcn_0.2-13.tar.gz(r-4.4-noble)
tmcn_0.2-13.tgz(r-4.4-emscripten)tmcn_0.2-13.tgz(r-4.3-emscripten)
tmcn.pdf |tmcn.html
tmcn/json (API)

# Install 'tmcn' in R:
install.packages('tmcn', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Datasets:
  • GBK - GBK character set
  • NTUSD - National Taiwan University Semantic Dictionary
  • SIMTRA - Dictionary of simplified and traditional Chinese
  • SPORT - Sport news.
  • STOPWORDS - Dictionary of Chinese stop words

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

24 exports 1 stars 1.80 score 0 dependencies 6 dependents 1 mentions 153 scripts 1.8k downloads

Last updated 5 years agofrom:408a39fe1a. Checks:OK: 1 NOTE: 1. Indexed: yes.

TargetResultDate
Doc / VignettesOKAug 24 2024
R-4.5-linux-x86_64NOTEAug 24 2024

Exports:catUTF8createDTMcreateTDMcreateWordFreqgetCharsetisBIG5isGB18030isGB2312isGBKisUTF8leftrevUTF8rightsetchssetchtsetukstopwordsCNstrcapstrextractstrpadstrstriptoPinyintoTradtoUTF8

Dependencies: