Package: quanteda 4.1.0
quanteda: Quantitative Analysis of Textual Data
A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and n-grams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.
Authors:
quanteda_4.1.0.tar.gz
quanteda_4.1.0.tar.gz(r-4.5-noble)quanteda_4.1.0.tar.gz(r-4.4-noble)
quanteda_4.1.0.tgz(r-4.4-emscripten)quanteda_4.1.0.tgz(r-4.3-emscripten)
quanteda.pdf |quanteda.html✨
quanteda/json (API)
NEWS
# Install 'quanteda' in R: |
install.packages('quanteda', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/quanteda/quanteda/issues
- data_char_sampletext - A paragraph of text for testing various text-based functions
- data_char_ukimmig2010 - Immigration-related sections of 2010 UK party manifestos
- data_corpus_inaugural - US presidential inaugural address texts
- data_dfm_lbgexample - Dfm from data in Table 1 of Laver, Benoit, and Garry
- data_dictionary_LSD2015 - Lexicoder Sentiment Dictionary
Last updated 3 months agofrom:0ae2f07236. Checks:OK: 2. Indexed: no.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 04 2024 |
R-4.5-linux-x86_64 | OK | Nov 04 2024 |
Exports:%>%as.corpusas.dfmas.dictionaryas.fcmas.listas.phraseas.tokensas.tokens_xptras.yamlbootstrap_dfmbreakrules_getbreakrules_resetbreakrules_setchar_keepchar_ngramschar_removechar_segmentchar_selectchar_tolowerchar_toupperchar_trimchar_wordstemcheck_charactercheck_doublecheck_integercheck_logicalcolMeanscolSumsCompareconcatconcatenatorconvertcorpuscorpus_groupcorpus_reshapecorpus_samplecorpus_segmentcorpus_subsetcorpus_trimdfmdfm_compressdfm_groupdfm_keepdfm_lookupdfm_matchdfm_removedfm_replacedfm_sampledfm_selectdfm_smoothdfm_sortdfm_subsetdfm_tfidfdfm_tolowerdfm_toupperdfm_trimdfm_weightdfm_wordstemdictionarydocfreqdociddocnamesdocnames<-docvarsdocvars<-fcmfcm_compressfcm_keepfcm_removefcm_selectfcm_sortfcm_tolowerfcm_toupperfeatfreqfeatnamesflatten_dictionaryindexinfo_tbbis.collocationsis.corpusis.dfmis.dictionaryis.fcmis.indexis.kwicis.phraseis.tokensis.tokens_xptrkwicmetameta<-ndocnfeatnsentencentokenntypeobject2fixedobject2idpattern2fixedpattern2idphraseprintquanteda_optionsrowMeansrownames<-rowSumssegidsparsitystopwordsttextstexts<-tokenize_charactertokenize_customtokenize_fasterwordtokenize_fastestwordtokenize_sentencetokenize_word1tokenize_word2tokenize_word3tokenize_word4tokenstokens_chunktokens_compoundtokens_grouptokens_keeptokens_lookuptokens_ngramstokens_removetokens_replacetokens_restoretokens_sampletokens_segmenttokens_selecttokens_skipgramstokens_splittokens_subsettokens_tolowertokens_touppertokens_trimtokens_wordstemtopfeaturestypes
Dependencies:clifastmatchglueISOcodesjsonlitelatticelifecyclemagrittrMatrixRcpprlangSnowballCstopwordsstringixml2yaml