Package: wordpredictor 0.0.5
wordpredictor: Develop Text Prediction Models Based on N-Grams
A framework for developing n-gram models for text prediction. It provides data cleaning, data sampling, extracting tokens from text, model generation, model evaluation and word prediction. For information on how n-gram models work we referred to: "Speech and Language Processing" <https://web.archive.org/web/20240919222934/https%3A%2F%2Fweb.stanford.edu%2F~jurafsky%2Fslp3%2F3.pdf>. For optimizing R code and using R6 classes we referred to "Advanced R" <https://adv-r.hadley.nz/r6.html>. For writing R extensions we referred to "R Packages", <https://r-pkgs.org/index.html>.
Authors:
wordpredictor_0.0.5.tar.gz
wordpredictor_0.0.5.tar.gz(r-4.5-noble)wordpredictor_0.0.5.tar.gz(r-4.4-noble)
wordpredictor_0.0.5.tgz(r-4.4-emscripten)wordpredictor_0.0.5.tgz(r-4.3-emscripten)
wordpredictor.pdf |wordpredictor.html✨
wordpredictor/json (API)
NEWS
# Install 'wordpredictor' in R: |
install.packages('wordpredictor', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/pakjiddat/word-predictor/issues
Pkgdown:https://pakjiddat.github.io
Last updated 2 months agofrom:175cfb1676. Checks:OK: 2. Indexed: no.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Dec 08 2024 |
R-4.5-linux | OK | Dec 08 2024 |
Exports:BaseDataAnalyzerDataCleanerDataSamplerEnvManagerModelModelEvaluatorModelGeneratorModelPredictorTokenGeneratorTPGenerator
Dependencies:clicolorspacedigestdplyrfansifarvergenericsggplot2gluegtableisobandlabelinglatticelifecyclemagrittrMASSMatrixmgcvmunsellnlmepatchworkpillarpkgconfigR6RColorBrewerrlangscalesSnowballCstringistringrtibbletidyselectutf8vctrsviridisLitewithr
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Base class for all other classes | Base |
Analyzes input text files and n-gram token files | DataAnalyzer |
Provides data cleaning functionality | DataCleaner |
Generates data samples from text files | DataSampler |
Allows managing the test environment | EnvManager |
Represents n-gram models | Model |
Evaluates performance of n-gram models | ModelEvaluator |
Generates n-gram models from a text file | ModelGenerator |
Allows predicting text, calculating word probabilities and Perplexity | ModelPredictor |
Generates n-grams from text files | TokenGenerator |
Generates transition probabilities for n-grams | TPGenerator |