Package: llmimpute 0.1.0
llmimpute: Missing Data Imputation via Language Models and Statistics
Provides missing data imputation through two complementary engines: a large language model engine that communicates with the 'Anthropic' 'Claude' application programming interface for context-aware semantic imputation, and a fully self-contained offline engine implementing nineteen statistical and machine learning algorithms entirely in base R with no additional package dependencies. Offline methods include mean, median, mode, last observation carried forward, next observation carried backward, hot-deck, predictive mean matching, k-nearest neighbours, ordinary least-squares regression, Lasso with coordinate descent, Ridge with closed-form solution, Bayesian Ridge regression with evidence approximation following MacKay (1992), support vector regression with a radial basis function kernel, classification and regression trees, random forests, gradient boosting, iterative random forest imputation, principal component analysis imputation via iterative singular value decomposition, and nuclear-norm minimisation via singular value thresholding. When no API key is available the package automatically falls back to the offline engine, ensuring full operation in environments without internet access. Every imputed value is accompanied by a confidence score and a plain-language reasoning string, producing reproducible audit trails. The automatic method selector chooses the best algorithm per column based on data type, skewness, missingness rate, and inter-column correlations.
Authors:
llmimpute_0.1.0.tar.gz
llmimpute_0.1.0.tar.gz(r-4.7-any)llmimpute_0.1.0.tar.gz(r-4.6-any)
llmimpute_0.1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
llmimpute/json (API)
NEWS
| # Install 'llmimpute' in R: |
| install.packages('llmimpute', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org')) |
This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.
Last updated from:a4214af4b5. Checks:4 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 133 | ||
| source / vignettes | OK | 172 | ||
| linux-release-x86_64 | OK | 126 | ||
| wasm-release | OK | 275 |
Exports:lmi_diagnoselmi_exportlmi_get_modellmi_imputelmi_impute_offlinelmi_methodslmi_modelslmi_providerslmi_set_api_keylmi_set_model
Dependencies:askpassclicurlgluehttr2jsonlitelifecyclemagrittropensslR6rappdirsrlangsysvctrswithr
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Extract the imputed data frame from an lmi_result | as.data.frame.lmi_result |
| Diagnose missing data in a data frame | lmi_diagnose |
| Export imputed data and audit trail to files | lmi_export |
| Impute missing values using LLM or built-in statistical methods | lmi_impute |
| Impute missing values using built-in statistical and ML methods (no API required) | lmi_impute_offline |
| List all available offline imputation methods | lmi_methods |
| List recommended models for each supported LLM provider | lmi_models |
| List all supported LLM providers | lmi_providers |
| Configure the API key and LLM provider for llmimpute | lmi_set_api_key |
| Set the LLM model used for imputation | lmi_get_model lmi_set_model |
| Print an lmi_result object | print.lmi_result |
| Summarise an lmi_result object | summary.lmi_result |
