Package: llmclean 0.1.1

Sadikul Islam

llmclean: LLM-Assisted Data Cleaning with Multi-Provider Support

Detects and suggests fixes for semantic inconsistencies in data frames by calling large language models (LLMs) through a unified, provider-agnostic interface. Supported providers include 'OpenAI' ('GPT-4o', 'GPT-4o-mini') <https://platform.openai.com>, 'Anthropic' ('Claude') <https://www.anthropic.com>, 'Google' ('Gemini') <https://ai.google.dev>, 'Groq' (free-tier 'LLaMA' and 'Mixtral') <https://groq.com>, and local 'Ollama' models <https://ollama.com>. The package identifies issues that rule-based tools cannot detect: abbreviation variants, typographic errors, case inconsistencies, and malformed values. Results are returned as tidy data frames with column, row index, detected value, issue type, suggested fix, and confidence score. An offline fallback using statistical and fuzzy-matching methods is provided for use without any application programming interface (API) key. Interactive fix application with human review is supported via 'apply_fixes()'. Methods follow de Jonge and van der Loo (2013) <https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf> and Chaudhuri et al. (2003) <doi:10.1145/872757.872796>.
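The offline, fuzzy-matching idea mentioned in the description can be illustrated with a small base-R sketch. This is not the package's implementation and does not call any llmclean function: the helper name `flag_near_duplicates`, its `max_dist` argument, the issue labels, and the confidence formula are all assumptions made for illustration. It flags values that lie within a small edit distance of a more frequent value in the same column and returns one row per finding, using the result columns the description lists.

```r
# Hypothetical helper, NOT part of llmclean: flag values in a character column
# that are within a small edit distance of a more frequent value.
flag_near_duplicates <- function(df, column, max_dist = 2L) {
  values   <- as.character(df[[column]])
  tab      <- table(values)              # frequency of each distinct value
  distinct <- names(tab)
  freq     <- as.integer(tab)
  # Case-insensitive Levenshtein distances between all distinct values
  d <- utils::adist(tolower(distinct))

  out <- list()
  for (i in seq_along(values)) {
    j <- match(values[i], distinct)
    if (is.na(j)) next                   # skip missing values
    # Candidates: close in edit distance and strictly more frequent
    cand <- which(d[j, ] <= max_dist & freq > freq[j])
    if (length(cand) == 0L) next
    best    <- cand[which.max(freq[cand])]
    longest <- max(nchar(distinct[c(j, best)]))
    out[[length(out) + 1L]] <- data.frame(
      column        = column,
      row           = i,
      value         = values[i],
      # Distance 0 after lowercasing means the two differ only by case
      issue_type    = if (d[j, best] == 0) "case" else "typo",
      suggested_fix = distinct[best],
      # Assumed scoring: 1 minus the distance relative to the longer string
      confidence    = round(1 - d[j, best] / longest, 2),
      stringsAsFactors = FALSE
    )
  }
  if (length(out) == 0L) return(NULL)    # nothing flagged
  do.call(rbind, out)
}

cities <- data.frame(
  city = c("London", "London", "London", "Lodnon", "london",
           "Paris", "Paris", "Pariss"),
  stringsAsFactors = FALSE
)
issues <- flag_near_duplicates(cities, "city")
print(issues)
```

On this toy input the sketch flags rows 4, 5, and 8. A purely frequency-based heuristic like this cannot tell a rare but legitimate value from a typo, which is the kind of gap an LLM-backed check with human review is meant to address.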

Authors: Sadikul Islam [aut, cre], Rajesh Kaushal [aut]

llmclean_0.1.1.tar.gz
llmclean_0.1.1.tar.gz (r-4.7-any)
llmclean_0.1.1.tar.gz (r-4.6-any)
llmclean_0.1.1.tgz (r-4.6-emscripten)
manual.pdf | manual.html
card.svg | card.png
llmclean/json (API)

# Install 'llmclean' in R:
install.packages('llmclean', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

This package does not link to any GitHub, GitLab, or R-Forge repository. No issue tracker or development information is available.

2.30 score | 417 downloads | 7 exports | 15 dependencies

Last updated from: 03c9478b9f. Checks: 4 OK. Indexed: yes.

Target                 Result  Time
linux-devel-x86_64     OK      131
source / vignettes     OK      229
linux-release-x86_64   OK      137
wasm-release           OK      109

Exports: apply_fixes, detect_issues, get_llm_provider, llmclean_report, offline_detect, set_llm_provider, suggest_fixes

Dependencies: cli, dplyr, generics, glue, lifecycle, magrittr, pillar, pkgconfig, R6, rlang, tibble, tidyselect, utf8, vctrs, withr

LLM-Assisted Data Cleaning with llmclean

Rendered from llmclean-intro.Rmd using knitr::rmarkdown on Jun 09 2026.

Last update: 2026-04-22
Started: 2026-04-22