Package: dataProfilerR 0.2.1
dataProfilerR: Automated Exploratory Data Analysis and Dataset Profiling
Profiles a data frame with minimal input: column type inference, missing-value analysis, distributional summary statistics (including skewness and kurtosis), normality tests, outlier detection, correlation and categorical-association analysis, date-column profiling, grouped comparisons and an overall data-quality score, alongside a set of 'ggplot2' visualisations. A single entry point, profile_data(), returns a structured S3 object holding metadata, statistics, diagnostics and plots, with print(), summary() and plot() methods, and report() renders the whole profile to a self-contained HTML file. Statistical methods include the Shapiro-Wilk normality test as implemented by Royston (1995) <doi:10.2307/2986146> and the Anderson-Darling test following Stephens (1974) <doi:10.1080/01621459.1974.10480196>, with power comparisons of these tests in Yap and Sim (2011) <doi:10.1080/00949655.2010.520163>, and the categorical association measure of Cramer (1946, ISBN:9780691080048).
Authors:
dataProfilerR_0.2.1.tar.gz
dataProfilerR_0.2.1.tar.gz(r-4.7-any)dataProfilerR_0.2.1.tar.gz(r-4.6-any)
dataProfilerR_0.2.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
dataProfilerR/json (API)
| # Install 'dataProfilerR' in R: |
| install.packages('dataProfilerR', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/mqfarooqi1/dataprofilerr/issues
Last updated from:fc17232ddf. Checks:4 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 153 | ||
| source / vignettes | OK | 167 | ||
| linux-release-x86_64 | OK | 146 | ||
| wasm-release | OK | 132 |
Exports:analyze_datesanalyze_missingcategorical_associationcompare_groupscorrelation_analysisdata_quality_scoredetect_outliersinfer_column_typesis_data_profilekurtosisnormality_testsoutlier_summaryplot_associationplot_boxplotsplot_correlationplot_distributionplot_missingplot_pairsprofile_datareportskewnesssummarize_columns
Dependencies:clicpp11farverggplot2gluegtableisobandlabelinglifecycleR6RColorBrewerrlangS7scalesvctrsviridisLitewithr
