Package: dataProfilerR 0.2.1

Muhammad Farooqi

dataProfilerR: Automated Exploratory Data Analysis and Dataset Profiling

Profiles a data frame with minimal input: column type inference, missing-value analysis, distributional summary statistics (including skewness and kurtosis), normality tests, outlier detection, correlation and categorical-association analysis, date-column profiling, grouped comparisons and an overall data-quality score, alongside a set of 'ggplot2' visualisations. A single entry point, profile_data(), returns a structured S3 object holding metadata, statistics, diagnostics and plots, with print(), summary() and plot() methods, and report() renders the whole profile to a self-contained HTML file. Statistical methods include the Shapiro-Wilk normality test as implemented by Royston (1995) <doi:10.2307/2986146> and the Anderson-Darling test following Stephens (1974) <doi:10.1080/01621459.1974.10480196>, with power comparisons of these tests in Yap and Sim (2011) <doi:10.1080/00949655.2010.520163>, and the categorical association measure of Cramer (1946, ISBN:9780691080048).
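Several of the statistics named in the description can be approximated in base R. The sketch below is illustrative only and is not the package's implementation: the helper names moment_skewness(), moment_excess_kurtosis() and cramers_v() are hypothetical, and the moment-based (1/n) estimators and the uncorrected chi-squared statistic are assumptions that may differ from what dataProfilerR's skewness(), kurtosis() and categorical_association() actually compute.

```r
# Illustrative base-R helpers; these are NOT dataProfilerR's own functions.

# Moment-based sample skewness: m3 / m2^(3/2), using 1/n central moments.
moment_skewness <- function(x) {
  x <- x[!is.na(x)]
  d <- x - mean(x)
  mean(d^3) / mean(d^2)^1.5
}

# Moment-based excess kurtosis: m4 / m2^2 - 3 (zero for a normal distribution).
moment_excess_kurtosis <- function(x) {
  x <- x[!is.na(x)]
  d <- x - mean(x)
  mean(d^4) / mean(d^2)^2 - 3
}

# Cramer's V for two categorical vectors, from the Pearson chi-squared
# statistic without continuity correction.
cramers_v <- function(a, b) {
  tab <- table(a, b)
  chi2 <- suppressWarnings(stats::chisq.test(tab, correct = FALSE))$statistic
  k <- min(nrow(tab), ncol(tab)) - 1
  unname(sqrt(chi2 / (sum(tab) * k)))
}

# Example on built-in datasets
moment_skewness(iris$Sepal.Length)
moment_excess_kurtosis(iris$Sepal.Length)
stats::shapiro.test(iris$Sepal.Length)$p.value  # Shapiro-Wilk normality test
cramers_v(mtcars$cyl, mtcars$gear)
```

Only stats::shapiro.test() and stats::chisq.test() are real base-R calls here; everything else is scaffolding for illustration. Consult the package manual for the estimators it actually uses.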

Authors: Muhammad Farooqi [aut, cre]

dataProfilerR_0.2.1.tar.gz
dataProfilerR_0.2.1.tar.gz (r-4.7-any)
dataProfilerR_0.2.1.tar.gz (r-4.6-any)
dataProfilerR_0.2.1.tgz (r-4.6-emscripten)
manual.pdf | manual.html
DESCRIPTION | NEWS
card.svg | card.png
dataProfilerR/json (API)

# Install 'dataProfilerR' in R:
install.packages('dataProfilerR', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/mqfarooqi1/dataprofilerr/issues

2.70 score | 22 exports | 17 dependencies

Last updated from: fc17232ddf. Checks: 4 OK. Indexed: yes.

Target                Result  Time
linux-devel-x86_64    OK      153
source / vignettes    OK      167
linux-release-x86_64  OK      146
wasm-release          OK      132

Exports: analyze_dates, analyze_missing, categorical_association, compare_groups, correlation_analysis, data_quality_score, detect_outliers, infer_column_types, is_data_profile, kurtosis, normality_tests, outlier_summary, plot_association, plot_boxplots, plot_correlation, plot_distribution, plot_missing, plot_pairs, profile_data, report, skewness, summarize_columns

Dependencies: cli, cpp11, farver, ggplot2, glue, gtable, isoband, labeling, lifecycle, R6, RColorBrewer, rlang, S7, scales, vctrs, viridisLite, withr

Profiling a dataset with dataProfilerR

Rendered from dataProfilerR.Rmd using knitr::rmarkdown on Jun 24 2026.

Last update: 2026-06-24
Started: 2026-06-24