Plot()
: For a time series exponential smoothing forecast, now display SSE and MSE fit indices, the linear trend and seasonal coefficients, and the obtained smoothing parameter values.
Plot()
: Improved conversion of character string numeric dates to R type Date.
Plot()
: Read dates entered as character strings formatted as:
Plot()
: Forecasting output now written to an output list object such as p <- Plot(...)
with names out_frcst, out_fitted, out_coefs, and out_params. This output is accessible for showing just part of the output, such as p$out_frcst
when writing R Markdown documents from the results.
STL()
: Set parameter show_range
to TRUE
to show the range of each component. Before TRUE
was the default.
STL()
: Parameter quiet
added to suppress text output and parameter do_plot
added to suppress the visualization.
Plot()
: Exponential smoothing of multiplicative models now working, indicated by es_type="multiplicative"
.
Plot()
: Can set style(suggest=FALSE)
for a time series plot.
Plot()
: If missing data for a run chart, a proper error message is now displayed if attempting to show the runs, for which missing data does not work.
Plot()
: Turn off the plot with do_plot=FALSE
, leaving only text output, now works.
Plot()
: If plotting a run chart with multiple y
variables, adjacent points are now connected with line segments for each of the y
variables. Example: Plot(.Index, c(y1,y2,y3))
.
Plot()
: Exponential smoothing forecasting implemented with accompanying visualization. New parameters include time_ahead
for the number of time_units
to forecast into the future, and time_format
to provide a specific format for the date variable if not detected correctly by default. Control aspects of the exponential smoothing estimation and prediction algorithms with parameters es_level
(alpha), es_trend
(beta), es_seasons
(gamma), es_type
for additive or multiplicative seasonality, and es_PIlevel
for the level of the prediction intervals.
Plot()
: Character string versions of a date as in a variety of forms as digits, such as "08/18/2024", are now by default converted to variable type of Date
. However, this conversion is inherently ambiguous, so the time_format
parameter is provided as a means to provide the precise format if needed, including other formats such as "August 18, 2024". Also, a sequence of four-digit integers within the usual range of dates will also convert automatically to a variable of type `Dates.
STL()
: A wrapper for Base R stl()
that provides additional information and utility:
Date
as the x
-variable and the time series values as the y
-variable instead of an R time series.Date
.BarChart()
, Histogram()
, Plot()
: If the x
- or y
- axis values in the resulting plots all end in 000, then the 000 is replaced with a K, such as 120000
plotted as an axis value of 120K
.
LineChart()
: Deprecated for years, now removed as its functionality has been incorporated into Plot()
and extended with the x
-variable of type Date
.
Plot()
: Parameter run
dropped. Instead, to maintain the position of the variable of interest to be plotted, as the y
-variable, indicate a run chart by specifying .Index
as the x
-variable, that is, the first variable listed. That name, beginning with a period, .
, indicates data values for the variable Will be created automatically.
Plot()
: Increase the default size of the dot in the Cleveland dot plot and related.
Plot()
: When parameter time_unit
is explicitly set, proper dates are now displayed on the x-axis when there is a by
variable.
Plot()
: Suggestions are working. To turn off: style(suggest=FALSE)
.
BarChart()
, Histogram()
, Plot()
: Parameter names by1
and, where applicable, by2
, deprecated, replaced with more descriptive new names facet1
and facet2
.
Plot()
: Using functions from the xts package, when plotting a time series dates are now formatted according to their natural unit. For example, when plotting by years, just years are listed without the month and the day.
Plot()
: When plotting a times series and requesting a level of time aggregation with the time_unit
parameter that is more detailed than the available data, the analysis appropriately terminates with an error message. For example, if the time series data is monthly and an aggregation of weeks
is requested, no analysis is done.
Plot()
: For plotting time series data, new parameter n_date_ticks
added to override the default number of ticks on the x
axis, the date axis.
Plot()
: For Trellis (facet or lattice) plots of bar charts and histograms, each panel is labeled with the proper size of text in the corresponding panel strip.vignettes
: These extensive examples are moved to the web to save space as the installed lessR was exceeding the accepted maximum limit. The web address is given from library(lessR)
.
BarChart()
: When doing a composite plot, if calling fill=getColors(...)
for a divergent palette, such as for one bar for all items on an attitude survey, no longer need to explicitly add n=
for the number of levels.
getColors()
: Added three pre-defined palettes from the colorpace
package: rainbow_hcl
, terrain_hcl
, and heat_hcl
, e.g., getColors("rainbow_hcl")
.
getColors()
: Adjusted the margins to better fit the plot window when the color wheel or rectangle is displayed.
getColors()
: Default Tableau qualitative palette added as a name to recognize and translate to the colors, also for visualization functions, such as fill="Tableau"
.
Plot()
: New parameter time_unit
for when the x-variable is a Date
variable and a time series is plotted with automatic aggregation of the time unit according to the specified value, such as "years"
plotted from daily dates.
Plot()
: New parameter time_agg
for when the x-variable is a Date
variable and a time series is plotted with automatic aggregation to specify the type of aggregation with "sum"
the default.
Plot()
: Pearson correlation analysis no longer displayed if the x-variable is a date, that is, if a time series is plotted.
all visualizations
: Grid lines not affected in some plots that would add some unintended transparency to the visualizations.
PieChart()
: User now warned that the values
family of parameters is changed to labels
, though the old reference still will work.
Read()
: If quiet=TRUE
, automatically modified variable names to legal R names are not reported.
all visualizations
: Default colors softened a bit with default transparency
level moved from 0 to 0.10.
ANOVA()
, BarChart()
, Histogram()
, PieChart()
, pivot()
, Plot()
, Regression()
, ttest()
: Parameter rows
, for subsetting data, deprecated, now named filter
to be more consistent with other systems, although rows
parameter removed entirely for ANOVA()
, pivot()
, Regression(), and
ttest()to avoid needing the
rowsparameter as part of the function definition in addition to
filter`.
BarChart()
, Histogram()
, PieChart()
, Plot()
: Parameter filter
(formerly rows
), now reports the number of rows of data before and after the filtering along with the logical statement that defines the filter.
BarChart()
: Default family of values
parameters now called labels
, such as labels_color
, to be more consistent with other systems.
BarChart()
: values_digits
now labels_decimals
though old value still works.
BarChart()
: Default labels
(formerly values
) that are labeled on each bar now set at input
except when aggregating counts, where it remains at %
.
BarChart()
: When labels
is set to input
, default decimal digits is now 2, which can be customized with labels_digits
.
BarChart()
: Default labels
(formerly values
) that are labeled on each bar now set at input
except when aggregating counts, where it remains at %
.
Plot()
: Parameter area_fill
on time series plots now extends down to the level of the y-variable specified with scale_y
instead of the minimum value of y.
Plot()
: Binning large data sets now results in larger bubbles.
dataStockPrice
: Data table updated with stock prices through May 2024, and trading volume added as the fourth variable.
BarChart()
: For an unstacked or grouped bar chart, beside=TRUE
, if there is missing data for the by
variable, the color distribution across the bars is now consistent across levels and with the legend.
BarChart()
: For two categorical variables, x
and by
, evaluation if input data is a summary (pivot) table of a prior aggregation is now correctly computed.
interact()
: User-written functions are able to again use BarChart()
, Histogram()
, PieChart()
, and Plot()
.
pivot()
: Output variable names now consistently labeled: variable_stat, e.g., Salary_mean.
Plot()
: Shapes now properly plot with a by
parameter stratifying according to the specified categorical variable.
Plot()
: by1
parameter for Trellis (facet) plot with fit
line specified now properly does not display text output when quiet=TRUE
.
reshape_wide()
: After the transformation, all other variables except those specified as the group, response, or ID are deleted as their values are not evenly distributed over the new wide format variables.
pivot()
: Fixed a bug on the output. The results were correct but not formatted correctly.BarChart()
: Parameter stat_x
added to specify how to plot the y
-axis if there is no y
variable specified, with default value of counts
and option proportion
or %
. These values are separated from the stat
parameter which now exclusively pertains to the transformation of a specified y
numerical variable.
BarChart()
, Histogram()
, Plot()
: Parameters lab_adj
, margin_adj
, and legend_adj
renamed for clarity to lab_adjust
, margin_adjust
, and legend_adjust
.
pivot()
: When pivot table is of counts, using function table
, the values of n
and na
are now listed last in the resulting pivot table.
Plot()
: shape
parameter default for multiple groups according to parameter by
returns to "circle"
with the new shape
value "vary"
if desired to retain a sequence of default shapes across the different groups.
Plot()
: fit
parameter now set only to the values that it can analyze such as lm
, so no longer can be set to TRUE or FALSE.
Read()
and Write()
: Support the modern, fast, and efficient data file formats from the arrow
package, feather
and parquet
.
details()
: Shows the proportion of all missing data values in addition to the total, and displays each row of data with missing data, also obtained with Read()
when brief=FALSE
.
details()
: miss_zero
parameter removed.
rename()
: Parameter to
can now specify a vector of names to change, with the corresponding from
vector of new names.
BarChart()
: If a numerical y variable is specified for analysis of the original,raw data, then if a value of the stat
parameter is not specified its value now defaults to "mean"
.
Plot()
: If a time series is plotted from a x-axis variable that is not of type Date
, then, setting segments=TRUE
connects the points (which can vanish if size=0
), but the line width parameter lwd
was previously not responsive in this context.
Logit(), Regression()
: For categorical predictor variables, the conversion to indicator variables now happens before the analysis. All analyses are now only of numeric variables so that the correlation matrix, scatterplot matrix, tolerance, and best subset analyses are now conducted for categorical predictor variables.
Logit(), Regression()
: Redundant predictor variables that lead to a singularity is now detected and noted instead of the function crashing.
Logit()
: For sigmoid function plot for a single predictor model, right-hand margin adjusted to better display labels of the target variable of varying lengths.
Regression()
: For the model fit section, the displayed standard deviation of the response variable is now computed from the same data from which the model was estimated, which can differ depending on missing data.
BarChart()
: Missing data is now permitted for the numerical or y
variable.
pivot()
: Aggregated variables are named the original name of the aggregated variable concatenated with an underscore and the name of the operation.
Plot()
: For plots of two or more variables, the right-margin legend is displayed with a smaller font size and centered more effectively.
prob_norm()
: Function terminates if value of lo
is greater than hi
instead of returning a meaningless result.
getColors()
: Default for parameter output
is TRUE
only for a direct call from the console. If embedded in a function call such as for the fill
parameter, or even if directly called in R Markdown, then set output=TRUE
to view the calculated. palette
BarChart()
: Parameter theme
now works.
getColors()
: Now can be called directly in R Markdown documents though specify output=TRUE
for output to be displayed.
pivot()
: Parameter out_names
now works if the assigned name for a single variable analysis contains the name of the variable analyzed from the input data frame.
Plot()
: For stacked time series plots, default for size
of points is now 0.
Plot()
: For lattice plots, if there is only a single point to plot in a panel, just the point is plotted instead of also attempting to plot a violin and a box plot, which necessarily fails.
Plot()
: Variable and axis value labels increased in size for R Markdown and R by itself analyses.
ANOVA()
, BarChart()
, Histogram()
, Plot()
, Regression()
, ttest()
: A tibble that is not in the user workspace, instead in a package, is now properly converted to a standard R data frame upon which the internal function's code depends, which also permits more flexibility in the packaging of data in general to these functions, such as with the pipe operator.
ANOVA()
: Overall p-value now available for the one-way ANOVA as the statistic in an output object, called p_value
, such as output object a$p_value
in a <- ANOVA( ... )
.
getColors()
: To address color blindness, the Okabe-Ito
color palette was added, and available as a fill
parameter in Plot()
and BarChart()
.
interact()
: To address color blindness, the viridis
and Okabe-Ito
were added as qualitative palettes, such as for the bars of a bar chart.
sort_by()
: New name for the Sort()
function, now deprecated, to maintain consistency of lower-case functions being utility functions.
Plot()
: For Trellis plots, indicated with by1
and by2
parameters, assessment of replications of unique values is done only if 1000 or fewer unique values to save both space and compute time for information that is not of such value anyhow.
Plot()
: For a run chart, parameter run=TRUE
, the successive points in the plot are now by default joined with line segments, which can be stopped by setting parameter segments=FALSE
.
Plot()
: Parameters lab_cex
and axis_cex
for size of value labels and axis labels, scaled smaller for visualizations in R run alone, without RStudio.
Plot()
: To address color blindness, when there is a by
grouping, default is now to not only vary the fill color but also the shape of the plotted points.
Regression()
: Parameter mod
for moderation analysis in a two-predictor model added.
ttest()
: Parameter quiet
added to turn off text display at the console if set to TRUE
.
ANOVA()
: One-way ANOVA with parameter graphics=FALSE
now works.
BarChart()
, Plot()
: Qualitative color palette hues
was the default for the default color theme, colors
, but now also works explicitly when specified as the value for the fill
parameter.
BarChart()
: Now works for color
parameter specified as a vector.
interact()
: Interactive analyses resets label and axis plot sizes to larger, now set back when saving the pdf and also when re-plotting.
PieChart()
: Suggestions for alternative analyses had wrong lowercase expression piechart()
instead of PieChart()
.
Plot()
: With a linear fit line, b0 and b1 now reflect the specified number of digits according to parameter digits_d
.
Plot()
: For plotting a continuous and a categorical variable scatterplot, when listing the continuous variable first, the name of the categorical variable is now listed in the title of the text output.
Plot()
: For a run chart with parameter run=TRUE
, parameter scale_x
now properly allows for a custom set of ticks for the Index
on the x-axis.
Histogram()
, Plot()
, Correlation()
, ttest()
: Vastly speeded up for data sets much over 500 data values, substantial improvement on large data sets.Plot()
: Parameters jitter_x
and jitter_y
are reset for the scatterplot (not VBS plot) to explicitly set the amount of jitter within the negative and positive values of their assigned values, or, set to NULL
to activate the default value of the range of the variable divided by 50.
Plot()
: Text output for Mahalanobis distance more nicely formatted.
interact()
: Reading of text files was not detecting blank data values for variables of type character
as missing, which now it does.
interact()
: Various revisions of the interface styles and efficiency increases.
interact()
: A Help
option added for each analysis, which, when clicked, displays a web page of explanation of the analysis and listing and definition of each presented parameter.
interact("BarChart")
: Numerical y
variable and associated stat
parameter added for analysis of means and related statistics across the levels of the categorical variable x
.
interact("BarChart")
: The summary table of each category level paired with a number, to translate into bar height, can now be read as the input data table.
`interact("Plot"): ID lableing of outliers added.
pivot()
: For a single variable
to process, the variable name no longer repeated for each column of the output, one column per statistic computed.
BarChart()
: When plotting a statistics for a numerical variable, y
, the plotted values are now displayed at the console with sufficient width.
interact("ScatterPlot")
: Parameter enhance
now works.
interact("ScatterPlot")
: Choose parameter enhance
then parameter by
now works.
interact("ScatterPlot")
: Parameters by
and size
now work together and properly written to the R code file.
interact("Trellis")
: Single VBS now plotted without a by1
variable for Trellis plots per se.
Regression()
: For parameter Rmd
, collinearity issue fixed.
ref_group
for Logit()
, new, more explanatory name for ref
, the level of the binary target variable explicitly defined as the reference group instead of relying on the default.
train_test()
has new parameter matrix_out
which outputs the data structures as matrices instead of as data frames.
Regression()
: For automatically generating and processing R markdown files, users can now 100% customize the output directly by creating their own input files, just one or all eight of the files.
Regression()
: Added parameters Rmd_custom
to specify one or more custom input files and Rmd_dir
to specify a custom directory where the files are stored.
Regression()
: For output, vars
component added that is a vector of the variable names in the model beginning with the response variable.
Regression()
: Parameters res_rows
and pred_rows
changed to the more descriptive n_res_rows
and n_pred_rows
.
interact("Histogram")
: The bandwidth
slider for the density
option now re-adjusts if a new variable is selected from the same data set.
interact("Plot")
: Points now plotted properly.
interact("Plot")
: Fill color properly set if a by
variable is selected.
Plot()
: Transparency (trans parameter) now works when size as a variable and by are activated.
Plot()
: for a fit
function, mse now properly computed.
recode()
: Entire data frame now not written to the console.
Regression()
: Text enlarged for scatterplot matrix.
Regression()
: Number of allowable terms in the model definition now greatly expanded.
Regression()
: When generating a markdown file with parameter Rmd
, if reading a label file included with lessR
, the correct data file is now properly listed instead of the label file.
interact()
: lessR data analysis is now interactive, with each interactive analysis now displayed for analyzing the user's own data, which also includes summary statistics as output with the option to save the plot to a pdf file plus the corresponding R code.BarChart()
: Parameter do_plot
added for the option of computing summary statistics but not plotting.
Histogram()
: Underlying algorithm more efficient.
Histogram(..., density=TRUE)
: fill_gen
and color_gen
revised to fill_general
and color_general
, and fill_nrm
and color_nrm
to fill_normal
and color_normal
though the old names still accepted, as with bandwidth
in place of bw
and include_histogram
in place of dn.hist
.
Histogram()
: for density curve, provided values for fill_general
and fill_normal
that are color names now are converted to the same transparency level as their default colors.
PieChart()
: value labels now separated from the category label so that size can be set independently.
pivot()
: Long labels now abbreviated to stay within column widths.
Plot()
: For numbers < 10000, more decimal digits for stat outcomes of fit
parameter, such as MSE.
Plot()
: With a by
variable a named, sequential color range such as "reds" can now be specified in addition to the qualitative range of "hues".
tt.brief()
removed, long since replaced by tt_brief()
.
loading lessR
: Now lessR loads properly if another previously loaded package had loaded shiny.
Histogram()
: Returns in list structure, such as h
, saved output from h <- Histogram(...)
.
Histogram()
: When bin_start
was < 0 the number of displayed digits for the mid point of each bin could be excessive, now fixed.
Histogram()
: spelling error of out_suggewt to out_suggest
fixed for output name.s
pivot()
: When constructing a table (instead of a dataframe), NA
's now properly displayed.
Plot()
: Restored title that indicates scaling of bubble plot.
Plot()
: Transparency on bubble plots now works.
Read2()
: Now works properly.
BoxPlot()
, Histogram()
: Recent message from outlier check from package robustbase
removed.
Logit()
: Confusion matrix now displayed even if all predictions are for a single outcome category.
pivot()
: If not specified, output defaults to 3 decimal digits, otherwise displays the needed number of decimal digits to avoid rounding to 0.000.
pivot()
: For consistency, parameter n_show_group
changed to n_group_show
.
Plot()
: New parameter n_bins
> 1 indicates to bin the numeric x-variable and plot the mean or median of the numeric y-variable for each bin, with each point size dependent on the corresponding bin sample size.
Plot()
: More stats displayed regarding each panel when doing a fit line.
Read()
: Browsing for data file in RStudio, message about hidden window removed.
pivot()
: If na_remove
is FALSE
, the missing data value for the aggregated statistic was reported as 0 instead of NA
.
Plot()
: y-axis label now evaluated on all axis values instead of only the maximum value, which, if 1.0, rounds to 1, only a single digit.
Plot()
: segments=TRUE
now applies to all scatter plots, not just with by=
.
interact()
: Each interactive display now shows the underlying function call.
Plot()
: If x
is equally spaced, a line chart not default if a fit
line.
Plot()
: Parameter sqrt
deprecated for quat
.
Plot()
: Parameter root
deprecated for power
.
Plot()
: fit_color
parameter added to set directly, not just with style()
function.
Plot()
: When curve fitting with fit
parameter, b0
and b1
given for the linearized line from nonlinear functions.
Prop_test()
: If the hypothesis test of a proportion is one-sided, the alternative
hypothesis is now provided.
Histogram()
: density=TRUE
properly generates console output.
Plot()
: For fit
parameter, "reciprocal"
option removed as better covered with "exp"
which also does exponential decay in place of "reciprocal"
if no by
variable.
Plot()
: fit_color
now works for a fit
line.
Prop_test()
: parameters such as alternative
properly passed to R functions binom.test()
, prop.test()
, and chisq.test()
.
ttest()
: One-tailed alternatives less
and greater
options work.
ANOVA()
: One-way, scatterplot labels separated more from plot
Histogram()
: Summary stats headings adjusted to better align with numerical values
Prop_test()
: Parameter p0
changed to pi
for consistency with ttest()
, p0
still works
reshape_long()
: Default name of Response
changed from "Value"
to "Response"
style()
: Slightly lighten grayscale bars and points from "gray35"
to "gray42"
and point fill
and color
from "gray30"
to "gray42"
train_test()
: New function to either create training and testing data from a data frame with x
and y
combined or separated
Plot()
: a --run-donttest
test example was improperly configured, now works
Plot()
: Plotting a vector for x
or y
with fit
now worksANOVA()
: Interaction plot for 2-way factorial reflects color theme
BarChart()
: Appropriate error message if by
and by1
parameters both specified as only one for one function call is allowed
BarChart()
: Bar chart from a pivot table for two vars now possible with NA
column
data set
: dataAnova_rbf
data set for randomized block factorial ANOVA added
data set
: dataAnova_sp
data set for split-plot design ANOVA added
interact()
: More colors added
interact()
: Trellis plot has added violin_fill
and box_fill
parameters
pivot()
: by
variables except for Date
variables by default now converted to factors, though controlled with parameter: factors
reshape_long()
: If needed, an ID variable is now provided by default
reshape_long()
: Generated ID
variable now has a prefix in front of the integer ID by default, can be changed with the parameter: prefix
reshape_long()
: sep
parameter specifies a separator between the prefix and the ID integer output data frame has ID variable listed first
Transform()
: Previously deprecated, its abbreviation trans() now removed
details()
: Now properly reports details of a tibble
version of a data frame
interact()
: Histogram()
fill
now set properly
Plot()
: Panels in Trellis plots with no data now properly display as blank with grid lines instead of showing an error message
Plot()
: Interaction plot with segments=TRUE
of a pivot table now has all segments
Plot()
: Can now plot one continuous variable with violin/box/scatter plots with by
, by1
, and by2
parameters
Plot()
: Bubble plot transparency option now works with parameter trans
Plot()
: Bubble plot fill
and color
now properly specified
ANOVA()
: ANOVA 2-way factorial analysis provides Type II sum of squares for unbalanced designs
ANOVA()
: R-squared and related indices now reported to three decimal digits
ANOVA()
: Plotted point in interaction plot a little smaller
ANOVA()
: Scatterplot title removed for more room for the plot, labels adjusted
ANOVA()
: Scatterplot moved to the first plot displayed
BarChart()
: Right-margin legend for two variable chart no longer defaults to abbreviations for title and values, use legend_abbrev
to specify
Histogram()
: Bars slightly more gray and slightly less blue
Plot()
: For by
and fit
parameters together, equation, MSE, and R2 given for each fit line
Regression()
: Analysis of covariance with one categorical variable and one continuous variable displays the ...\cr
Regression()
: n_cat
parameter added to allow integer variables to be treated as categorical without declaring as a formal R factor
Regression()
: kfold
parameter now applies to models with categorical variables with dummy variables created automatically
Regression()
: Extra line of spacing between major sections of output
Regression()
: Grid lines added to scatterplot
Regression()
: title removed from scatterplot, remains in text console output
Regression()
: Parameter best_sub
provides the value "Cp"
for Mallow's "Cp"
statistic in addition to the default value "adjr2"
for adjusted R-squared
rename()
: New utility function for renaming a variable in the specified data frame
ANOVA()
: Interaction plot for 2-way factorial generalizes better beyond two levels brief output works more reliably for 2-way factorial
Regression()
: scale_response
parameter now works, applicable when new_scale
is implemented, FALSE
by default but if TRUE
then the response variable y
is also rescaled
ANOVA()
: Better calculation of size of left-margin for pairwise mean differences plot
Logit()
: For a single predictor, for a given probability classification
threshold, the x
-cutoff value now provided
Logit()
: For a single predictor, annotations added to the sigmoid curve
Logit()
: ref
parameter for designating the value of the response variable that is the reference group instead of relying upon defaults
Logit()
: Rows in the classification table reversed to correspond to sigmoid curve plot
pivot()
: For consistency with other na
parameters, show_group_na
parameter changed to na_show_group
Plot()
: Boxplot median line width increased to 2 for better visibility in the presence of an embedded scatterplot
Logit()
: Bug fixed that occurred for 50% of the analyses when the response variable has integer input values of 0,1 instead of a factor
such as Man/Woman
Logit()
: Fitted values now properly displayed when X1_new
, etc. specified
Logit()
: Edge condition that crashed with very little variance among fitted probabilities now fixed
Plot()
: by1
parameter now works when x
is categorical
Regression()
: Table of estimates now properly displayed for indicator variables with large number of characters for the variable names
BarChart()
: digits_d
parameter added for text output to control number of decimal digits for two categorical variable output
BarChart()
: Default gray fill color for the bars lightened slightly
data()
:\cr
dataStockPrice
updated to Jan 1, 2022\cr
dataEmployee_lbl
updated\cr
dataWeightLoss
data file added\cr
dataAnova_1way
data file added\cr
dataAnova_rb
data file added\cr
dataAnova_2way
data file added\cr
factors()
: Now processes tidyverse tibble
version of a data frame
Help()
: Deprecated, now removed, replaced by vignettes
interact()
: When prompting for valid names by passing no arguments, no longer displays an error message
interact()
: More colors added to choice of fill colors for interactive displays
Logit()
: For a single predictor variable, if the response y
is non-numeric, then the value set at 1 has the highest mean, for a positive difference
Logit()
: For a single predictor variable, logistic curve better labeled
Nest()
: If response variable is a character variable, automatically converted to a factor
to allow to run without an error
Plot()
: For Trellis plots, default changed to a single column (n_col=1
)
Read()
: If specified a lessR
data file improperly, the list of valid data files is displayed
Read()
: Read data files in the Open Document Spreadsheet format, .ods
Read()
: New parameter row_names for consistent interface across the various file formats, for cs
files R row.names
still works
Read()
: Report of variables with all unique values moved to brief report
Regression()
: If x
is a factor with two levels, the regression line is plotted in the scatterplot
Regression()
: Scatterplot can now have customized axis labels size, set by style()
reshape_long()
: New function that is a simple wrapper for Base R reshape()
with sensible parameter names and sensible defaults, and allow for variable ranges of columns to transfer from wide to long
reshape_wide()
: New function that is a simple wrapper for Base R reshape()
with sensible parameter names to convert a long-form data frame to a wide-form
Write()
: Parameters -- data
and to
-- switched to a more natural order with data
now the first parameter: Write(data=, to=, ...)
Write()
: Write data frames in the Open Document Spreadsheet format, .ods
, with format="ODS"
Write()
: Write data frames in the SPSS format, .sav
, with format="SPSS"
ANOVA()
: Family-wise confidence level now reported in output
BarChart()
: Formatting so that columns do not run together for some output with integers represented as integers instead of double
BarChart()
: Specifying one_plot=TRUE
and multiple variables now works
BarChart()
: Custom ylab
now works when the stat parameter is activated
Histogram()
: For density=TRUE
, quiet=TRUE
now works
Logit()
: If many variables, such as from created dummy variables, columns of estimated coefficients now align correctly
PieChart()
: Text now responds to corresponding style parameters: lab_cex
, main_cex=1.5
, and values_size
Plot()
: If a by
or by1
variable, misleading warning message sort order deleted
ANOVA()
, BarChart()
, Histogram()
, pivot()
, Plot()
, Regression()
, ttest()
: Informative error message added if a subset with the rows parameter returns no rows to subset
getColors()
: Little more differentiation between default sequential colors by lightening a bit more the lightest color
Histogram()
: Output objects n.bins
and n.miss
updated to n_bins and n_miss
Histogram()
: density=TRUE
output provides more summary statistics and suggestions
Plot()
: When specifying a fit line with a by categorical variable the sum of squared errors for each category are better labeled
Plot()
: For a by
categorical variable, no longer is the correlational analysis done for all the data, but the SSE is reported for each group
Plot()
: Pairwise correlation output takes up fewer lines
Plot()
: Variable labels for y
-axis made a little longer and still fit without breaking into an additional line
Plot()
: Some suggestions now change depending on random "coin flip"
Plot()
: Trellis plot default changed from n_col=1
to n_row=1
Regression()
: Standard deviation of response variable added to fit information, other stats removed
Regression()
: Parameter fit_line
width set at 1.5 if plotting w/o outlier line also and no standard error region for original line
to()
: Parameter same.size
updated to same_size
, old value still works
BarChart()
: For theme other than "hues"
, fill
colors the same by default unless an ordered factor
, then a sequential palette
Correlation()
: Pairwise correlation analysis properly returns lower and upper bounds of confidence interval about the sample correlation coefficient
Correlation()
: Pairwise correlation when called from Correlation()
function works
PieChart()
: Default colors with default color theme, "colors"
, now has qualitative color palette "hues"
, the same as bar chart, for a non-ordinal categorical variable
PieChart()
: Inner circle that creates the hole now with higher resolution so as to make a smoother circle
PieChart()
: Better placement of the plot title if there is one
ANOVA()
: Randomized blocks fitted means visualizations enhanced
ANOVA()
: rb_points
parameter removed
Histogram()
: For grayscale with a black background, histogram bars have no transparency and are lighter
interact()
: "PieChart"
option added for interactive, Shiny display
Logit()
: Grid lines added to logit plot for a single predictor transparency of 0.7 added to the points in the logit plot
pivot()
: If n=0
for an output row or column, then converting to proportions returns 0's instead of NaN
's
pivot()
: New parameter show_group_na
with default of TRUE
that will list NA
for missing data of a grouping variable as a level on the output
PieChart()
: Labels proper size outside of RStudio
Plot()
: plot_errors
now plots the connecting segments from each point to the fitted line with gray if style("gray")
Plot()
: Parameter size
for size of points now applies to a scatterplot matrix
Plot()
: fill
color for points and bubbles lightened slightly for gray scale
Plot()
: Trellis line plots increase slightly the width of the lines
recode()
: Transformed variables converted to be consistent with their type ex: convert a character
variable to integer
, but before was still character
Regression()
: For parameter new_scale
, new option is "centered"
Regression()
: For all rescaling, now default is only predictor numeric variables and only for numerical variables with more than two values
Regression()
: If rescaling, set new parameter rescale_response
to TRUE
to also rescale the response variable
Regression()
: For model fit, standard deviation and min and max of the target variable displayed along with the 95% range of variation of the residuals, with values displayed with commas for large numbers
Regression()
: Fitted vs residuals plot more efficiently sized and largest Cooks Distance value displayed as a sub-title
Regression()
: Summary stats of response variable added to Basic Analysis
Sort()
: Feedback includes names of sorted variables in place of their index
ttest()
: For one group t-test, density curve now has dark gray border and value of smd added to the top text display over the visualization
ANOVA()
: Randomized blocks ANOVA displays correct visualizations
BarChart()
: Ordinal factors properly display as color gradients, e.g., "blues"
PieChart()
: theme
other than default now implemented
Plot()
: For plotting a Trellis plot with a by
variable, all levels correctly plotted and legend correct
Plot()
: If x
or y
is categorical and the other continuous, if a by
variable then fill
and color
colors properly displayed at end of display
Sort()
: Internal coding simplified for improved functionality and generality
ttest()
: pdf_file
now works for one group t-test of the mean
interact()
: New function for running one of the following shiny apps for interactive visualizations: "BarChart1", "BarChart2", "Histogram", "ScatterPlot", "Trellis", e.g., interact("BarChart1")
pivot()
: Read a tidyverse tibble
in addition to standard R data frames
pivot()
: When naming variables, for variable y
, now y_mean
instead of y_mn
Plot()
: For x=categorical
and y=continuous
, or vice versa, no jitter added if there are no duplications of y
at each level of x
, plus more jitter added for larger sample size if jitter is needed
BarChart()
: Sort now works properly with a horizontal plot
pivot()
: Now can have one variable to aggregate and one aggregation variable named x
Regression()
: Debugging code removed from the k-fold cross-validation analysis
ANOVA()
: Scatterplot for 1-way ANOVA now has grid lines and jitter
ANOVA()
: Scatterplot for 1-way ANOVA now has jitter_x
parameter added to customize the jitter level
ANOVA()
: Scatterplot for 1-way ANOVA now has means plotted in a dark red instead of a dark gray
BarChart()
: Tilde removed from each label if for a by
variable in the legend where it is nonfunctional
pivot()
: User defined functions accounted for and named in the output
pivot()
: When analysis of all data values, dropped the Grand_Stat
label
Plot()
: Point color and fit line now a little darker and complements of each other, plot_errors
segments a little darker as well
Plot()
: For fit="reciprocal"
, fit line present for all values of x
, even if plot window does not accommodate the full range, can adjust range with parameter pad_y
Plot()
: Parameter root
added for general root transformation beyond the value of 0.5 for the square root
Plot()
: Parameter fit_power
added to raise y
to the designated power for the exp
, root
, and reciprocal
transformations
Plot()
: Correlational analysis no longer displayed for a non-linear fit line
Plot()
: SSE displayed for corresponding fit
line (curve)
BarChart()
: Two variable chart with parameters x
and by
specified now correctly displays a color range when the style is not the default theme
BarChart()
: Base R text()
function misleadingly displays text at size 1 if a value of 0 is entered, lessR functions that rely on the R text function now fixed by changing an input value of 0 to 0.01
pivot()
: Output variables now properly named
Plot()
: Trellis plots with by1
and by2
parameters work in Shiny environment
BarChart()
: If parameter sta
not specified, there is now no default if the parameter y
variable is specified without an indication of what statistic to compute
BarChart()
: Gentle termination with a message if stats
specifies a transform that results in missing data in the computed summary table
BarChart()
: For analysis of computing a summary table with the parameter stat
parameter, no longer display the summary stats for the parameter x
variable when a by variable is present
getColors()
: Default qualitative color palette softened a bit by increasing luminance from 55 to 60
getColors()
: By default, now does not generate output when called from visualization parameters parameter fill
and parameter color
for parameter BarChart()
, etc.
getColors()
: Output parameter values changed to parameter logical
, parameter TRUE
and parameter FALSE
pivot()
: New parameter parameter table_long
allows the table computation for a cross-tabs table be output in long form
pivot()
: Tabulate computation continues to work, though no longer documented as it is supersede
BarChart()
: Further adjustment to accommodate very large legend labels for two variable plots
BarChart()
: If the y values are input, the display of the values will be properly turned off if the parameter style
option for values is set to parameter "off"
Plot()
: Default jitter for 1-D scatterplot works again in VBS plots
Plot()
: Parameter by
works again for VBS plots, multiple plots on same panel
Plot()
: Parameter color
now works in conjunction with the by
parameter
Plot()
: Applied to a scatterplot of two continuous variables:\cr
if color
is set to "off"
(or "transparent"
), a requested fit line
still displays\cr
means are now properly plotted for a scatterplot of a continuous and
categorical variable when jitter is applied to the plotted points\cr
default outlier shape in gray scale changes to diamond as documented
BarChart()
: For a two categorical variable bar chart, the legend
labels are by default no longer abbreviated, instead sufficient room is generated by the plot
BarChart()
: New parameter legend_abbrev
allows for the specification of the maximum number of characters to display for the legend
labels
BarChart()
: For a two categorical variable bar chart, better default placement of the legend
new parameter legend_adj
allows for horizontal adjustment of the legend
BarChart()
: First variable x
is a vector works again
Histogram()
: Parameter pdf_file
works
Plot()
: Parameter enhance=TRUE
correctly prints least-squares line when fill changed
BarChart()
: When fill or color set with getColors()
, the value of n, the number of colors, is provided according to the number of bars
BarChart()
: When horiz is TRUE
, order of sort changed so that "-"
starts at top
BarChart()
: Space to separate the x
-axis label when several lines revised parameter pdf
to pdf_file
corProp()
: Alias changed from prop
to cp
, so prop
for new function Proportion()
Density()
: Parameter pdf
to pdf_file
Histogram()
: Parameter pdf
to pdf_file
kurtosis()
: New function for computing kurtosis, available for the pivot()
function
pivot()
: Choose any two of the following three parameters: multiple compute functions, multiple values over which to compute, and multiple grouping (by
) variables
pivot()
: No by
variables imply to do grand total(s) without an aggregation
pivot()
: Choose any two of the following three parameters: multiple compute functions, multiple variables, and multiple by
-variables.
pivot()
: Sort parameter applies for a single aggregated numerical variable and specifies the variable or column number to sort
pivot()
: For each non-decimal digit variable with unique data values, display
pivot()
: The variable name and the unique values to better label the output when drilling down into levels of one or more categorical variables
pivot()
: Compute quantiles()
function provides a standard data frame
pivot()
: Compute table()
function provides a cross-classification table for a discrete aggregated variable
pivot()
: Table frequencies can be converted to proportions, and row and column proportions
pivot()
: Parameter out_names
parameter to custom-name aggregated variables
pivot()
: Parameter na_by
renamed to na_by_show
to better communicate its meaning skew and kurtosis added to the list of available descriptive statistics
pivot()
: Parameter q_num
specifies number of intervals for quantiles
pivot()
: Parameter rows
subsets rows of the data frame for analysis
pivot()
: Parameter valu
e changed to parameter variable
pivot()
: Parameters rows_by
and cols_by
changed to by
and by_cols
Plot()
: Scatterplot matrix does a loess fit if fit=TRUE
Plot()
: If multiple x
or y
variables, text correlation analysis displayed for each combination
Plot()
: If a by
variable, then text correlation for each level not yet available, so not displayed
Plot()
: Default for plotting a fit line with by groups is to not plot the standard errors because the plot becomes overcrowded
Plot()
: Warning displayed if the date field or a by/by1 field is not ordered
Plot()
: Stacked time series chart of different shades of same hue, such as "blues"
now have each sub-plot with a black border
Plot()
: Trellis plot distinguishes area_fill
under a line from fill
for a point
Plot()
: radius
value for bubble plot frequency matrix dependent on size of
Plot()
: Largest frequency radius
value displayed for bubble plot frequency matrix
prob_norm()
: Probability is returned as the value of the function, not just printed
prob_tcut()
: Cutoff is returned as the value of the function, not just printed
Prop_test()
: New function for the analysis of proportions: test of homogeneity for one or more samples based on the proportion of successes for a specific value of a variable, and for a single sample, goodness-of-fit for a single categorical variable and test of independence for two categorical variables
Prop_test()
: prop
is alias
recode()
: Changed to lower case function name
rescale()
: Changed to lower case function name
Regression()
: Bubble plot form of a scatterplot for integer
variables with less than 10 unique values and a single predictor variable
skew()
: New function for computing skew, available for the pivot()
function
style()
: slatered
theme added
style()
: ellipse_fill
for default "colors"
theme now a pale brown instead of blue
BarChart()
: With more than 30 categories, a message to indicate how to retrieve now properly displays instead of crashing
BarChart()
: fill=(count)
would fail for a large number of levels
BarChart()
: Stacked bars from multiple items now works with variable labels
BarChart()
: Stacked bars from multiple items now works for binary responses
Extract()
: Now works if name of data frame is a function in another context
Extract()
: Now works even if one variable name is embedded in another name
Plot()
: Forcing an evaluation of a scalar logical on a vector fixed
Plot()
: theme
parameter works for most common parameters
Plot()
: Bubble plot frequency matrix text output works with labels
Plot()
: Bubble plot frequency matrix plot x
-axis works with binary response
Plot()
: area_fill
works correctly
ttest()
: For paired analysis, x
-axis properly labeled Difference
`