Package: dataSDA 0.2.6

Han-Ming Wu

dataSDA: Datasets and Basic Statistics for Symbolic Data Analysis

Provides benchmark datasets and foundational tools for Symbolic Data Analysis (SDA). The package includes functions for constructing symbolic data objects from classical data, converting among different interval-valued data formats, managing interval-valued, histogram-valued, modal-valued, and multi-valued data, and performing basic descriptive statistics. It is designed to support teaching, methodological research, and the development of SDA techniques.

Authors:Po-Wei Chen [aut], Chun-houh Chen [aut], Han-Ming Wu [cre]

dataSDA_0.2.6.tar.gz
dataSDA_0.2.6.tar.gz(r-4.7-any)dataSDA_0.2.6.tar.gz(r-4.6-any)
dataSDA_0.2.6.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
dataSDA/json (API)
NEWS

# Install 'dataSDA' in R:
install.packages('dataSDA', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))
Datasets:

On CRAN:

Conda:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

3.18 score 6 scripts 527 downloads 72 exports 161 dependencies

Last updated from:c061166d40. Checks:4 OK. Indexed: no.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK251
source / vignettesOK252
linux-release-x86_64OK260
wasm-releaseOK222

Exports:aggregate_to_symbolicARRAY_to_iGAPARRAY_to_MMARRAY_to_RSDAcheck_zero_width_intervalsclean_colnameshist_corhist_covhist_meanhist_variGAP_to_ARRAYiGAP_to_MMiGAP_to_RSDAint_centerint_containmentint_convert_formatint_corint_cosineint_covint_cvint_detect_formatint_diceint_dispersionint_distint_dist_allint_dist_matrixint_entropyint_granularityint_imprecisionint_information_contentint_iqrint_jaccardint_kurtosisint_list_conversionsint_madint_meanint_medianint_midrangeint_modeint_overlapint_overlap_coefficientint_pairwise_distint_quantileint_radiusint_rangeint_similarity_matrixint_skewnessint_symmetryint_tailednessint_tanimotoint_trimmed_meanint_trimmed_varint_uniformityint_varint_widthint_winsorized_meanint_winsorized_varMM_to_ARRAYMM_to_iGAPMM_to_RSDAread_symbolic_csvRSDA_formatRSDA_to_ARRAYRSDA_to_iGAPRSDA_to_MMsearch_dataset_variable_formatSODAS_to_ARRAYSODAS_to_iGAPSODAS_to_MMto_all_interval_formatswrite_symbolic_csv

Dependencies:abindaskpassbackportsbase64encbitbit64blobbootbroombslibcachemcarcarDatachronclasscliclustercodetoolscolorspacecowplotcpp11crosstalkcurldata.tableDBIDerivdigestdoBydplyrDTe1071ellipseemmeansestimabilityevaluateFactoMineRfarverfastmapflashClustfontawesomeforcatsforeachforecastFormulafracdifffsgbmgenericsggplot2ggpolypathggrepelggridgesglmnetgluegsubfngtableherehighrHistDAWasshistogramhtmltoolshtmlwidgetshttrigraphirlbaisobanditeratorsjquerylibjsonlitekknnknitrlabelinglaterlatticelazyevalleapslifecyclelme4lmtestmagrittrMASSMatrixMatrixModelsmemoisemgcvmicrobenchmarkmimeminqamodelrmultcompViewmvtnormneuralnetnlmenloptrnnetnumDerivopensslotelpbkrtestpillarpkgconfigplotlyplyrpngprincurvepromisesprotoproxypurrrquantregR6randomcoloRrandomForestrappdirsrbibutilsRColorBrewerRcppRcppArmadilloRcppEigenRcppTOMLRdpackreformulasreshapereticulateRJSONIOrlangrmarkdownrpartrprojrootRSDARSpectraRSQLiteRtsneS7sassscalesscatterplot3dshapeSparseMsqldfstringistringrsurvivalsystibbletidyrtidyselecttimeDatetinytexumapurcautf8V8vctrsviridisLitewithrxfunXMLxtableyamlzoo

Introduction to dataSDA

Rendered fromdataSDA_intro.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-06-12
Started: 2026-02-11

Readme and manuals

Help Manual

Help pageTopics
Abalone Dataset (iGAP Format)abalone.iGAP
Abalone Interval Datasetabalone.int
Acid Rain Pollution Indices Interval Datasetacid_rain.int
Age-Cholesterol-Weight Interval Datasetage_cholesterol_weight.int
World Age Pyramids Histogram-Valued Dataset (2014)age_pyramids.hist
Aggregate Tabular Data to Symbolic Dataaggregate_to_symbolic
JFK Airport Airline Flights Histogram-Valued Datasetairline_flights.hist
JFK Airport Airline Flights Modal-Valued Datasetairline_flights2.modal
ARRAY to iGAPARRAY_to_iGAP
ARRAY to MMARRAY_to_MM
ARRAY to RSDAARRAY_to_RSDA
Bank Interest Rates AR Model Symbolic Datasetbank_rates
Baseball Teams Interval Datasetbaseball.int
Bat Species Interval Datasetbats.int
Bird Color Taxonomy Histogram Datasetbird_color_taxonomy.hist
Bird Species Extended Mixed Symbolic Datasetbird_species_extended.mix
Bird Species Mixed Symbolic Datasetbird_species.mix
Bird Species Mixed Symbolic Datasetbird.mix
Blood Pressure Interval Datasetblood_pressure.int
Blood Test Histogram Datasetblood.hist
Italian Car Models Interval Datasetcar_models.int
Car Models Interval Datasetcar.int
Cardiological Examination Interval Datasetcardiological.int
Cars Interval Datasetcars.int
Census Mixed Symbolic Datasetcensus.mix
Check for Zero-Width Intervalscheck_zero_width_intervals
Chinese Climate Monthly Histogram Datasetchina_climate_month.hist
Chinese Climate Seasonal Histogram Datasetchina_climate_season.hist
China Monthly Temperature Intervals (15 Stations)china_temp_monthly.int
China Meteorological Stations Quarterly Temperature Interval Datasetchina_temp.int
Cholesterol by Gender and Age Histogram-Valued Datasetcholesterol.hist
clean_colnamesclean_colnames
County Income by Gender Histogram-Valued Datasetcounty_income_gender.hist
Forest Cover Types Histogram-Valued Datasetcover_types.hist
Credit Card Expenses Interval Datasetcredit_card.int
Crime Demographics Datasetcrime.modal
Crime Demographics Modal-Valued Datasetcrime2.modal
WTI Crude Oil Futures Daily High/Low Interval Time Seriescrude_oil_wti.its
Dow Jones Industrial Average Daily High/Low Interval Time Seriesdjia.its
E. coli Transport Routes Interval Datasetecoli_routes.int
European Employment by Gender and Age Interval Datasetemployment.int
US Energy Consumption Distribution-Valued Datasetenergy_consumption.distr
Energy Usage Distribution-Valued Datasetenergy_usage.distr
EPA Environmental Data Mixed Symbolic Datasetenvironment.mix
Euro/Dollar Exchange Rate Daily High/Low Interval Time Serieseuro_usd.its
Exchange Rate Returns Histogram Time Seriesexchange_rate_returns.hist
Face Dataset (iGAP Format)face.iGAP
Finance Sector Interval Datasetfinance.int
Airline Flights Detailed Histogram-Valued Datasetflights_detail.hist
French Agriculture Histogram-Valued Datasetfrench_agriculture.hist
Freshwater Fish Heavy Metal Bioaccumulation Interval Datasetfreshwater_fish.int
Fuel Consumption by Region Datasetfuel_consumption.modal
Fungi Morphological Measurements Interval Datasetfungi.int
Genome Dinucleotide Abundance Intervalsgenome_abundances.int
Blood Glucose Histogram-Valued Datasetglucose.hist
Hardwood Tree Species Histogram-Valued Datasethardwood.hist
Human Development Index and Gender Indicators Interval Datasethdi_gender.int
Health Insurance Mixed Symbolic Datasethealth_insurance.mix
Health Insurance Modal-Valued Datasethealth_insurance2.modal
Hematocrit and Hemoglobin Bivariate Histogram-Valued Datasethematocrit_hemoglobin.hist
Hematocrit by Gender and Age Histogram-Valued Datasethematocrit.hist
Hemoglobin by Gender and Age Histogram-Valued Datasethemoglobin.hist
Hierarchy Datasethierarchy
Hierarchical Symbolic Dataset with Mixed Typeshierarchy.hist
Hierarchy Interval Datasethierarchy.int
Statistics for Histogram Datahistogram_stats hist_cor hist_cov hist_mean hist_var
Horse Breeds Interval Datasethorses.int
Hospital Costs Histogram-Valued Datasethospital.hist
Household Characteristics Distribution-Valued Datasethousehold_characteristics.distr
IBOVESPA Daily High/Low Interval Time Seriesibovespa.its
iGAP to ARRAYiGAP_to_ARRAY
iGAP to MMiGAP_to_MM
iGAP to RSDAiGAP_to_RSDA
Convert Interval Data Formatint_convert_format
Detect Interval Data Formatint_detect_format
List Available Format Conversionsint_list_conversions
Distance Measures for Interval Datainterval_distance int_dist int_dist_all int_dist_matrix int_pairwise_dist
Geometric Properties of Interval Datainterval_geometry int_center int_containment int_midrange int_overlap int_radius int_width
Position and Scale Measures for Interval Datainterval_position int_iqr int_mad int_median int_mode int_quantile int_range
Robust Statistics for Interval Datainterval_robust int_trimmed_mean int_trimmed_var int_winsorized_mean int_winsorized_var
Distribution Shape Measures for Interval Datainterval_shape int_kurtosis int_skewness int_symmetry int_tailedness
Similarity Measures for Interval Datainterval_similarity int_cosine int_dice int_jaccard int_overlap_coefficient int_similarity_matrix int_tanimoto
Statistics for Interval Datainterval_stats int_cor int_cov int_mean int_var
Uncertainty and Variability Measures for Interval Datainterval_uncertainty int_cv int_dispersion int_entropy int_granularity int_imprecision int_information_content int_uniformity
Iris Species Histogram-Valued Datasetiris_species.hist
Iris Species Interval Datasetiris.int
Irish Wind Speed Monthly Interval Time Seriesirish_wind.its
Joggers Mixed Symbolic Datasetjoggers.mix
Judge 1 Interval-Valued Ratingsjudge1.int
Judge 2 Interval-Valued Ratingsjudge2.int
Judge 3 Interval-Valued Ratingsjudge3.int
Lack of Information Questionnaire Interval Datasetlackinfo.int
Lisbon Air Quality Daily Interval Datasetlisbon_air_quality.int
Loans by Purpose Interval Datasetloans_by_purpose.int
Lending Club Loans by Risk Level (Quantile-Based Intervals)loans_by_risk_quantile.int
Lending Club Loans by Risk Levelloans_by_risk.int
Lung Cancer Treatments by State Histogram-Valued Datasetlung_cancer.hist
Lynne1 Blood Pressure Interval Datasetlynne1.int
MERVAL Index Weekly Min/Max Interval Time Seriesmerval.its
MM to ARRAYMM_to_ARRAY
MM to iGAPMM_to_iGAP
MM to RSDAMM_to_RSDA
Motor Trend Cars Mixed Symbolic Datasetmtcars.mix
Mushroom Species Fuzzy/Symbolic Datasetmushroom_fuzzy.mix
Mushroom Species Interval Datasetmushroom.int
Mushroom Species Dataset (Original Format)mushroom.int.mm
New York City Flights Interval Datasetnycflights.int
Occupation Salaries Datasetoccupations.modal
Occupation Salaries Modal-Valued Datasetoccupations2.modal
Ohio River Basin 30-Year Trimmed Mean Daily Temperatures Interval Datasetohtemp.int
Oils and Fats Interval Datasetoils.int
Ozone Air Quality Histogram-Valued Datasetozone.hist
Petrobras Stock Daily High/Low Interval Time Seriespetrobras.its
Polish Car Models Mixed Symbolic Datasetpolish_cars.mix
Polish Voivodships Socio-Economic Intervalspolish_voivodships.int
Profession Work Salary Time Interval Datasetprofession.int
Prostate Cancer Clinical Interval Datasetprostate.int
Read a Symbolic Data CSV Fileread_symbolic_csv
RSDA FormatRSDA_format
RSDA to ARRAYRSDA_to_ARRAY
RSDA to iGAPRSDA_to_iGAP
RSDA to MMRSDA_to_MM
Search Datasetssearch_data
Set Variable Formatset_variable_format
Shanghai Stock Exchange Composite Index Daily High/Low Interval Time Seriesshanghai_stock.its
Simulated Histogram-Valued Datasetsimulated.hist
French Soccer Championship Bivariate Interval Datasetsoccer_bivar.int
SODAS to ARRAYSODAS_to_ARRAY
SODAS to iGAPSODAS_to_iGAP
SODAS to MMSODAS_to_MM
S&P 500 Daily High/Low Interval Time Seriessp500.its
State Income Histogram-Valued Datasetstate_income.hist
Synthetic Interval Clusters Datasetsynthetic_clusters.int
Pickup League Teams Interval Datasetteams.int
World Cities Monthly Temperature Interval Datasettemperature_city.int
Tennis Court Types Interval Datasettennis.int
Convert Interval Data to All Supported Formatsto_all_interval_formats
Town Services Concatenated Mixed Symbolic Datasettown_services.mix
Trivial and Non-Trivial Intervals Example Datasettrivial_intervals.int
US Crime Statistics Interval Datasetuscrime.int
Utah Snow Load Interval Datasetutsnow.int
Veterinary Interval Datasetveterinary.int
Video Platform User Engagement Intervals (Dataset 1)video1.int
Video Platform User Engagement Intervals (Dataset 2)video2.int
Video Platform User Engagement Intervals (Dataset 3)video3.int
Water Flow Sensor Readings Interval Datasetwater_flow.int
Weight by Age Group Histogram-Valued Datasetweight_age.hist
Wine Chemical Properties Interval Datasetwine.int
World Cup Soccer Teams Interval Datasetworld_cup.int
Write Symbolic Data to a CSV Filewrite_symbolic_csv