Package: registr 2.1.0

Julia Wrobel

registr: Curve Registration for Exponential Family Functional Data

A method for performing joint registration and functional principal component analysis for curves (functional data) that are generated from exponential family distributions. This mainly implements the algorithms described in 'Wrobel et al. (2019)' <doi:10.1111/biom.12963> and further adapts them to potentially incomplete curves where (some) curves are not observed from the beginning and/or until the end of the common domain. Curve registration can be used to better understand patterns in functional data by separating curves into phase and amplitude variability. This software handles both binary and continuous functional data, and is especially applicable in accelerometry and wearable technology.

Authors:Julia Wrobel [aut, cre], Alexander Bauer [aut], Erin McDonnell [aut], Fabian Scheipl [ctb], Jeff Goldsmith [aut]

registr_2.1.0.tar.gz
registr_2.1.0.tar.gz(r-4.5-noble)registr_2.1.0.tar.gz(r-4.4-noble)
registr_2.1.0.tgz(r-4.4-emscripten)registr_2.1.0.tgz(r-4.3-emscripten)
registr.pdf |registr.html
registr/json (API)
NEWS

# Install 'registr' in R:
install.packages('registr', repos = 'https://cloud.r-project.org')
Uses libs:
  • openblas– Optimized BLAS
  • c++– GNU Standard C++ Library v3
Datasets:

On CRAN:

Conda:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

openblascpp

3.00 score 164 downloads 1 mentions 12 exports 38 dependencies

Last updated 3 years agofrom:ef2f4edda6. Checks:1 OK, 2 NOTE. Indexed: no.

TargetResultLatest binary
Doc / VignettesOKMar 28 2025
R-4.5-linux-x86_64NOTEMar 28 2025
R-4.4-linux-x86_64NOTEMar 28 2025

Exports:bfpcabs_derivconstraintsdata_cleanfpca_gaussgfpca_twoSteploss_hloss_h_gradientregister_fpcaregistrsimulate_functional_datasimulate_unregistered_curves

Dependencies:bootclicpp11dplyrfansigamm4genericsgluelatticelifecyclelme4magrittrMASSMatrixmgcvminqanlmenloptrpbspillarpkgconfigpurrrR6rbibutilsRcppRcppArmadilloRcppEigenRdpackreformulasrlangstringistringrtibbletidyrtidyselectutf8vctrswithr

Registering Incomplete Curves

Rendered fromincomplete_curves.Rmdusingknitr::rmarkdownon Mar 28 2025.

Last update: 2022-10-02
Started: 2022-10-02

registr: a vignette

Rendered fromregistr.Rmdusingknitr::rmarkdownon Mar 28 2025.

Last update: 2022-10-02
Started: 2020-03-16

Citation

To cite package ‘registr’ in publications use:

Wrobel J, Bauer A, McDonnell E, Goldsmith J (2022). registr: Curve Registration for Exponential Family Functional Data. R package version 2.1.0, https://CRAN.R-project.org/package=registr.

Corresponding BibTeX entry:

  @Manual{,
    title = {registr: Curve Registration for Exponential Family
      Functional Data},
    author = {Julia Wrobel and Alexander Bauer and Erin McDonnell and
      Jeff Goldsmith},
    year = {2022},
    note = {R package version 2.1.0},
    url = {https://CRAN.R-project.org/package=registr},
  }

Readme and manuals

registr

Registration for incomplete exponential family functional data.

What it does

Functional data analysis is a set of tools for understanding patterns and variability in data where the basic unit of observation is a curve measured over some domain such as time or space. An example is an accelerometer study where intensity of physical activity was measured at each minute over 24 hours for 50 subjects. The data will contain 50 curves, where each curve is the 24-hour activity profile for a particular subject.

Classic functional data analysis assumes that each curve is continuous or comes from a Gaussian distribution. However, applications with exponential family functional data – curves that arise from any exponential family distribution, and have a smooth latent mean – are increasingly common. For example, take the accelerometer data just mentioned, but assume researchers are interested in sedentary behavior instead of activity intensity. At each minute over 24 hours they collect a binary measurement that indicates whether a subject was active or inactive (sedentary). Now we have a binary curve for each subject – a trajectory where each time point can take on a value of 0 or 1. We assume the binary curve has a smooth latent mean, which in this case is interpreted as the probability of being active at each minute over 24 hours. This is a example of exponential family functional data.

Often in a functional dataset curves have similar underlying patterns but the main features of each curve, such as the minimum and maximum, have shifts such that the data appear misaligned. This misalignment can obscure patterns shared across curves and produce messy summary statistics. Registration methods reduce variability in functional data and clarify underlying patterns by aligning curves.

This package implements statistical methods for registering exponential family functional data. The basic methods are described in more detail in our paper and were further adapted to (potentially) incomplete curve settings where (some) curves are not observed from the very beginning and/or until the very end of the common domain. For details on the incomplete curve methodology and how to use it see the corresponding package vignette. Instructions for installing the software and using it to register simulated binary data are provided below.

Installation

To install from CRAN, please use:

install.packages("registr")

To install the latest version directly from Github, please use:

install.packages("devtools")
devtools::install_github("julia-wrobel/registr")

The registr package includes vignettes with more details on package use and functionality. To install the latest version and pull up the vignettes please use:

devtools::install_github("julia-wrobel/registr", build_vignettes = TRUE)
vignette(package = "registr")
How to use it

This example registers simulated binary data. More details on the use of the package can be found in the vignettes mentioned above.

The code below uses registr::simulate_unregistered_curves() to simulate curves for 100 subjects with 200 timepoints each, observed over domain (0, 1). All curves have similar structure but the location of the peak is shifted. On the observed domain t^* the curves are unregistered (misaligned). On the domain t the curves are registered (aligned).

library(registr)

registration_data = simulate_unregistered_curves(I = 100, D = 200, seed = 2018)

The plot below shows the unregistered curves and registered curves.

Continuously observed curves are shown above in order to illustrate the misalignment problem and our simulated data; the simulated dataset also includes binary values which have been generated by using these continuous curves as probabilities. The unregistered and registered binary curves for two subjects are shown below.

Our software registers curves by estimating t. For this we use the function registration_fpca().

binary_registration = register_fpca(Y = registration_data, family = "binomial", 
                                    Kt = 6, Kh = 4, npc  = 1)
## Running initial registration step
## current iteration: 1
## Running final FPCA step

The plot below shows unregistered, true registered, and estimated registered binary curves for two subjects after fitting our method.

Citation

If you like our software, please cite it in your work! To cite the latest CRAN version of the package with BibTeX, use

@Manual{,
    title = {registr: Registration for Exponential Family Functional Data},
    author = {Julia Wrobel and Alexander Bauer and Erin McDonnell and Jeff Goldsmith},
    year = {2022},
    note = {R package version 2.1.0},
    url = {https://CRAN.R-project.org/package=registr},
  }

To cite the 2021 Journal of Open Source Software paper, use

@article{wrobel2021registr,
  title={registr 2.0: Incomplete Curve Registration for Exponential Family Functional Data},
  author={Wrobel, Julia and Bauer, Alexander},
  journal={Journal of Open Source Software},
  volume={6},
  number={61},
  pages={2964},
  year={2021}
}

To cite the 2018 Journal of Open Source Software paper, use

@article{wrobel2018regis,
  title={registr: Registration for Exponential Family Functional Data},
  author={Wrobel, Julia},
  journal={The Journal of Open Source Software},
  volume={3},
  year={2018}
}
Contributions

If you find small bugs, larger issues, or have suggestions, please file them using the issue tracker or email the maintainer at julia.wrobel@cuanschutz.edu. Contributions (via pull requests or otherwise) are welcome.

Help Manual

Help pageTopics
Simulate amplitude varianceamp_curve
Binary functional principal components analysisbfpca
Internal main preparation function for bfpcabfpca_argPreparation
Internal main optimization for bfpcabfpca_optimization
Nth derivative of spline basisbs_deriv
Coarsen an index vector to a given resolutioncoarsen_index
Define constraints for optimization of warping functionsconstraints
Covariance estimation after Hall et al. (2008)cov_hall
Crossproduct computation for highly irregular gridscrossprods_irregular
Crossproduct computation for mostly regular gridscrossprods_regular
Convert data to a 'refund' objectdata_clean
Estimate the derivative of the logit functionderiv.inv.logit
Determine the number of FPCs based on the share of explained variancedetermine_npc
Correct slightly improper parameter vectorsensure_proper_beta
Calculate expected score and score variance for the current subject.expectedScores
Estimate variational parameter for the current subject.expectedXi
Functional principal components analysis via variational EMfpca_gauss
Internal main preparation function for fpca_gaussfpca_gauss_argPreparation
Internal main optimization for fpca_gaussfpca_gauss_optimization
Generalized functional principal component analysisgfpca_twoStep
Generate subject-specific grid (t_star)grid_subj_create
Berkeley Growth Study data with simulated incompletenessgrowth_incomplete
Create initial parameters for (inverse) warping functionsinitial_params
Apply lambda transformation of variational parameter.lambdaF
Loss function for registration step optimizationloss_h
Gradient of loss function for registration steploss_h_gradient
Simulate mean curvemean_curve
Simulate meanmean_sim
NHANES activity datanhanes
Create two-parameter piecewise linear (inverse) warping functionspiecewise_linear2_hinv
Plot the results of a functional PCAplot.fpca
Simulate PC1psi1_sim
Simulate PC2psi2_sim
Register curves using constrained optimization and GFPCAregister_fpca
Register Exponential Family Functional Dataregistr
Internal function to register one curveregistr_oneCurve
Simulate functional datasimulate_functional_data
Simulate unregistered curvessimulate_unregistered_curves
Calculate quadratic form of spline basis functions for the current subject.squareTheta