Package 'DAKS' reference manual

Title:	Data Analysis and Knowledge Spaces
Description:	Functions and an example dataset for the psychometric theory of knowledge spaces. This package implements data analysis methods and procedures for simulating data and quasi orders and transforming different formulations in knowledge space theory. See package?DAKS for an overview.
Authors:	Ali Uenlue [aut, cre], Anatol Sargin [aut]
Maintainer:	Ali Uenlue <[email protected]>
License:	GPL (>= 2)
Version:	2.1-3
Built:	2025-02-03 06:26:54 UTC
Source:	CRAN

Data Analysis and Knowledge Spaces: The R Package DAKS

Description

The package DAKS implements three inductive item tree analysis algorithms for deriving quasi orders from binary data, the original, corrected, and minimized corrected algorithms. It provides functions for computing population and estimated asymptotic variances of the diff fit measures, and for switching between test item and knowledge state representations. Other features are a Hasse diagram drawing device, a data and quasi order simulation tool based on a finite mixture latent variable model, and a function for computing response pattern and knowledge state frequencies.

Details

Package:	DAKS
Type:	Package
Version:	2.1-3
Date:	2016-06-05
License:	GPL (>= 2)

Knowledge space theory is a recent psychometric test theory based on combinatorial mathematical structures (order and lattice theory); see Doignon and Falmagne (1999). Solvability dependencies between dichotomous test items play an important role in knowledge space theory. Utilizing hypothesized dependencies between items, knowledge space theory has been successfully applied for the computerized, adaptive assessment and training of knowledge. For instance, see the ALEKS system, a fully automated math tutor on the Internet (http://www.aleks.com/).

The package DAKS is implemented based on the S3 system. It comes with a namespace and consists of the following functions (all functions are external, there are no internal functions): corr_iita, hasse, iita, imp2state, ind_gen, mini_iita, ob_counter, orig_iita, pattern, pop_iita, pop_variance, print.iita, print.pat, print.popiita, print.summpopiita, print.ztest, simu, state2imp, summary.iita, summary.popiita, variance, and z_test. There is an empirical dataset, pisa, accompanying the package DAKS. This dataset is part of the 2003 Programme for International Student Assessment (PISA; http://www.pisa.oecd.org/).

Author(s)

Anatol Sargin, Ali Uenlue

Maintainer: Ali Uenlue [email protected]

References

Doignon, J.-P. and Falmagne, J.-C. (1999) Knowledge Spaces. Berlin, Heidelberg, and New York: Springer-Verlag.

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Schrepp, M. (1999) On the empirical construction of implications between bi-valued test items. Mathematical Social Sciences, 38, 361–375.

Schrepp, M. (2003) A method for the analysis of hierarchical dependencies between items of a questionnaire. Methods of Psychological Research, 19, 43–79.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Corrected Inductive Item Tree Analysis

Description

corr_iita performs the corrected inductive item tree analysis procedure and returns the corresponding diff values.

Usage

corr_iita(dataset, A)
corr_iita(dataset, A)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`A`	a required list of competing quasi orders (surmise relations), for instance obtained from a call to `ind_gen`.

Details

Corrected inductive item tree analysis is a data analysis method for deriving knowledge structures (more precisely, surmise relations) from binary data. Details on this procedure can be found in iita. The set of competing quasi orders is passed via the argument A, so any selection set of quasi orders can be used.

The set of competing quasi orders must be a list of objects of the class set. These objects (quasi orders) consist of $2$ -tuples $(i, j)$ of the class tuple, where a $2$ -tuple $(i, j)$ is interpreted as 'mastering item $j$ implies mastering item $i$ .'

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments dataset and A are of required types, corr_iita returns a named list of the following components:

`diff.value`	a vector of the diff values corresponding to the competing quasi orders in `A`.
`error.rate`	a vector of the error rates corresponding to the competing quasi orders in `A`.

Note

The function iita can be used to perform one of the three inductive item tree analysis procedures (including the corrected inductive item tree analysis method) selectively. Whereas for the function corr_iita a selection set of competing quasi orders has to be passed via the argument A manually, iita automatically generates a selection set from the data using the inductive generation procedure implemented in ind_gen.

The latter approach using iita is common so far, in knowledge space theory, where the inductive data analysis methods have been utilized for exploratory derivations of surmise relations from data. The function corr_iita, on the other hand, can be used to select among surmise relations for instance obtained from querying experts or from competing psychological theories.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

ind <- ind_gen(ob_counter(pisa))
corr_iita(pisa, ind)
ind <- ind_gen(ob_counter(pisa))
corr_iita(pisa, ind)

Hasse Diagram of Surmise Relation

Description

hasse plots the Hasse diagram of a surmise relation (more precisely, of its corresponding quotient set).

Usage

hasse(imp, items)
hasse(imp, items)

Arguments

`imp`	a required object of class `set` representing the set of implications, for instance obtained from a call to `iita`.
`items`	a required numeric giving the number of items of the domain taken as basis for `imp`.

Value

If the arguments imp and items are of required types, hasse produces a plot, and returns a list of the equally informative items.

Note

The function hasse is not capable of plotting equally informative items. This is why equally informative items are returned in a list.

A set of implications, an object of the class set, consists of $2$ -tuples $(i, j)$ of the class tuple, where a $2$ -tuple $(i, j)$ is interpreted as 'mastering item $j$ implies mastering item $i$ .'

Author(s)

Anatol Sargin, Ali Uenlue

References

Doignon, J.-P. and Falmagne, J.-C. (1999) Knowledge Spaces. Berlin, Heidelberg, and New York: Springer-Verlag.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

## requires the package Rgraphviz from Bioconductor
## users must have installed Graphviz on their computers
## Not run: 
hasse(iita(pisa, v = 2)$implications, 5)

## End(Not run)
## requires the package Rgraphviz from Bioconductor
## users must have installed Graphviz on their computers
## Not run: 
hasse(iita(pisa, v = 2)$implications, 5)

## End(Not run)

Inductive Item Tree Analysis

Description

iita can be used to perform one of the three inductive item tree analysis algorithms (original, corrected, and minimized corrected) selectively.

Usage

iita(dataset, v)
iita(dataset, v)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`v`	a required numeric giving the inductive item tree analysis algorithm to be performed; `v = 1` (minimized corrected), `v = 2` (corrected), and `v = 3` (original).

Details

The three inductive item tree analysis algorithms are exploratory methods for extracting quasi orders (surmise relations) from data. In each algorithm, competing binary relations are generated (in the same way for all three versions), and a fit measure (differing from version to version) is computed for every relation of the selection set in order to find the quasi order that fits the data best. In all three algorithms, the idea is to estimate the numbers of counterexamples for each quasi order, and to find, over all competing quasi orders, the minimum value for the discrepancy between the observed and expected numbers of counterexamples. The three data analysis methods differ in their choices of estimates for the expected numbers of counterexamples. (For an item pair $(i, j)$ , the number of subjects solving item $j$ but failing to solve item $i$ , is the corresponding number of counterexamples. Their response patterns contradict the interpretation of $(i, j)$ as 'mastering item $j$ implies mastering item $i$ .') The algorithms are described in the paper about the DAKS package by Uenlue and Sargin (2010), and in the paper by Sargin and Uenlue (2009).

iita calls ind_gen for constructing the set of competing quasi orders according to the inductive generation procedure. Subject to the selected version to be performed, iita computes the discrepancies between observed and expected numbers of counterexamples under each relation, and finds a quasi order with the minimum discrepancy (diff) value.

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments dataset and v are of required types, iita returns a named list consisting of the following five components:

`diff`	a vector giving the diff values corresponding to the (inductively generated) competing quasi orders.
`implications`	an object of class `set` representing the solution quasi order (with smallest diff value) under the selected algorithm.
`error.rate`	a value giving the estimated error rate corresponding to the best fitting quasi order.
`selection.set.index`	a numeric giving the index of the solution quasi order in the selection set.
`v`	the version used; `v = 1` (minimized corrected), `v = 2` (corrected), and `v = 3` (original).

Note

The function iita can be used to perform one of the three inductive item tree analysis procedures selectively. Whereas for the functions orig_iita, corr_iita, mini_iita selection sets of competing quasi orders have to be passed via an argument manually, iita automatically generates a selection set from the data using the inductive generation procedure implemented in ind_gen.

The latter approach using iita is common so far, in knowledge space theory, where the inductive data analysis methods have been utilized for exploratory derivations of surmise relations from data. The functions orig_iita, corr_iita, mini_iita, on the other hand, can be used to select among surmise relations for instance obtained from querying experts or from competing psychological theories.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Schrepp, M. (1999) On the empirical construction of implications between bi-valued test items. Mathematical Social Sciences, 38, 361–375.

Schrepp, M. (2003) A method for the analysis of hierarchical dependencies between items of a questionnaire. Methods of Psychological Research, 19, 43–79.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

iita(pisa, v = 1)
iita(pisa, v = 3)
iita(pisa, v = 1)
iita(pisa, v = 3)

Transformation from Implications to Knowledge States

Description

imp2state transforms a set of implications (ought to be a surmise relation) to the corresponding set of knowledge states (the quasi ordinal knowledge space).

Usage

imp2state(imp, items)
imp2state(imp, items)

Arguments

`imp`	a required object of class `set` representing the set of implications, for instance obtained from a call to `iita`.
`items`	a required numeric giving the number of items of the domain taken as basis for `imp`.

Value

If the arguments imp and items are of required types, imp2state returns a matrix consisting of ones or zeros (the quasi ordinal knowledge space), in which each row represents the $1$ / $0$ -pattern of a knowledge state.

Note

For any set of implications the returned knowledge structure is a quasi ordinal knowledge space. In case of a surmise relation this is Birkhoff's theorem. For details refer to Doignon and Falmagne (1999, Theorem 1.49).

Author(s)

Anatol Sargin, Ali Uenlue

References

Doignon, J.-P. and Falmagne, J.-C. (1999) Knowledge Spaces. Berlin, Heidelberg, and New York: Springer-Verlag.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

x <- iita(pisa, v = 1)
imp2state(x$implications, ncol(pisa))
x <- iita(pisa, v = 1)
imp2state(x$implications, ncol(pisa))

Inductive Generation Procedure

Description

ind_gen generates inductively a set of competing quasi orders.

Usage

ind_gen(b)
ind_gen(b)

Arguments

`b`	a required matrix of the numbers of counterexamples for all pairs of items, for instance obtained from a call to `ob_counter`.

Value

If the argument b is of required type, ind_gen returns a list of the inductively generated quasi orders.

Note

The function iita calls ind_gen for constructing the set of competing quasi orders according to the inductive generation procedure.

The set of competing quasi orders is a list of objects of the class set. These objects (quasi orders) consist of $2$ -tuples $(i, j)$ of the class tuple, where a $2$ -tuple $(i, j)$ is interpreted as 'mastering item $j$ implies mastering item $i$ .'

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Schrepp, M. (1999) On the empirical construction of implications between bi-valued test items. Mathematical Social Sciences, 38, 361–375.

Schrepp, M. (2003) A method for the analysis of hierarchical dependencies between items of a questionnaire. Methods of Psychological Research, 19, 43–79.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

ob <- ob_counter(pisa)
ind_gen(ob)
ob <- ob_counter(pisa)
ind_gen(ob)

Minimized Corrected Inductive Item Tree Analysis

Description

mini_iita performs the minimized corrected inductive item tree analysis procedure and returns the corresponding diff values.

Usage

mini_iita(dataset, A)
mini_iita(dataset, A)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`A`	a required list of competing quasi orders (surmise relations), for instance obtained from a call to `ind_gen`.

Details

Minimized corrected inductive item tree analysis is a data analysis method for deriving knowledge structures (more precisely, surmise relations) from binary data. Details on this procedure can be found in iita. The set of competing quasi orders is passed via the argument A, so any selection set of quasi orders can be used.

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments dataset and A are of required types, corr_iita returns a named list of the following components:

`diff.value`	a vector of the diff values corresponding to the competing quasi orders in `A`.
`error.rate`	a vector of the error rates corresponding to the competing quasi orders in `A`.

Note

The function iita can be used to perform one of the three inductive item tree analysis procedures (including the minimized corrected inductive item tree analysis method) selectively. Whereas for the function mini_iita a selection set of competing quasi orders has to be passed via the argument A manually, iita automatically generates a selection set from the data using the inductive generation procedure implemented in ind_gen.

The latter approach using iita is common so far, in knowledge space theory, where the inductive data analysis methods have been utilized for exploratory derivations of surmise relations from data. The function mini_iita, on the other hand, can be used to select among surmise relations for instance obtained from querying experts or from competing psychological theories.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

ind <- ind_gen(ob_counter(pisa))
mini_iita(pisa, ind)
ind <- ind_gen(ob_counter(pisa))
mini_iita(pisa, ind)

Computation of Numbers of Counterexamples

Description

ob_counter computes from a dataset for all item pairs the corresponding numbers of counterexamples.

Usage

ob_counter(dataset)
ob_counter(dataset)

Arguments

dataset

a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.

Details

For an item pair $(i, j)$ , the number of subjects solving item $j$ but failing to solve item $i$ , is the corresponding number of counterexamples. Their response patterns contradict the interpretation of $(i, j)$ as 'mastering item $j$ implies mastering item $i$ .'

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the argument dataset is of required type, ob_counter returns a matrix of the numbers of counterexamples for all pairs of items.

Note

The function ind_gen can be used to inductively generate from the returned matrix of the numbers of counterexamples a set of quasi orders.

The function iita calls ob_counter.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

ob_counter(pisa)
ob_counter(pisa)

Original Inductive Item Tree Analysis

Description

orig_iita performs the original inductive item tree analysis procedure and returns the corresponding diff values.

Usage

orig_iita(dataset, A)
orig_iita(dataset, A)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`A`	a required list of competing quasi orders (surmise relations), for instance obtained from a call to `ind_gen`.

Details

Original inductive item tree analysis is a data analysis method for deriving knowledge structures (more precisely, surmise relations) from binary data. Details on this procedure can be found in iita. The set of competing quasi orders is passed via the argument A, so any selection set of quasi orders can be used.

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments dataset and A are of required types, corr_iita returns a named list of the following components:

`diff.value`	a vector of the diff values corresponding to the competing quasi orders in `A`.
`error.rate`	a vector of the error rates corresponding to the competing quasi orders in `A`.

Note

The function iita can be used to perform one of the three inductive item tree analysis procedures (including the original inductive item tree analysis method) selectively. Whereas for the function orig_iita a selection set of competing quasi orders has to be passed via the argument A manually, iita automatically generates a selection set from the data using the inductive generation procedure implemented in ind_gen.

The latter approach using iita is common so far, in knowledge space theory, where the inductive data analysis methods have been utilized for exploratory derivations of surmise relations from data. The function orig_iita, on the other hand, can be used to select among surmise relations for instance obtained from querying experts or from competing psychological theories.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples


ind <- ind_gen(ob_counter(pisa))
orig_iita(pisa, ind)
ind <- ind_gen(ob_counter(pisa))
orig_iita(pisa, ind)

Frequencies of Response Patterns and Knowledge States

Description

pattern computes the absolute frequencies of the (occurring) response patterns, and optionally, the absolute frequencies of a collection of specified knowledge states in a dataset.

Usage

pattern(dataset, n = 5, P = NULL)
pattern(dataset, n = 5, P = NULL)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`n`	an optional numeric, with default `n = 5`, giving the $n$ highest frequencies and their corresponding response patterns to be returned.
`P`	an optional matrix of ones and zeros giving the knowledge states to be used. The default `P = NULL` corresponds to no knowledge states being specified.

Details

This function can be used to retrieve information about how often response patterns and knowledge states occur in a dataset. The argument n refers to response patterns, not knowledge states, and in particular is independent of specifications of the argument P. If pattern is called without specifying $n$ explicitly, the response patterns with the five highest frequencies are returned (along with their frequencies). If $n$ is specified, the response patterns with the $n$ highest frequencies are returned (along with their frequencies). If $n$ is larger than the number of different response patterns in the dataset, $n$ is set the number of different response patterns.

The knowledge states are represented as $1$ / $0$ -patterns and are the rows of the argument matrix P. The matrix P must contain only ones and zeros, which encode whether or not an item belongs to a knowledge state, respectively. If P is not specified, pattern only returns information about response patterns (as described previously).

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments dataset, n, and P are of required types, pattern returns a named list consisting of the following three components:

`response.patterns`	an array giving the response patterns (with the $n$ highest frequencies) and their absolute frequencies in `dataset`.
`states`	a matrix of the knowledge states and their absolute frequencies in `dataset`. Each row represents a knowledge state, the last column gives the frequencies of the states. If `P = NULL`, the component `states` is `NULL`.
`n`	a numeric giving the number of response patterns that are returned.

Note

Although pattern is intended for use with dichotomous data, it also works with polytomously scored items.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

pattern(pisa, n = 3)
pattern(pisa)
pattern(pisa, n = 3)
pattern(pisa)

Programme for International Student Assessment (PISA) Data

Description

The accompanying binary dataset is part of the empirical 2003 Programme for International Student Assessment (PISA) data. It contains the item responses by $340$ German students on a $5$ -item dichotomously scored mathematical literacy test.

Usage

pisa
pisa

Format

The pisa data frame consists of $340$ rows and $5$ columns, representing the response patterns of the students to the test items. Each number, an integer, $1$ or $0$ , encodes a correct or incorrect response, respectively.

Note

The dataset pisa was obtained after dichotomizing the original multiple-choice or open format test data. Wording of the test items used in the assessment is not known (not available publicly).

Source

OECD Programme for International Student Assessment (PISA; http://www.pisa.oecd.org/)

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Population Inductive Item Tree Analysis

Description

pop_iita can be used to perform one of the three inductive item tree analysis algorithms (original, corrected, and minimized corrected) in population quantities (in a known population) selectively.

Usage

pop_iita(imp, ce, lg, items, dataset = NULL, A = NULL, v)
pop_iita(imp, ce, lg, items, dataset = NULL, A = NULL, v)

Arguments

`imp`	a required object of class `set` representing the underlying set of implications (assumed to be a quasi order), for instance obtained from a call to `iita`.
`ce`	a required numeric giving the probability for a careless error.
`lg`	a required numeric giving the probability for a lucky guess.
`items`	a required numeric giving the number of items of the domain taken as basis for `imp`.
`dataset`	an optional data frame or matrix consisting of binary, $1$ or $0$ , numeric data. The default `dataset = NULL` corresponds to no dataset being used.
`A`	an optional list of competing quasi orders (surmise relations).
`v`	a required numeric giving the inductive item tree analysis algorithm to be performed, in population quantities; `v = 1` (minimized corrected), `v = 2` (corrected), and `v = 3` (original).

Details

The three inductive item tree analysis algorithms are exploratory methods for extracting quasi orders (surmise relations) from data. In each algorithm, competing binary relations are generated (in the same way for all three versions), and a fit measure (differing from version to version) is computed for every relation of the selection set in order to find the quasi order that fits the data best. In all three algorithms, the idea is to estimate the numbers of counterexamples for each quasi order, and to find, over all competing quasi orders, the minimum value for the discrepancy between the observed and expected numbers of counterexamples. The three data analysis methods differ in their choices of estimates for the expected numbers of counterexamples. For details see iita. The algorithms are described in the paper about the DAKS package by Uenlue and Sargin (2010), and in the paper by Sargin and Uenlue (2009).

Compared to iita, the function pop_iita implements the three inductive item tree analysis algorithms in population, not sample, quantities. The argument imp must give a quasi order, and equipped with the error probabilities ce and lg, it is considered a special case of the basic local independence model (Doignon and Falmagne, 1999). The latter then is considered as the underlying population model. If dataset = NULL a set of competing quasi orders is constructed based on a population analog of the inductive generation procedure implemented in sample quantities in ind_gen. If a dataset is specified explicitly, that data are used to generate the set of competing quasi orders based on the sample version of the inductive generation procedure.

The data (in dataset) must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments imp, ce, lg, items, dataset, A, and v are of required types, pop_iita returns a named list consisting of the following five components:

`pop.diff`	a vector giving the population diff values corresponding to the (inductively generated) competing quasi orders (subject to selected version that was performed).
`pop.matrix`	a matrix of all possible response patterns and their corresponding population occurrence probabilities.
`error.pop`	a vector of the population $\gamma$ rates corresponding to the (inductively generated) competing quasi orders (subject to selected version that was performed).
`selection.set`	a list of the (inductively generated) competing quasi orders.
`v`	the version used; `v = 1` (minimized corrected), `v = 2` (corrected), and `v = 3` (original).

Note

The single careless error ce and lucky guess lg probabilities are assumed to be constant over all items. The most general case that can be specified thus includes two error probabilities, which are the same for all items.

The sample diff coefficients of the three inductive item tree analysis algorithms can be transformed into maximum likelihood estimators, by division through the square of sample size. These transformed diff coefficients are considered in population quantities. The $\gamma$ rates are the algorithms' specific estimates of the postulated response error probability.

Population and estimated asymptotic variances of the maximum likelihood estimators diff are implemented in the functions pop_variance and variance, respectively.

Author(s)

Anatol Sargin, Ali Uenlue

References

Doignon, J.-P. and Falmagne, J.-C. (1999) Knowledge Spaces. Berlin, Heidelberg, and New York: Springer-Verlag.

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

x <- simu(3, 10000, ce = 0.05, lg = 0.05, delta = 0.12)
y <- iita(x$dataset, v = 2)
z <- pop_iita(x$implications, 0.05, 0.05, 3, x$dataset, v = 2)

## similar sample and population diff values are obtained
(y$diff) / (10000^2)
z
x <- simu(3, 10000, ce = 0.05, lg = 0.05, delta = 0.12)
y <- iita(x$dataset, v = 2)
z <- pop_iita(x$implications, 0.05, 0.05, 3, x$dataset, v = 2)

## similar sample and population diff values are obtained
(y$diff) / (10000^2)
z

Population Asymptotic Variance

Description

pop_variance computes the population (exact) asymptotic variances of the maximum likelihood estimators diff, assuming a multinomial probability distribution on the set of all response patterns.

Usage

pop_variance(pop_matrix, imp, error_pop, v)
pop_variance(pop_matrix, imp, error_pop, v)

Arguments

`pop_matrix`	a required matrix of all possible response patterns and their corresponding population occurrence probabilities, for instance obtained from a call to `pop_iita`.
`imp`	a required object of class `set` representing the set of implications (ought to be a quasi order) for which diff is computed, for instance obtained from a call to `pop_iita`.
`error_pop`	a required numeric giving the $\gamma$ rate to be used for computing diff, for instance obtained from a call to `pop_iita`.
`v`	a required numeric giving the inductive item tree analysis algorithm to be performed, in population quantities; `v = 1` (minimized corrected) and `v = 2` (corrected).

Details

Subject to the selected version to be performed, pop_variance computes the population asymptotic variance of the maximum likelihood estimator diff, which here is formulated for the relation specified in imp and for the $\gamma$ rate in error_pop. This population variance is obtained using the delta method, which requires calculating the Jacobian matrix of the diff coefficient and the inverse of the expected Fisher information matrix for the multinomial distribution with cell probabilities as specified in pop_matrix.

Value

If the arguments pop_matrix, imp, error_pop, and v are of required types, pop_variance returns a numeric giving the population asymptotic variance of the maximum likelihood estimator diff (formulated for the relation in imp and the $\gamma$ rate in error_pop).

Note

The current version of the package DAKS does not support computing population asymptotic variances for the original inductive item tree analysis algorithm; population asymptotic variances can be calculated only for the corrected and minimized corrected algorithms.

Estimated asymptotic variances of the maximum likelihood estimators diff are implemented in the function variance.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

## Not run: 
x <- simu(5, 100, 0.05, 0.05, delta = 0.15)
y <- pop_iita(x$implications, 0.05, 0.05, 5, x$dataset, v = 2)
pop_variance(y$pop.matrix, 
             y$selection.set[[which(y$pop.diff == min(y$pop.diff))]],
             y$error.pop[which(y$pop.diff == min(y$pop.diff))], v = 2)

## End(Not run)
## Not run: 
x <- simu(5, 100, 0.05, 0.05, delta = 0.15)
y <- pop_iita(x$implications, 0.05, 0.05, 5, x$dataset, v = 2)
pop_variance(y$pop.matrix, 
             y$selection.set[[which(y$pop.diff == min(y$pop.diff))]],
             y$error.pop[which(y$pop.diff == min(y$pop.diff))], v = 2)

## End(Not run)

Print Method for Objects of Class iita

Description

S3 method to print objects of class iita.

Usage

## S3 method for class 'iita'
print(x, ...)
## S3 method for class 'iita'
print(x, ...)

Arguments

`x`	a required object of class `iita`, obtained from a call to the function iita.
`...`	further arguments to be passed to or from other methods.

Details

Prints the main results from inductive item tree analysis algorithms.

Value

If the argument x is of required type, print.iita prints the set of implications.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Print Method for Objects of Class pat

Description

S3 method to print objects of class pat.

Usage

## S3 method for class 'pat'
print(x, ...)
## S3 method for class 'pat'
print(x, ...)

Arguments

`x`	a required object of class `pat`, obtained from a call to the function `pattern`.
`...`	further arguments to be passed to or from other methods.

Details

Prints the main results from inductive item tree analysis algorithms.

Value

If the argument x is of required type, print.pat prints the response patterns with the $n$ highest frequencies in the dataset and optionally returns the absolute frequencies of specified knowledge states in the dataset.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Print Method for Objects of Class popiita

Description

S3 method to print objects of class popiita.

Usage

## S3 method for class 'popiita'
print(x, ...)
## S3 method for class 'popiita'
print(x, ...)

Arguments

`x`	a required object of class `popiita`, obtained from a call to the function `pop_iita`.
`...`	further arguments to be passed to or from other methods.

Details

Prints the main results from inductive item tree analysis algorithms in population values.

Value

If the argument x is of required type, print.popiita prints a vector of population diff values, a vector of population error rates, and the quasi order with minimum population diff value.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Print Method for Objects of Class summpopiita

Description

S3 method to print objects of class summpopiita.

Usage

## S3 method for class 'summpopiita'
print(x, ...)
## S3 method for class 'summpopiita'
print(x, ...)

Arguments

`x`	a required object of class `summpopiita`, obtained from a call to the function `summary.popiita`.
`...`	further arguments to be passed to or from other methods.

Details

Prints the main results from inductive item tree analysis algorithms in population values.

Value

If the argument x is of required type, print.summpopiita prints a vector of population diff values, a vector of population error rates, the population matrix, and the obtained selection set.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Print Method for Objects of Class ztest

Description

S3 method to print objects of class ztest.

Usage

## S3 method for class 'ztest'
print(x, ...)
## S3 method for class 'ztest'
print(x, ...)

Arguments

`x`	a required object of class `ztest`, obtained from a call to the function `z_test`.
`...`	further arguments to be passed to or from other methods.

Details

Prints the main results from inductive item tree analysis algorithms.

Value

If the argument x is of required type, print.ztest prints the Z- and p-value, the alternative hypothesis, the confidence interval, and the sample estimates.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376– 392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Data and Quasi Order Simulation Tool

Description

simu can be used to simulate binary, of type $1$ / $0$ , data using a basic local independence model. The number of items, the sample size, and two parameters for the careless error and lucky guess probabilities can be set explicitly. The underlying combinatorial structure used for simulating the data can either be specified manually or is generated randomly.

Usage

simu(items, size, ce, lg, imp = NULL, delta)
simu(items, size, ce, lg, imp = NULL, delta)

Arguments

`items`	a required numeric giving the number of items of the domain taken as basis for the simulation.
`size`	a required numeric giving the number of response patterns to be simulated (the sample size).
`ce`	a required numeric giving the probability for a careless error.
`lg`	a required numeric giving the probability for a lucky guess.
`imp`	an optional object of class `set` representing the underlying set of implications (assumed to be a quasi order) used for simulating the data, for instance obtained from a call to `iita`. The default `imp = NULL` corresponds to generating the quasi order used for simulating the data randomly.
`delta`	a required (if `imp = NULL`) numeric giving the probability for adding an item pair to the randomly generated quasi order (reflexive pairs are always included a priori).

Details

The function simu simulates data using a special case of the basic local independence model, which is a fundamental restricted latent class model in knowledge space theory (Doignon and Falmagne, 1999). The single careless error ce and lucky guess lg probabilities are assumed to be constant over all items. The most general case that can be specified thus includes two error probabilities at each item, the same two rates for all items. The general form of the basic local independence model allows for varying careless error and lucky guess rates from item to item (not identifiable in general, however).

If a quasi order is specified in imp explicitly, Birkhoff's theorem is used to derive its corresponding quasi ordinal knowledge space, which is equipped with the error probabilities ce and lg to give the basic local independence model used for simulating the data. If imp = NULL, the underlying quasi order is generated randomly as follows. All reflexive pairs are added to the relation. The constant specified in delta is utilized as the probability for adding each of the remaining non-reflexive item pairs to the relation. The transitive closure of this relation is computed, and the resulting quasi order is then the relation underlying the simulation.

The simulated dataset contains only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments items, size, ce, lg, imp, and delta are of required types, simu returns a named list consisting of the following three components:

`dataset`	a matrix of binary, $1$ or $0$ , numeric data.
`implications`	an object of class `set` representing the underlying set of implications (assumed to be a quasi order) used for simulating the data. If `imp = NULL`, this is the quasi order that was randomly generated; otherwise identical to the set of implications specified in the argument `imp`.
`states`	a matrix consisting of ones or zeros (the quasi ordinal knowledge space), in which each row represents the $1$ / $0$ -pattern of a knowledge state. This is the knowledge structure corresponding to the set of implications specified in `implications`.

Note

To pass a quasi order as the argument imp to simu it may be more convenient to transform from knowledge states to implications using the function state2imp.

The probability specified in delta does not necessarily correspond to the ratio of implications in the randomly generated quasi order, because the transitive closure is formed after having added item pairs. In Sargin and Uenlue (2009) a normal sampling scheme for drawing delta values using $\mu = 0.16$ and $\sigma = 0.06$ for nine items has been proposed. This sampling scheme provides far better representative samples of quasi orders than sampling delta values uniformly from the unit interval.

Author(s)

Anatol Sargin, Ali Uenlue

References

Doignon, J.-P. and Falmagne, J.-C. (1999) Knowledge Spaces. Berlin, Heidelberg, and New York: Springer-Verlag.

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

simu(7, 20, 0.1, 0.1, delta = 0.15)
simu(7, 20, 0.1, 0.1, delta = 0.15)

Transformation from Knowledge States to Implications

Description

state2imp transforms a set of knowledge states (ought to be a quasi ordinal knowledge space) to the corresponding set of implications (the surmise relation).

Usage

state2imp(P)
state2imp(P)

Arguments

`P`	a required matrix of ones and zeros giving the knowledge states to be used. Each row represents the $1$ / $0$ -pattern of a knowledge state.

Value

If the argument P is of required type, state2imp returns an object of class set (the surmise relation) representing the set of implications.

Note

For any set of knowledge states the returned binary relation is a surmise relation. In case of a quasi ordinal knowledge space this is Birkhoff's theorem. For details refer to Doignon and Falmagne (1999, Theorem 1.49).

Author(s)

Anatol Sargin, Ali Uenlue

References

Doignon, J.-P. and Falmagne, J.-C. (1999) Knowledge Spaces. Berlin, Heidelberg, and New York: Springer-Verlag.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

## an arbitrary matrix of knowledge states is defined
x <- matrix(0, nrow = 5, ncol = 3)
x[1, ] <- c(0, 0, 0)
x[2, ] <- c(0, 0, 1)
x[3, ] <- c(0, 1, 0)
x[4, ] <- c(0, 1, 1)
x[5, ] <- c(1, 1, 1)

state2imp(x)
## an arbitrary matrix of knowledge states is defined
x <- matrix(0, nrow = 5, ncol = 3)
x[1, ] <- c(0, 0, 0)
x[2, ] <- c(0, 0, 1)
x[3, ] <- c(0, 1, 0)
x[4, ] <- c(0, 1, 1)
x[5, ] <- c(1, 1, 1)

state2imp(x)

Summary Method for Objects of Class iita

Description

S3 method to summarize objects of class iita.

Usage

## S3 method for class 'iita'
summary(object, ...)
## S3 method for class 'iita'
summary(object, ...)

Arguments

`object`	a required object of class `iita`, obtained from a call to the function `iita`.
`...`	further arguments to be passed to or from other methods.

Details

Summarizes the main results from inductive item tree analysis algorithms.

Value

If the argument object is of required type, summary.iita returns the vector of diff values, the quasi order, the error rate, and the index in the selection set of the quasi order with minimum diff value.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Summary Method for Objects of Class popiita

Description

S3 method to summarize objects of class popiita.

Usage

## S3 method for class 'popiita'
summary(object, ...)
## S3 method for class 'popiita'
summary(object, ...)

Arguments

`object`	a required object of class `popiita`, obtained from a call to the function `pop_iita`.
`...`	further arguments to be passed to or from other methods.

Details

Summarizes the main results from inductive item tree analysis algorithms in population values.

Value

If the argument object is of required type, summary.popiita returns the results obtained from pop_iita.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Estimated Asymptotic Variance

Description

variance computes estimated asymptotic variances of the maximum likelihood estimators diff from data, assuming a multinomial probability distribution on the set of all response patterns.

Usage

variance(dataset, imp, v)
variance(dataset, imp, v)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`imp`	a required object of class `set` representing the set of implications (ought to be a quasi order) for which diff is computed, for instance obtained from a call to `simu`.
`v`	a required numeric giving the inductive item tree analysis algorithm to be performed; `v = 1` (minimized corrected) and `v = 2` (corrected).

Details

Subject to the selected version to be performed, variance computes a consistent estimator for the population asymptotic variance of the maximum likelihood estimator diff, which here is formulated for the relation specified in imp and for the data in dataset. This estimated asymptotic variance is obtained using the delta method, which requires calculating the Jacobian matrix of the diff coefficient and the inverse of the expected Fisher information matrix for the multinomial distribution on the set of all response patterns. In the expression for the exact asymptotic variance, the true parameter vector of multinomial probabilities is estimated by its corresponding maximum likelihood estimate (vector of the relative frequencies of the response patterns).

The data must contain only ones and zeros, which encode solving or failing to solve an item, respectively.

Value

If the arguments dataset, imp, and v are of required types, variance returns a numeric giving the estimated asymptotic variance of the maximum likelihood estimator diff (formulated for the relation in imp and the data in dataset).

Note

The current version of the package DAKS does not support computing estimated asymptotic variances for the original inductive item tree analysis algorithm; population asymptotic variances can be estimated only for the corrected and minimized corrected algorithms.

The two types of estimators for the population asymptotic variances of the diff coefficients obtained using the expected Fisher information matrix on the one hand, and the observed Fisher information matrix on the other, yield the same result, in the case of the multinomial distribution. Since computation based on expected Fisher information is faster, this is implemented in variance.

Population (exact) asymptotic variances of the maximum likelihood estimators diff are implemented in the function pop_variance.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

x <- simu(5, 100, 0.05, 0.05, delta = 0.15)
variance(x$dataset, x$implications, v = 2)
x <- simu(5, 100, 0.05, 0.05, delta = 0.15)
variance(x$dataset, x$implications, v = 2)

One- and Two-Sample Z-Tests for diff Values

Description

z_test performs one- and two-sample Z-tests for the diff values.

Usage

z_test(dataset, imp, imp_alt = NULL,
       alternative = c("two.sided", "less", "greater"), mu = 0, 
       conf.level = 0.95, v)
z_test(dataset, imp, imp_alt = NULL,
       alternative = c("two.sided", "less", "greater"), mu = 0, 
       conf.level = 0.95, v)

Arguments

`dataset`	a required data frame or matrix consisting of binary, $1$ or $0$ , numeric data.
`imp`	a required object of class `set` representing the set of implications (ought to be a quasi order).
`imp_alt`	an optional set of implications, representing the alternative quasi order.
`alternative`	a character string specifying the alternative hypothesis, must be one of `"two.sided"` (default), `"less"`, or `"greater"`.
`mu`	a number indicating the true value of the mean (or difference in means if you are performing a two sample test).
`conf.level`	confidence level of the interval.
`v`	a required numeric giving the inductive item tree analysis algorithm to be performed; `v = 1` (minimized corrected) and `v = 2` (corrected).

Details

This function performs a Z-test for the diff values of one or two quasi orders.

Value

If the arguments are of required types, z_test returns a named list consisting of the following seven components:

`Z.value`	the value of the Z-statistic.
`p.value`	the p-value for the test.
`conf`	a confidence interval for the mean appropriate to the specified alternative hypothesis.
`diff_value`	the corresponding diff values for the used quasi orders according to the specified method.
`alternative`	a character string specifying the alternative hypothesis.
`mu`	a number indicating the true value of the mean (or difference in means if you are performing a two sample test).
`conf.level`	the level of the confidence interval.

Note

The current version of the package DAKS does not support performing a Z-test for the original inductive item tree analysis algorithm.

Author(s)

Anatol Sargin, Ali Uenlue

References

Sargin, A. and Uenlue, A. (2009) Inductive item tree analysis: Corrections, improvements, and comparisons. Mathematical Social Sciences, 58, 376–392.

Uenlue, A. and Sargin, A. (2010) DAKS: An R package for data analysis methods in knowledge space theory. Journal of Statistical Software, 37(2), 1–31. URL http://www.jstatsoft.org/v37/i02/.

Examples

sel_set<-ind_gen(ob_counter(pisa[, 1:3]))
z_test(pisa[, 1:3], sel_set[[2]], sel_set[[3]], v = 1)
sel_set<-ind_gen(ob_counter(pisa[, 1:3]))
z_test(pisa[, 1:3], sel_set[[2]], sel_set[[3]], v = 1)

Package 'DAKS'

Help Index

Data Analysis and Knowledge Spaces: The R Package DAKS

Description

Details

Author(s)

References

Corrected Inductive Item Tree Analysis

Description

Usage

Arguments

Details

Value

Note

Author(s)

References

See Also

Examples

Hasse Diagram of Surmise Relation

Description

Usage

Arguments

Value

Note

Author(s)

References

See Also

Examples

Inductive Item Tree Analysis

Description

Usage

Arguments

Details

Value

Note

Author(s)

References

See Also

Examples

Transformation from Implications to Knowledge States

Description

Usage

Arguments

Value

Note

Author(s)

References

See Also

Examples

Inductive Generation Procedure

Description

Usage

Arguments

Value

Note

Author(s)

References

See Also

Examples

Minimized Corrected Inductive Item Tree Analysis

Description

Usage

Arguments

Details

Value

Note

Author(s)

References

See Also

Examples

Computation of Numbers of Counterexamples

Description

Usage

Arguments

Details

Value

Note

Author(s)

References

See Also