Title: | Compute the Median Ranking(s) According to the Kemeny's Axiomatic Approach |
---|---|
Description: | Compute the median ranking according to the Kemeny's axiomatic approach. Rankings can or cannot contain ties, rankings can be both complete or incomplete. The package contains both branch-and-bound algorithms and heuristic solutions recently proposed. The searching space of the solution can either be restricted to the universe of the permutations or unrestricted to all possible ties. The package also provide some useful utilities for deal with preference rankings, including both element-weight Kemeny distance and correlation coefficient. This release declare as deprecated some functions that are still in the package for compatibility. Next release will not contains these functions. Please type '?ConsRank-deprecated' Essential references: Emond, E.J., and Mason, D.W. (2002) <doi:10.1002/mcda.313>; D'Ambrosio, A., Amodio, S., and Iorio, C. (2015) <doi:10.1285/i20705948v8n2p198>; Amodio, S., D'Ambrosio, A., and Siciliano R. (2016) <doi:10.1016/j.ejor.2015.08.048>; D'Ambrosio, A., Mazzeo, G., Iorio, C., and Siciliano, R. (2017) <doi:10.1016/j.cor.2017.01.017>; Albano, A., and Plaia, A. (2021) <doi:10.1285/i20705948v14n1p117>. |
Authors: | Antonio D'Ambrosio [aut, cre], Sonia Amodio [ctb], Giulio Mazzeo [ctb], Alessandro Albano [ctb], Antonella Plaia [ctb] |
Maintainer: | Antonio D'Ambrosio <[email protected]> |
License: | GPL-3 |
Version: | 2.1.4 |
Built: | 2024-12-02 06:40:02 UTC |
Source: | CRAN |
Compute the median ranking according to the Kemeny's axiomatic approach. Rankings can or cannot contain ties, rankings can be both complete or incomplete. The package contains both branch-and-bound and heuristic solutions as well as routines for computing the median constrained bucket order and the K-median cluster component analysis. The package also contains routines for visualize rankings and for detecting the universe of rankings including ties.
Package: | ConsRank |
Type: | Package |
Version: | 2.1.0 |
Date: | 2017-04-28 |
License: | GPL-3 |
Antonio D'Ambrosio [cre,aut] <[email protected]>, Sonia Amdio <[email protected]> [ctb], Giulio Mazzeo [ctb] <[email protected]>
Maintainer: Antonio D'Ambrosio <[email protected]>
Kemeny, J. G., & Snell, J. L. (1962). Mathematical models in the social sciences (Vol. 9). New York: Ginn.
Marden, J. I. (1996). Analyzing and modeling rank data. CRC Press.
Emond, E. J., & Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
D'Ambrosio, A. (2008). Tree based methods for data editing and preference rankings. Ph.D. thesis. http://www.fedoa.unina.it/id/eprint/2746
Heiser, W. J., & D'Ambrosio, A. (2013). Clustering and prediction of rankings within a Kemeny distance framework. In Algorithms from and for Nature and Life (pp. 19-31). Springer International Publishing.
Amodio, S., D'Ambrosio, A. & Siciliano, R (2016). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research, vol. 249(2).
D'Ambrosio, A., Amodio, S. & Iorio, C. (2015). Two algorithms for finding optimal solutions of the Kemeny rank aggregation problem for full rankings. Electronic Journal of Applied Statistical Analysis, vol. 8(2).
D'Ambrosio, A., Mazzeo, G., Iorio, C., & Siciliano, R. (2017). A differential evolution algorithm for finding the median ranking under the Kemeny axiomatic approach. Computers & Operations Research, vol. 82.
D'Ambrosio, A., & Heiser, W.J. (2019). A Distribution-free Soft Clustering Method for Preference Rankings. Behaviormetrika , vol. 46(2), pp. 333–351.
D'Ambrosio, A., Iorio, C., Staiano, M., & Siciliano, R. (2019). Median constrained bucket order rank aggregation. Computational Statitstics, vol. 34(2), pp. 787–802,
## load APA data set, full version data(APAFULL) ## Emond and Mason Branch-and-Bound algorithm. #CR=consrank(APAFULL) #use frequency tables #TR=tabulaterows(APAFULL) #quick algorithm #CR2=consrank(TR$X,wk=TR$Wk,algorithm="quick") #FAST algorithm #CR3=consrank(TR$X,wk=TR$Wk,algorithm="fast",itermax=10) #Decor algorithm #CR4=consrank(TR$X,wk=TR$Wk,algorithm="decor",itermax=10) ##################################### ### load sports data set #data(sports) ### FAST algorithm #CR=consrank(sports,algorithm="fast",itermax=10) ##################################### ####################################### ### load Emond and Mason data set #data(EMD) ### matrix X contains rankings #X=EMD[,1:15] ### vector Wk contains frequencies #Wk=EMD[,16] ### QUICK algorithm #CR=consrank(X,wk=Wk,algorithm="quick") #######################################
## load APA data set, full version data(APAFULL) ## Emond and Mason Branch-and-Bound algorithm. #CR=consrank(APAFULL) #use frequency tables #TR=tabulaterows(APAFULL) #quick algorithm #CR2=consrank(TR$X,wk=TR$Wk,algorithm="quick") #FAST algorithm #CR3=consrank(TR$X,wk=TR$Wk,algorithm="fast",itermax=10) #Decor algorithm #CR4=consrank(TR$X,wk=TR$Wk,algorithm="decor",itermax=10) ##################################### ### load sports data set #data(sports) ### FAST algorithm #CR=consrank(sports,algorithm="fast",itermax=10) ##################################### ####################################### ### load Emond and Mason data set #data(EMD) ### matrix X contains rankings #X=EMD[,1:15] ### vector Wk contains frequencies #Wk=EMD[,16] ### QUICK algorithm #CR=consrank(X,wk=Wk,algorithm="quick") #######################################
The American Psychological Association dataset includes 15449 ballots of the election of the president in 1980, 5738 of which are complete rankings, in which the candidates are ranked from most to least favorite.
data(APAFULL)
data(APAFULL)
Diaconis, P. (1988). Group representations in probability and statistics. Lecture Notes-Monograph Series, i-192., pag. 96.
The American Psychological Association reduced dataset includes 5738 ballots of the election of the president in 1980, in which the candidates are ranked from most to least favorite.
data(APAred)
data(APAred)
Diaconis, P. (1988). Group representations in probability and statistics. Lecture Notes-Monograph Series, i-192., pag. 96.
Branch-and-bound algorithm to find consensus ranking as defined by D'Ambrosio et al. (2015). If the number of objects to be ranked is large (greater than 20 or 25), it can work for very long time. Use either QuickCons or FASTcons with the option FULL=TRUE instead
BBFULL(X, Wk = NULL, PS = TRUE)
BBFULL(X, Wk = NULL, PS = TRUE)
X |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. The data matrix can contain both full and tied rankings, or incomplete rankings. Alternatively X can contain the rankings observed only once. In this case the argument Wk must be used |
Wk |
Optional: the frequency of each ranking in the data |
PS |
If PS=TRUE, on the screen some information about how many branches are processed are displayed |
This function is deprecated and it will be removed in the next release of the package. Use function 'consrank' instead.
If the objects to be ranked is large (>25 - 30), it can take long time to find the solutions
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Antonio D'Ambrosio [email protected]
D'Ambrosio, A., Amodio, S., and Iorio, C. (2015). Two algorithms for finding optimal solutions of the Kemeny rank aggregation problem for full rankings. Electronic Journal of Applied Statistical Analysis, 8(2), 198-213.
#data(APAFULL) #CR=BBFULL(APAFULL)
#data(APAFULL) #CR=BBFULL(APAFULL)
The data consist of ballots of three candidates, where the 948 voters rank the candidates from 1 to 3. Data are in form of frequency table.
data(BU)
data(BU)
Brook, D., & Upton, G. J. G. (1974). Biases in local government elections due to position on the ballot paper. Applied Statistics, 414-419.
Marden, J. I. (1996). Analyzing and modeling rank data. CRC Press, pag. 153.
data(BU) polyplot(BU[,1:3],Wk=BU[,4])
data(BU) polyplot(BU[,1:3],Wk=BU[,4])
Compute the Combined input matrix of a data set as defined by Emond and Mason (2002)
combinpmatr(X, Wk = NULL)
combinpmatr(X, Wk = NULL)
X |
A data matrix N by M, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once. In this case the argument Wk must be used |
Wk |
Optional: the frequency of each ranking in the data |
The M by M combined input matrix
Antonio D'Ambrosio [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
tabulaterows
frequency distribution of a ranking data.
data(APAred) CI<-combinpmatr(APAred) TR<-tabulaterows(APAred) CI<-combinpmatr(TR$X,TR$Wk)
data(APAred) CI<-combinpmatr(APAred) TR<-tabulaterows(APAred) CI<-combinpmatr(TR$X,TR$Wk)
Branch-and-bound, Quick , FAST and DECOR algorithms to find consensus (median) ranking according to the Kemeny's axiomatic approach. The median ranking(s) can be restricted to be necessarily a full ranking, namely without ties
consrank( X, wk = NULL, ps = TRUE, algorithm = "BB", full = FALSE, itermax = 10, np = 15, gl = 100, ff = 0.4, cr = 0.9, proc = FALSE )
consrank( X, wk = NULL, ps = TRUE, algorithm = "BB", full = FALSE, itermax = 10, np = 15, gl = 100, ff = 0.4, cr = 0.9, proc = FALSE )
X |
A n by m data matrix, in which there are n judges and m objects to be judged. Each row is a ranking of the objects which are represented by the columns. If X contains the rankings observed only once, the argument wk can be used |
wk |
Optional: the frequency of each ranking in the data |
ps |
If PS=TRUE, on the screen some information about how many branches are processed are displayed. |
algorithm |
Specifies the used algorithm. One among "BB", "quick", "fast" and "decor". algorithm="BB" is the default option. |
full |
Specifies if the median ranking must be searched in the universe of rankings including all the possible ties (full=FALSE) or in the restricted space of full rankings (permutations). full=FALSE is the default option. |
itermax |
maximum number of iterations for FAST and DECOR algorithms. itermax=10 is the default option. |
np |
For DECOR algorithm only: the number of population individuals. np=15 is the default option. |
gl |
For DECOR algorithm only: generations limit, maximum number of consecutive generations without improvement. gl=100 is the default option. |
ff |
For DECOR algorithm only: the scaling rate for mutation. Must be in [0,1]. ff=0.4 is the default option. |
cr |
For DECOR algorithm only: the crossover range. Must be in [0,1]. cr=0.9 is the default option. |
proc |
For BB algorithm only: proc=TRUE allows the branch and bound algorithm to work in difficult cases, i.e. when the number of objects is larger than 15 or 25. proc=FALSE is the default option |
The BB algorithm can take long time to find the solutions if the number objects to be ranked is large with some missing (>15-20 if full=FALSE, <25-30 if full=TRUE). quick algorithm works with a large number of items to be ranked. The solution is quite accurate. fast algorithm works with a large number of items to be ranked by repeating several times the quick algorithm with different random starting points. decor algorithm works with a very large number of items to be ranked. For decor algorithm, empirical evidence shows that the number of population individuals (the 'np' parameter) can be set equal to 10, 20 or 30 for problems till 20, 50 and 100 items. Both scaling rate and crossover ratio (parameters 'ff' and 'cr') must be set by the user. The default options (ff=0.4, cr=0.9) work well for a large variety of data sets All algorithms allow the user to set the option 'full=TRUE' if the median ranking(s) must be searched in the restricted space of permutations instead of in the unconstrained universe of rankings of n items including all possible ties
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
#'
Antonio D'Ambrosio [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28. D'Ambrosio, A., Amodio, S., and Iorio, C. (2015). Two algorithms for finding optimal solutions of the Kemeny rank aggregation problem for full rankings. Electronic Journal of Applied Statistical Analysis, 8(2), 198-213. Amodio, S., D'Ambrosio, A. and Siciliano, R. (2016). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research, 249(2), 667-676. D'Ambrosio, A., Mazzeo, G., Iorio, C., and Siciliano, R. (2017). A differential evolution algorithm for finding the median ranking under the Kemeny axiomatic approach. Computers and Operations Research, vol. 82, pp. 126-138.
data(Idea) RevIdea<-6-Idea # as 5 means "most associated", it is necessary compute the reverse ranking of # each rankings to have rank 1 = "most associated" and rank 5 = "least associated" CR<-consrank(RevIdea) CR<-consrank(RevIdea,algorithm="quick") #CR<-consrank(RevIdea,algorithm="fast",itermax=10) #not run #data(EMD) #CRemd<-consrank(EMD[,1:15],wk=EMD[,16],algorithm="decor",itermax=1) #data(APAFULL) #CRapa<-consrank(APAFULL,full=TRUE)
data(Idea) RevIdea<-6-Idea # as 5 means "most associated", it is necessary compute the reverse ranking of # each rankings to have rank 1 = "most associated" and rank 5 = "least associated" CR<-consrank(RevIdea) CR<-consrank(RevIdea,algorithm="quick") #CR<-consrank(RevIdea,algorithm="fast",itermax=10) #not run #data(EMD) #CRemd<-consrank(EMD[,1:15],wk=EMD[,16],algorithm="decor",itermax=1) #data(APAFULL) #CRapa<-consrank(APAFULL,full=TRUE)
These functions still work but will be removed (defunct) in the next version.
All these functions are deprecated, and will be removed in the next release of this package. The functions still remain in the package for compatibility of ConsRank users
Differential evolution algorithm for median ranking detection. It works with full, tied and partial rankings. The solution con be constrained to be a full ranking or a tied ranking
DECOR(X, Wk = NULL, NP = 15, L = 100, FF = 0.4, CR = 0.9, FULL = FALSE)
DECOR(X, Wk = NULL, NP = 15, L = 100, FF = 0.4, CR = 0.9, FULL = FALSE)
X |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once. In this case the argument Wk must be used |
Wk |
Optional: the frequency of each ranking in the data |
NP |
The number of population individuals |
L |
Generations limit: maximum number of consecutive generations without improvement |
FF |
The scaling rate for mutation. Must be in [0,1] |
CR |
The crossover range. Must be in [0,1] |
FULL |
Default FULL=FALSE. If FULL=TRUE, the searching is limited to the space of full rankings. |
This function is deprecated and it will be removed in the next release of the package. Use function 'consrank' instead.
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Antonio D'Ambrosio [email protected] and Giulio Mazzeo [email protected]
D'Ambrosio, A., Mazzeo, G., Iorio, C., and Siciliano, R. (2017). A differential evolution algorithm for finding the median ranking under the Kemeny axiomatic approach. Computers and Operations Research, vol. 82, pp. 126-138.
#not run #data(EMD) #CR=DECOR(EMD[,1:15],EMD[,16])
#not run #data(EMD) #CR=DECOR(EMD[,1:15],EMD[,16])
Branch-and-bound algorithm to find consensus ranking as definned by Emond and Mason (2002). If the number of objects to be ranked is large (greater than 15 or 20, specially if there are missing rankings), it can work for very long time.
EMCons(X, Wk = NULL, PS = TRUE)
EMCons(X, Wk = NULL, PS = TRUE)
X |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once. In this case the argument Wk must be used |
Wk |
Optional: the frequency of each ranking in the data |
PS |
If PS=TRUE, on the screen some information about how many branches are processed are displayed |
This function is deprecated and it will be removed in the next release of the package. Use function 'consrank' instead.
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Antonio D'Ambrosio [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
data(Idea) RevIdea=6-Idea # as 5 means "most associated", it is necessary compute the reverse ranking of # each rankings to have rank 1 = "most associated" and rank 5 = "least associated" CR=EMCons(RevIdea)
data(Idea) RevIdea=6-Idea # as 5 means "most associated", it is necessary compute the reverse ranking of # each rankings to have rank 1 = "most associated" and rank 5 = "least associated" CR=EMCons(RevIdea)
Data simuated by Emond and Mason to check their branch-and-bound algorithm. There are 112 voters ranking 15 objects. There are 21 uncomplete rankings. Data are in form of frequency table.
data(EMD)
data(EMD)
Emond, E. J., & Mason, D. W. (2000). A new technique for high level decision support. Department of National Defence, Operational Research Division, pag. 28.
Emond, E. J., & Mason, D. W. (2000). A new technique for high level decision support. Department of National Defence, Operational Research Division, pag. 28.
data(EMD) CR=consrank(EMD[,1:15],EMD[,16],algorithm="quick")
data(EMD) CR=consrank(EMD[,1:15],EMD[,16],algorithm="quick")
FAST algorithm to find consensus (median) ranking.
FAST algorithm to find consensus (median) ranking defined by Amodio, D'Ambrosio and Siciliano (2016). It returns at least one of the solutions. If there are multiple solutions, sometimes it returns all the solutions, sometimes it returns some solutions, always it returns at least one solution.
FASTcons(X, Wk = NULL, maxiter = 50, FULL = FALSE, PS = FALSE)
FASTcons(X, Wk = NULL, maxiter = 50, FULL = FALSE, PS = FALSE)
X |
is a ranking data matrix |
Wk |
is a vector of weights |
maxiter |
maximum number of iterations: default = 50. |
FULL |
Default FULL=FALSE. If FULL=TRUE, the searching is limited to the space of full rankings. |
PS |
Default PS=FALSE. If PS=TRUE the number of current iteration is diplayed |
This function is deprecated and it will be removed in the next release of the package. Use function 'consrank' instead.
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Antonio D'Ambrosio [email protected] and Sonia Amodio [email protected]
Amodio, S., D'Ambrosio, A. and Siciliano, R. (2016). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research, 249(2), 667-676.
EMCons
Emond and Mason branch-and-bound algorithm.
QuickCons
Quick algorithm.
##data(EMD) ##X=EMD[,1:15] ##Wk=matrix(EMD[,16],nrow=nrow(X)) ##CR=FASTcons(X,Wk,maxiter=100) ##These lines produce all the three solutions in less than a minute. data(sports) CR=FASTcons(sports,maxiter=5)
##data(EMD) ##X=EMD[,1:15] ##Wk=matrix(EMD[,16],nrow=nrow(X)) ##CR=FASTcons(X,Wk,maxiter=100) ##These lines produce all the three solutions in less than a minute. data(sports) CR=FASTcons(sports,maxiter=5)
FAST algorithm repeats DECOR a prespecified number of time. It returns the best solutions among the iterations
FASTDECOR( X, Wk = NULL, maxiter = 10, NP = 15, L = 100, FF = 0.4, CR = 0.9, FULL = FALSE, PS = TRUE )
FASTDECOR( X, Wk = NULL, maxiter = 10, NP = 15, L = 100, FF = 0.4, CR = 0.9, FULL = FALSE, PS = TRUE )
X |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once. In this case the argument Wk must be used |
Wk |
Optional: the frequency of each ranking in the data |
maxiter |
maximum number of iterations. Default 10 |
NP |
The number of population individuals |
L |
Generations limit: maximum number of consecutive generations without improvement |
FF |
The scaling rate for mutation. Must be in [0,1] |
CR |
The crossover range. Must be in [0,1] |
FULL |
Default FULL=FALSE. If FULL=TRUE, the searching is limited to the space of full rankings. In this case, the data matrix must contain full rankings. |
PS |
Default PS=TRUE. If PS=TRUE the number of a multiple of 5 iterations is diplayed |
This function is deprecated and it will be removed in the next release of the package. Use function 'consrank' instead.
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Antonio D'Ambrosio [email protected] and Giulio Mazzeo [email protected]
D'Ambrosio, A., Mazzeo, G., Iorio, C., and Siciliano, R. (2017). A differential evolution algorithm for finding the median ranking under the Kemeny axiomatic approach. Computers and Operations Research, vol. 82, pp. 126-138.
#data(EMD) #CR=FASTDECOR(EMD[,1:15],EMD[,16])
#data(EMD) #CR=FASTDECOR(EMD[,1:15],EMD[,16])
Ranking data of 2262 German respondents about the desirability of the four political goals: a = the maintenance of order in the nation; b = giving people more say in the decisions of government; c = growthing rising prices; d = protecting freedom of speech
data(German)
data(German)
Croon, M. A. (1989). Latent class models for the analysis of rankings. Advances in psychology, 60, 99-121.
data(German) TR=tabulaterows(German) polyplot(TR$X,Wk=TR$Wk,nobj=4)
data(German) TR=tabulaterows(German) polyplot(TR$X,Wk=TR$Wk,nobj=4)
98 college students where asked to rank five words, (thought, play, theory, dream, attention) regarding its association with the word idea, from 5=most associated to 1=least associated.
data(Idea)
data(Idea)
Fligner, M. A., & Verducci, J. S. (1986). Distance based ranking models. Journal of the Royal Statistical Society. Series B (Methodological), 359-369.
data(Idea) revIdea=6-Idea TR=tabulaterows(revIdea) CR=consrank(TR$X,wk=TR$Wk,algorithm="quick") colnames(CR$Consensus)=colnames(Idea)
data(Idea) revIdea=6-Idea TR=tabulaterows(revIdea) CR=consrank(TR$X,wk=TR$Wk,algorithm="quick") colnames(CR$Consensus)=colnames(Idea)
Compute the item-weighted Kemeny distance of a data matrix containing preference rankings, or compute the kemeny distance between two (matrices containing) rankings.
iw_kemenyd(x, y = NULL, w)
iw_kemenyd(x, y = NULL, w)
x |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. If there is only x as input, the output is a square distance matrix |
y |
A row vector, or a N by M data matrix in which there are N judges and the same M objects as x to be judged. |
w |
A M-dimensional row vector (individually weighted items), or a M by M matrix (item similarities) |
If there is only x as input, d = square distance matrix. If there is also y as input, d = matrix with N rows and n columns.
Alessandro Albano [email protected]
Antonella Plaia [email protected]
Kemeny, J. G., & Snell, L. J. (1962). Preference ranking: an axiomatic approach. Mathematical models in the social sciences, 9-23.
Albano, A. and Plaia, A. (2021) Element weighted Kemeny distance for ranking data. Electronic Journal of Applied Statistical Analysis, doi: 10.1285/i20705948v14n1p117
iw_tau_x
item-weighted tau_x rank correlation coefficient
kemenyd
Kemeny distance
#Individually weighted items data("German") w=c(10,5,5,10) iw_kemenyd(x= German[c(1,200,300,500),],w= w) iw_kemenyd(x= German[1,],y=German[400,],w= w) #Item similarity weights data(sports) P=matrix(NA,nrow=7,ncol=7) P[1,]=c(0,5,5,10,10,10,10) P[2,]=c(5,0,5,10,10,10,10) P[3,]=c(5,5,0,10,10,10,10) P[4,]=c(10,10,10,0,5,5,5) P[5,]=c(10,10,10,5,0,5,5) P[6,]=c(10,10,10,5,5,0,5) P[7,]=c(10,10,10,5,5,5,0) iw_kemenyd(x=sports[c(1,3,5,7),], w= P) iw_kemenyd(x=sports[1,],y=sports[100,], w= P)
#Individually weighted items data("German") w=c(10,5,5,10) iw_kemenyd(x= German[c(1,200,300,500),],w= w) iw_kemenyd(x= German[1,],y=German[400,],w= w) #Item similarity weights data(sports) P=matrix(NA,nrow=7,ncol=7) P[1,]=c(0,5,5,10,10,10,10) P[2,]=c(5,0,5,10,10,10,10) P[3,]=c(5,5,0,10,10,10,10) P[4,]=c(10,10,10,0,5,5,5) P[5,]=c(10,10,10,5,0,5,5) P[6,]=c(10,10,10,5,5,0,5) P[7,]=c(10,10,10,5,5,5,0) iw_kemenyd(x=sports[c(1,3,5,7),], w= P) iw_kemenyd(x=sports[1,],y=sports[100,], w= P)
Compute the item-weighted TauX rank correlation coefficient of a data matrix containing preference rankings, or compute the item-weighted correlation coefficient between two (matrices containing) rankings.
iw_tau_x(x, y = NULL, w)
iw_tau_x(x, y = NULL, w)
x |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. If there is only x as input, the output is a square matrix |
y |
A row vector, or a N by M data matrix in which there are N judges and the same M objects as x to be judged. |
w |
A M-dimensional row vector (individually weighted items), or a M by M matrix (item similarities) |
Item-weighted TauX rank correlation coefficient
Alessandro Albano [email protected]
Antonella Plaia [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
Albano, A. and Plaia, A. (2021) Element weighted Kemeny distance for ranking data. Electronic Journal of Applied Statistical Analysis, doi: 10.1285/i20705948v14n1p117
tau_x
TauX rank correlation coefficient
iw_kemenyd
item-weighted Kemeny distance
#Individually weighted items data("German") w=c(10,5,5,10) iw_tau_x(x= German[c(1,200,300,500),],w= w) iw_tau_x(x= German[1,],y=German[400,],w= w) #Item similarity weights data(sports) P=matrix(NA,nrow=7,ncol=7) P[1,]=c(0,5,5,10,10,10,10) P[2,]=c(5,0,5,10,10,10,10) P[3,]=c(5,5,0,10,10,10,10) P[4,]=c(10,10,10,0,5,5,5) P[5,]=c(10,10,10,5,0,5,5) P[6,]=c(10,10,10,5,5,0,5) P[7,]=c(10,10,10,5,5,5,0) iw_tau_x(x=sports[c(1,3,5,7),], w= P) iw_tau_x(x=sports[1,],y=sports[100,], w= P)
#Individually weighted items data("German") w=c(10,5,5,10) iw_tau_x(x= German[c(1,200,300,500),],w= w) iw_tau_x(x= German[1,],y=German[400,],w= w) #Item similarity weights data(sports) P=matrix(NA,nrow=7,ncol=7) P[1,]=c(0,5,5,10,10,10,10) P[2,]=c(5,0,5,10,10,10,10) P[3,]=c(5,5,0,10,10,10,10) P[4,]=c(10,10,10,0,5,5,5) P[5,]=c(10,10,10,5,0,5,5) P[6,]=c(10,10,10,5,5,0,5) P[7,]=c(10,10,10,5,5,5,0) iw_tau_x(x=sports[c(1,3,5,7),], w= P) iw_tau_x(x=sports[1,],y=sports[100,], w= P)
Compute the item-weighted Combined input matrix of a data set as defined by Albano and Plaia (2021)
iwcombinpmatr(X, w, Wk = NULL)
iwcombinpmatr(X, w, Wk = NULL)
X |
A data matrix N by M, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once. In this case the argument Wk must be used |
w |
A M-dimensional row vector (individually weighted items), or a M by M matrix (item similarities) |
Wk |
Optional: the frequency of each ranking in the data |
The M by M item-weighted combined input matrix
Alessandro Albano [email protected]
Antonella Plaia [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
Albano, A. and Plaia, A. (2021). Element weighted Kemeny distance for ranking data. Electronic Journal of Applied Statistical Analysis, doi: 10.1285/i20705948v14n1p117
tabulaterows
frequency distribution of a ranking data.
combinpmatr
combined input matrix of a ranking data set.
data(sports) np <- dim(sports)[2] P <- matrix(NA,nrow=np,ncol=np) P[1,] <- c(0,5,5,10,10,10,10) P[2,] <- c(5,0,5,10,10,10,10) P[3,] <- c(5,5,0,10,10,10,10) P[4,] <- c(10,10,10,0,5,5,5) P[5,] <- c(10,10,10,5,0,5,5) P[6,] <- c(10,10,10,5,5,0,5) P[7,] <- c(10,10,10,5,5,5,0) CIW <- iwcombinpmatr(sports,w=P)
data(sports) np <- dim(sports)[2] P <- matrix(NA,nrow=np,ncol=np) P[1,] <- c(0,5,5,10,10,10,10) P[2,] <- c(5,0,5,10,10,10,10) P[3,] <- c(5,5,0,10,10,10,10) P[4,] <- c(10,10,10,0,5,5,5) P[5,] <- c(10,10,10,5,0,5,5) P[6,] <- c(10,10,10,5,5,0,5) P[7,] <- c(10,10,10,5,5,5,0) CIW <- iwcombinpmatr(sports,w=P)
The item-weighted Quick algorithm finds up to 4 solutions. Solutions reached are most of the time optimal solutions.
iwquickcons(X, w, Wk = NULL, full = FALSE, PS = FALSE)
iwquickcons(X, w, Wk = NULL, full = FALSE, PS = FALSE)
X |
A N by M data matrix in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once in the sample. In this case the argument Wk must be used |
w |
A M-dimensional row vector (individually weighted items), or a M by M matrix (item similarities) |
Wk |
Optional: the frequency of each ranking in the data |
full |
Default full=FALSE. If full=TRUE, the searching is limited to the space of full rankings. |
PS |
Default PS=FALSE. If PS=TRUE the number of evaluated branches is diplayed |
The item-weigthed Quick algorithm finds up the consensus (median) ranking according to the Kemeny's axiomatic approach. The median ranking(s) can be restricted to be necessarily a full ranking, namely without ties.
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged item-weighted TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Alessandro Albano [email protected]
Antonella Plaia [email protected]
Amodio, S., D'Ambrosio, A. and Siciliano, R. (2016). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research, 249(2), 667-676.
Albano, A. and Plaia, A. (2021). Element weighted Kemeny distance for ranking data. Electronic Journal of Applied Statistical Analysis, doi: 10.1285/i20705948v14n1p117
#Individually weighted items data("German") w=c(10,5,5,10) iwquickcons(X= German,w= w) #Item similirity weights data(sports) dim(sports) P=matrix(NA,nrow=7,ncol=7) P[1,]=c(0,5,5,10,10,10,10) P[2,]=c(5,0,5,10,10,10,10) P[3,]=c(5,5,0,10,10,10,10) P[4,]=c(10,10,10,0,5,5,5) P[5,]=c(10,10,10,5,0,5,5) P[6,]=c(10,10,10,5,5,0,5) P[7,]=c(10,10,10,5,5,5,0) iwquickcons(X= sports, w= P)
#Individually weighted items data("German") w=c(10,5,5,10) iwquickcons(X= German,w= w) #Item similirity weights data(sports) dim(sports) P=matrix(NA,nrow=7,ncol=7) P[1,]=c(0,5,5,10,10,10,10) P[2,]=c(5,0,5,10,10,10,10) P[3,]=c(5,5,0,10,10,10,10) P[4,]=c(10,10,10,0,5,5,5) P[5,]=c(10,10,10,5,0,5,5) P[6,]=c(10,10,10,5,5,0,5) P[7,]=c(10,10,10,5,5,5,0) iwquickcons(X= sports, w= P)
Compute the Kemeny distance of a data matrix containing preference rankings, or compute the kemeny distance between two (matrices containing) rankings.
kemenyd(X, Y = NULL)
kemenyd(X, Y = NULL)
X |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. If there is only X as input, the output is a square distance matrix |
Y |
A row vector, or a n by M data matrix in which there are n judges and the same M objects as X to be judged. |
If there is only X as input, d = square distance matrix. If there is also Y as input, d = matrix with N rows and n columns.
Antonio D'Ambrosio [email protected]
Kemeny, J. G., & Snell, L. J. (1962). Preference ranking: an axiomatic approach. Mathematical models in the social sciences, 9-23.
tau_x
TauX rank correlation coefficient
iw_kemenyd
item-weighted Kemeny distance
data(Idea) RevIdea<-6-Idea ##as 5 means "most associated", it is necessary compute the reverse #ranking of each rankings to have rank 1 = "most associated" and rank 5 = "least associated" KD<-kemenyd(RevIdea) KD2<-kemenyd(RevIdea[1:10,],RevIdea[55,])
data(Idea) RevIdea<-6-Idea ##as 5 means "most associated", it is necessary compute the reverse #ranking of each rankings to have rank 1 = "most associated" and rank 5 = "least associated" KD<-kemenyd(RevIdea) KD2<-kemenyd(RevIdea[1:10,],RevIdea[55,])
Define a design matrix to compute Kemeny distance
kemenydesign(X)
kemenydesign(X)
X |
A N by M data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects represented by the columns. |
Design matrix
Antonio D'Ambrosio [email protected]
D'Ambrosio, A. (2008). Tree based methods for data editing and preference rankings. Unpublished PhD Thesis. Universita' degli Studi di Napoli Federico II.
Given a ranking, it computes the score matrix as defined by Emond and Mason (2002)
kemenyscore(X)
kemenyscore(X)
X |
a ranking (must be a row vector or, better, a matrix with one row and M columns) |
the M by M score matrix
Antonio D'Ambrosio [email protected]
Kemeny, J and Snell, L. (1962). Mathematical models in the social sciences.
scorematrix
The score matrix as defined by Emond and Mason (2002)
Y <- matrix(c(1,3,5,4,2),1,5) SM<-kemenyscore(Y) # Z<-c(1,2,3,2) SM2<-kemenyscore(Z)
Y <- matrix(c(1,3,5,4,2),1,5) SM<-kemenyscore(Y) # Z<-c(1,2,3,2) SM2<-kemenyscore(Z)
Given a ranking (or a matrix of rank data), transforms it into an ordering (or a ordering matrix)
labels(x, m, label = 1:m, labs)
labels(x, m, label = 1:m, labs)
x |
a ranking, or a n by m data matrix in which there are n judges ranking m objects |
m |
the number of objects |
label |
optional: the name of the objects |
labs |
labs = 1 displays the names of the objects if there is argument "label", otherwise displays the permutation of first m integer. labs = 2 is to be used only if the argument "label" is not defined. In such a case it displays the permutation of the first m letters |
This function is deprecated and it will be removed in the next release of the package. Use function 'rank2order' instead.
the ordering
Sonia Amodio [email protected]
data(Idea) TR=tabulaterows(Idea) Ord=labels(TR$X,ncol(Idea),colnames(Idea),labs=1) Ord2=labels(TR$X,ncol(Idea),labs=2) cbind(Ord,TR$Wk) cbind(Ord2,TR$Wk)
data(Idea) TR=tabulaterows(Idea) Ord=labels(TR$X,ncol(Idea),colnames(Idea),labs=1) Ord2=labels(TR$X,ncol(Idea),labs=2) cbind(Ord,TR$Wk) cbind(Ord2,TR$Wk)
From ordering to rank. IMPORTANT: check which symbol denotes tied rankings in the X matrix
order2rank(X, TO = "{", TC = "}")
order2rank(X, TO = "{", TC = "}")
X |
A ordering or a matrix containing orderings |
TO |
symbol indicating the start of a set of items ranked in a tie |
TC |
symbol indicating the end of a set of items ranked in a tie |
a ranking or a matrix of rankings:
R | ranking or matrix of rankings |
Antonio D'Ambrosio [email protected]
data(APAred) ord=rank2order(APAred) #transform rankings into orderings ran=order2rank(ord) #transform the orderings into rankings
data(APAred) ord=rank2order(APAred) #transform rankings into orderings ran=order2rank(ord) #transform the orderings into rankings
Generate all possible partitions of n items constrained into k non empty subsets. It does not generate the universe of rankings constrained into k buckets.
partitions(n, k = NULL, items = NULL, itemtype = "L")
partitions(n, k = NULL, items = NULL, itemtype = "L")
n |
a (integer) number denoting the number of items |
k |
The number of the non-empty subsets. Default value is NULL, in this case all the possible partitions are displayed |
items |
items: the items to be placed into the ordering matrix. Default are the first c small letters |
itemtype |
to be used only if items is not set. The default value is "L", namely letters. Any other symbol produces items as the first c integers |
If the objects to be ranked is large (>15-20) with some missing, it can take long time to find the solutions. If the searching space is limited to the space of full rankings (also incomplete rankings, but without ties), use the function BBFULL or the functions FASTcons and QuickCons with the option FULL=TRUE.
the ordering matrix (or vector)
Antonio D'Ambrosio [email protected]
stirling2
Stirling number of second kind.
rank2order
Convert rankings into orderings.
order2rank
Convert orderings into ranks.
univranks
Generate the universe of rankings given the input partition
X<-partitions(4,3) #shows all the ways to partition 4 items (say "a", "b", "c" and "d" into 3 non-empty subets #(i.e., into 3 buckets). The Stirling number of the second kind (4,3) indicates that there #are 6 ways. s2<-stirling2(4,3)$S X2<-order2rank(X) #it transform the ordering into ranking
X<-partitions(4,3) #shows all the ways to partition 4 items (say "a", "b", "c" and "d" into 3 non-empty subets #(i.e., into 3 buckets). The Stirling number of the second kind (4,3) indicates that there #are 6 ways. s2<-stirling2(4,3)$S X2<-order2rank(X) #it transform the ordering into ranking
Plot rankings a permutation polytope that is the geometrical space of preference rankings. The plot is available for 3 or for 4 objects
polyplot(X = NULL, L = NULL, Wk = NULL, nobj = 3)
polyplot(X = NULL, L = NULL, Wk = NULL, nobj = 3)
X |
the sample of rankings. Most of the time it is returned by tabulaterows |
L |
labels of the objects |
Wk |
frequency associated to each ranking |
nobj |
number of objects. It must be either 3 or 4 |
polyplot() plots the universe of 3 objecys. polyplot(nobj=4) plots the universe of 4 objecys.
the permutation polytope
Antonio D'Ambrosio [email protected] and Sonia Amodio [email protected]
Thompson, G. L. (1993). Generalized permutation polytopes and exploratory graphical methods for ranked data. The Annals of Statistics, 1401-1430. # Heiser, W. J., and D'Ambrosio, A. (2013). Clustering and prediction of rankings within a Kemeny distance framework. In Algorithms from and for Nature and Life (pp. 19-31). Springer International Publishing.
tabulaterows
frequency distribution for ranking data.
polyplot() #polyplot(nobj=4) data(BU) polyplot(BU[,1:3],Wk=BU[,4])
polyplot() #polyplot(nobj=4) data(BU) polyplot(BU[,1:3],Wk=BU[,4])
The Quick algorithm finds up to 4 solutions. Solutions reached are most of the time optimal solutions.
QuickCons(X, Wk = NULL, FULL = FALSE, PS = FALSE)
QuickCons(X, Wk = NULL, FULL = FALSE, PS = FALSE)
X |
A N by M data matrix in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. Alternatively X can contain the rankings observed only once in the sample. In this case the argument Wk must be used |
Wk |
Optional: the frequency of each ranking in the data |
FULL |
Default FULL=FALSE. If FULL=TRUE, the searching is limited to the space of full rankings. |
PS |
Default PS=FALSE. If PS=TRUE the number of evaluated branches is diplayed |
This function is deprecated and it will be removed in the next release of the package. Use function 'consrank' instead.
a "list" containing the following components:
Consensus | the Consensus Ranking | |
Tau | averaged TauX rank correlation coefficient | |
Eltime | Elapsed time in seconds |
Antonio D'Ambrosio [email protected]
Amodio, S., D'Ambrosio, A. and Siciliano, R. (2016). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research, 249(2), 667-676.
data(EMD) CR=QuickCons(EMD[,1:15],EMD[,16])
data(EMD) CR=QuickCons(EMD[,1:15],EMD[,16])
From ranking to ordering. IMPORTANT: check which symbol denotes tied rankings in the X matrix
rank2order(X, items = NULL, TO = "{", TC = "}", itemtype = "L")
rank2order(X, items = NULL, TO = "{", TC = "}", itemtype = "L")
X |
A ordering or a matrix containing orderings |
items |
items to be placed into the ordering matrix. Default are the |
TO |
symbol indicating the start of a set of items ranked in a tie |
TC |
symbol indicating the end of a set of items ranked in a tie |
itemtype |
to be used only if items=NULL. The default value is "L", namely |
a ordering or a matrix of orderings:
out | ranking or matrix of rankings |
Antonio D'Ambrosio [email protected]
data(APAred) ord<-rank2order(APAred)
data(APAred) ord<-rank2order(APAred)
Given a ranking of M objects (or a matrix with M columns), it reduces it in "natural" form (i.e., with integers from 1 to M)
reordering(X)
reordering(X)
X |
a ranking, or a ranking data matrix |
a ranking in natural form
Antonio D'Ambrosio [email protected]
Given a ranking, it computes the score matrix as defined by Emond and Mason (2002)
scorematrix(X)
scorematrix(X)
X |
a ranking (must be a row vector or, better, a matrix with one row and M columns) |
the M by M score matrix
Antonio D'Ambrosio [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
combinpmatr
The combined inut matrix
Y <- matrix(c(1,3,5,4,2),1,5) SM<-scorematrix(Y) # Z<-c(1,2,4,3) SM2<-scorematrix(Z)
Y <- matrix(c(1,3,5,4,2),1,5) SM<-scorematrix(Y) # Z<-c(1,2,4,3) SM2<-scorematrix(Z)
130 students at the University of Illinois ranked seven sports according to their preference (Baseball, Football, Basketball, Tennis, Cycling, Swimming, Jogging).
data(sports)
data(sports)
Marden, J. I. (1996). Analyzing and modeling rank data. CRC Press.
data(sports)
data(sports)
Denote the number of ways to partition a set of n objects into k non-empty subsets
stirling2(n, k)
stirling2(n, k)
n |
(integer): the number of the objects |
k |
(integer <=n): the number of the non-empty subsets (buckets) |
a "list" containing the following components:
S | the stirling number of the second kind | |
SM | a matrix showing, for each k (on the columns) in how many ways the n objects (on the rows) can be partitioned |
Antonio D'Ambrosio [email protected]
Comtet, L. (1974). Advanced Combinatorics: The art of finite and infinite expansions. D. Reidel, Dordrecth, The Netherlands.
parts<-stirling2(4,2)
parts<-stirling2(4,2)
Given a sample of preference rankings, it compute the frequency associated to each ranking
tabulaterows(X, miss = FALSE)
tabulaterows(X, miss = FALSE)
X |
a N by M data matrix containing N judges judging M objects |
miss |
TRUE if there are missing data (either partial or incomplete rankings): default: FALSE |
a "list" containing the following components:
X | the unique rankings | |
Wk | the frequency associated to each ranking | |
tabfreq | frequency table |
Antonio D'Ambrosio [email protected]
data(Idea) TR<-tabulaterows(Idea) FR<-TR$Wk/sum(TR$Wk) RF<-cbind(TR$X,FR) colnames(RF)<-c(colnames(Idea),"fi") #compute modal ranking maxfreq<-which(RF[,6]==max(RF[,6])) rank2order(RF[maxfreq,1:5],items=colnames(Idea)) # data(APAred) TR<-tabulaterows(APAred) # data(APAFULL) TR<-tabulaterows(APAFULL) CR1<-consrank(TR$X,wk=TR$Wk) CR2<-consrank(TR$X,wk=TR$Wk,algorithm="fast",itermax=15) CR3<-consrank(TR$X,wk=TR$Wk,algorithm="quick")
data(Idea) TR<-tabulaterows(Idea) FR<-TR$Wk/sum(TR$Wk) RF<-cbind(TR$X,FR) colnames(RF)<-c(colnames(Idea),"fi") #compute modal ranking maxfreq<-which(RF[,6]==max(RF[,6])) rank2order(RF[maxfreq,1:5],items=colnames(Idea)) # data(APAred) TR<-tabulaterows(APAred) # data(APAFULL) TR<-tabulaterows(APAFULL) CR1<-consrank(TR$X,wk=TR$Wk) CR2<-consrank(TR$X,wk=TR$Wk,algorithm="fast",itermax=15) CR3<-consrank(TR$X,wk=TR$Wk,algorithm="quick")
Tau exstension is a new rank correlation coefficient defined by Emond and Mason (2002)
tau_x(X, Y = NULL) Tau_X(X, Y = NULL)
tau_x(X, Y = NULL) Tau_X(X, Y = NULL)
X |
a M by N data matrix, in which there are N judges and M objects to be judged. Each row is a ranking of the objects which are represented by the columns. If there is only X as input, the output is a square matrix containing the Tau_X rcc. |
Y |
A row vector, or a n by M data matrix in which there are n judges and the same M objects as X to be judged. |
Tau_x rank correlation coefficient
Antonio D'Ambrosio [email protected]
Emond, E. J., and Mason, D. W. (2002). A new rank correlation coefficient with application to the consensus ranking problem. Journal of Multi-Criteria Decision Analysis, 11(1), 17-28.
kemenyd
Kemeny distance
iw_tau_x
item-weighted tau_x rank correlation coefficient
data(BU) RD<-BU[,1:3] Tau<-tau_x(RD) Tau1_3<-tau_x(RD[1,],RD[3,])
data(BU) RD<-BU[,1:3] Tau<-tau_x(RD) Tau1_3<-tau_x(RD[1,],RD[3,])
Generate the universe of rankings given the input partition
univranks(X, k = NULL, ordering = TRUE)
univranks(X, k = NULL, ordering = TRUE)
X |
A ranking, an ordering, a matrix of rankings, a matrix of orderings or a number |
k |
Optional: the number of the non-empty subsets. It has to be used only if X is anumber. The default value is NULL, In this case the universe of rankings with n=X items are computed |
ordering |
The universe of rankings must be returned as orderings (default) or rankings? |
The function should be used with small numbers because it can generate a large number of permutations. The use of X greater than 9, of X matrices with more than 9 columns as input is not reccomended.
a "list" containing the following components:
Runiv | The universe of rankings | |
Cuniv | A list containing: | |
R | The universe of rankings in terms of rankings; | |
Parts | for each ranking in input the produced rankings | |
Univinbuckets | the universe of rankings within each bucket |
Antonio D'Ambrosio [email protected]
stirling2
Stirling number of second kind.
rank2order
Convert rankings into orderings.
order2rank
Convert orderings into ranks.
partitions
Generate partitions of n items constrained into k non empty subsets.
S2<-stirling2(4,4)$SM[4,] #indicates in how many ways 4 objects #can be placed, respectively, into 1, 2, #3 or 4 non-empty subsets. CardConstr<-factorial(c(1,2,3,4))*S2 #the cardinality of rankings #constrained into 1, 2, 3 and 4 #buckets Card<-sum(CardConstr) #Cardinality of the universe of rankings with 4 #objects U<-univranks(4)$Runiv #the universe of rankings with four objects # we know that the universe counts 75 #different rankings Uk<-univranks(4,2)$Runiv #the universe of rankings of four objects #constrained into k=2 buckets, we know they are 14 Up<-univranks(c(1,4,3,1))$Runiv #the universe of rankings with 4 objects #for which the first and the fourth item #are tied
S2<-stirling2(4,4)$SM[4,] #indicates in how many ways 4 objects #can be placed, respectively, into 1, 2, #3 or 4 non-empty subsets. CardConstr<-factorial(c(1,2,3,4))*S2 #the cardinality of rankings #constrained into 1, 2, 3 and 4 #buckets Card<-sum(CardConstr) #Cardinality of the universe of rankings with 4 #objects U<-univranks(4)$Runiv #the universe of rankings with four objects # we know that the universe counts 75 #different rankings Uk<-univranks(4,2)$Runiv #the universe of rankings of four objects #constrained into k=2 buckets, we know they are 14 Up<-univranks(c(1,4,3,1))$Runiv #the universe of rankings with 4 objects #for which the first and the fourth item #are tied
Random subset of the rankings collected by O'Leary Morgan and Morgon (2010) on the 50 American States. The 368 number of items (the number of American States) is equal to 50, and the number of rankings is equal to 104. These data concern rankings of the 50 American States on three particular aspects: socio-demographic characteristics, health care expenditures and crime statistics.
data(USAranks)
data(USAranks)
Amodio, S., D'Ambrosio, A. & Siciliano, R (2015). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research. DOI: 10.1016/j.ejor.2015.08.048
O'Leary Morgan, K., Morgon, S., (2010). State Rankings 2010: A Statistical view of America; Crime State Ranking 2010: Crime Across America; Health Care State Rankings 2010: Health Care Across America. CQ Press.
data(USAranks)
data(USAranks)