A fast algorithm for learning a ranking function from large-scale data sets
- PMID: 18550900
- DOI: 10.1109/TPAMI.2007.70776
A fast algorithm for learning a ranking function from large-scale data sets
Abstract
We consider the problem of learning the ranking function that maximizes a generalization of the Wilcoxon-Mann-Whitney statistic on the training data. Relying on an $\epsilon$-accurate approximation for the error-function, we reduce the computational complexity of each iteration of a conjugate gradient algorithm for learning ranking functions from O(m2) to O(m2), where m is the number of training samples. Experiments on public benchmarks for ordinal regression and collaborative filtering indicate that the proposed algorithm is as accurate as the best available methods in terms of ranking accuracy, when the algorithms are trained on the same data. However, since it is several orders of magnitude faster than the current state-of-the-art approaches, it is able to leverage much larger training datasets.
Similar articles
-
Sparse multinomial logistic regression: fast algorithms and generalization bounds.IEEE Trans Pattern Anal Mach Intell. 2005 Jun;27(6):957-68. doi: 10.1109/TPAMI.2005.127. IEEE Trans Pattern Anal Mach Intell. 2005. PMID: 15943426
-
Scalable model-based clustering for large databases based on data summarization.IEEE Trans Pattern Anal Mach Intell. 2005 Nov;27(11):1710-9. doi: 10.1109/TPAMI.2005.226. IEEE Trans Pattern Anal Mach Intell. 2005. PMID: 16285371
-
Learning weighted metrics to minimize nearest-neighbor classification error.IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1100-10. doi: 10.1109/TPAMI.2006.145. IEEE Trans Pattern Anal Mach Intell. 2006. PMID: 16792099
-
Computational methods for predicting protein-protein interactions.Adv Biochem Eng Biotechnol. 2008;110:247-67. doi: 10.1007/10_2007_089. Adv Biochem Eng Biotechnol. 2008. PMID: 18202838 Review.
-
Computational intelligence approaches for pattern discovery in biological systems.Brief Bioinform. 2008 Jul;9(4):307-16. doi: 10.1093/bib/bbn021. Epub 2008 May 5. Brief Bioinform. 2008. PMID: 18460474 Review.
Cited by
-
Granular computing with multiple granular layers for brain big data processing.Brain Inform. 2014 Dec;1(1-4):1-10. doi: 10.1007/s40708-014-0001-z. Epub 2014 Sep 6. Brain Inform. 2014. PMID: 27747523 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources