Empirical Inference Article 2003

Use of the Zero-Norm with Linear Models and Kernel Methods

We explore the use of the so-called zero-norm of the parameters of linear models in learning. Minimization of such a quantity has many uses in a machine learning context: for variable or feature selection, minimizing training error and ensuring sparsity in solutions. We derive a simple but practical method for achieving these goals and discuss its relationship to existing techniques of minimizing the zero-norm. The method boils down to implementing a simple modification of vanilla SVM, namely via an iterative multiplicative rescaling of the training data. Applications we investigate which aid our discussion include variable and feature selection on biological microarray data, and multicategory classification.

Author(s): Weston, J. and Elisseeff, A. and Schölkopf, B. and Tipping, M.
Journal: Journal of Machine Learning Research
Volume: 3
Pages: 1439-1461
Year: 2003
Month: March
Day: 0
Bibtex Type: Article (article)
Digital: 0
Electronic Archiving: grant_archive
Language: en
Organization: Max-Planck-Gesellschaft
School: Biologische Kybernetik
Links:

BibTex

@article{2207,
  title = {Use of the Zero-Norm with Linear Models and Kernel Methods},
  journal = {Journal of Machine Learning Research},
  abstract = {We explore the use of the so-called zero-norm of the parameters of linear models in learning. Minimization of such a quantity has many uses in a machine learning context: for variable or feature selection, minimizing training error and ensuring sparsity in solutions. We derive a simple but practical method for achieving these goals and discuss its relationship to existing techniques of minimizing the zero-norm. The method boils down to implementing a simple modification of vanilla SVM, namely via an iterative multiplicative rescaling of the training data. Applications we investigate which aid our discussion include variable and feature selection on biological microarray data, and multicategory classification.},
  volume = {3},
  pages = {1439-1461},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  month = mar,
  year = {2003},
  slug = {2207},
  author = {Weston, J. and Elisseeff, A. and Sch{\"o}lkopf, B. and Tipping, M.},
  month_numeric = {3}
}