Back
Dealing with large Diagonals in Kernel Matrices
In kernel methods, all the information about the training data is contained in the Gram matrix. If this matrix has large diagonal values, which arises for many types of kernels, then kernel methods do not perform well: We propose and test several methods for dealing with this problem by reducing the dynamic range of the matrix while preserving the positive definiteness of the Hessian of the quadratic programming problem that one has to solve when training a Support Vector Machine, which is a common kernel approach for pattern recognition.
@article{1866, title = {Dealing with large Diagonals in Kernel Matrices}, journal = {Annals of the Institute of Statistical Mathematics}, abstract = {In kernel methods, all the information about the training data is contained in the Gram matrix. If this matrix has large diagonal values, which arises for many types of kernels, then kernel methods do not perform well: We propose and test several methods for dealing with this problem by reducing the dynamic range of the matrix while preserving the positive definiteness of the Hessian of the quadratic programming problem that one has to solve when training a Support Vector Machine, which is a common kernel approach for pattern recognition.}, volume = {55}, number = {2}, pages = {391-408}, organization = {Max-Planck-Gesellschaft}, school = {Biologische Kybernetik}, month = jun, year = {2003}, slug = {1866}, author = {Weston, J. and Sch{\"o}lkopf, B. and Eskin, E. and Leslie, C. and Noble, WS.}, month_numeric = {6} }