Empirical Inference Talk 2004

Riemannian Geometry on Graphs and its Application to Ranking and Classification

We consider the problem of transductive inference. In many real-world problems, unlabeled data is far easier to obtain than labeled data. Hence transductive inference is very significant in many practical problems. According to Vapnik's point of view, one should predict the function value only on the given points directly rather than a function defined on the whole space, the latter being a more complicated problem. Inspired by this idea, we develop discrete calculus on finite discrete spaces, and then build discrete regularization. A family of transductive algorithms is naturally derived from this regularization framework. We validate the algorithms on both synthetic and real-world data from text/web categorization to bioinformatics problems. A significant by-product of this work is a powerful way of ranking data based on examples including images, documents, proteins and many other kinds of data.

Author(s): Zhou, D.
Year: 2004
Month: June
Day: 25
Bibtex Type: Talk (talk)
Digital: 0
Electronic Archiving: grant_archive
Event Place: DIMACS Working Group on The Mathematics of Web Search and Meta-Search, Bertorino, Italy
Organization: Max-Planck-Gesellschaft
School: Biologische Kybernetik
Links:

BibTex

@talk{2806,
  title = {Riemannian Geometry on Graphs and its Application to Ranking and
  Classification},
  abstract = {We consider the problem of  transductive inference. In many real-world problems, unlabeled data is far easier to obtain than labeled data. Hence  transductive inference is very significant in many practical problems. According to Vapnik's point of view, one should predict the function value only on the given points directly  rather than a function defined on the whole space, the latter being a more complicated problem. Inspired by this idea, we develop discrete calculus on finite discrete spaces,  and  then build discrete regularization. A family of transductive algorithms is naturally derived from this regularization framework. We validate the algorithms on both synthetic and real-world data from text/web categorization to bioinformatics problems. A significant by-product  of  this work is a powerful way of ranking data based on examples including  images, documents, proteins and many other kinds of data.},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  month = jun,
  year = {2004},
  slug = {2806},
  author = {Zhou, D.},
  month_numeric = {6}
}