We believe that the cluster assumption is key to successful semi-supervised learning. Based on this, we propose three semi-supervised algorithms: 1. deriving graph-based distances that emphasize low density regions between clusters, followed by training a standard SVM; 2. optimizing the Transductive SVM objective function, which places the decision boundary in low density regions, by gradient descent; 3. combining the first two to make maximum use of the cluster assumption. We compare with state-of-the-art algorithms and demonstrate superior accuracy for the latter two methods.
Author(s): | Chapelle, O. and Zien, A. |
Book Title: | AISTATS 2005 |
Journal: | Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics (AISTATS 2005) |
Pages: | 57-64 |
Year: | 2005 |
Month: | January |
Editors: | Cowell, R. and Ghahramani, Z. |
Bibtex Type: | Conference Paper (inproceedings) |
Event Name: | Tenth International Workshop on Artificial Intelligence and Statistics (AI & Statistics 2005) |
Event Place: | Barbados |
ISBN: | 0-9727358-1-X |
Language: | en |
Organization: | Max-Planck-Gesellschaft |
School: | Biologische Kybernetik |
BibTeX
@inproceedings{2899,
  title = {Semi-Supervised Classification by Low Density Separation},
  author = {Chapelle, O. and Zien, A.},
  journal = {Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics (AISTATS 2005)},
  booktitle = {AISTATS 2005},
  abstract = {We believe that the cluster assumption is key to successful semi-supervised learning. Based on this, we propose three semi-supervised algorithms: 1. deriving graph-based distances that emphasize low density regions between clusters, followed by training a standard SVM; 2. optimizing the Transductive SVM objective function, which places the decision boundary in low density regions, by gradient descent; 3. combining the first two to make maximum use of the cluster assumption. We compare with state-of-the-art algorithms and demonstrate superior accuracy for the latter two methods.},
  pages = {57--64},
  editor = {Cowell, R. and Ghahramani, Z.},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  month = jan,
  year = {2005},
  slug = {2899},
  month_numeric = {1}
}