SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional Images

Autonomous Vision Conference Paper 2018

Benjamin Coors (Autonomous Vision)

Andreas Geiger (Autonomous Vision, Perceiving Systems; Guest Scientist)

Omnidirectional cameras offer great benefits over classical cameras wherever a wide field of view is essential, such as in virtual reality applications or in autonomous robots. Unfortunately, standard convolutional neural networks are not well suited for this scenario as the natural projection surface is a sphere which cannot be unwrapped to a plane without introducing significant distortions, particularly in the polar regions. In this work, we present SphereNet, a novel deep learning framework which encodes invariance against such distortions explicitly into convolutional neural networks. Towards this goal, SphereNet adapts the sampling locations of the convolutional filters, effectively reversing distortions, and wraps the filters around the sphere. By building on regular convolutions, SphereNet enables the transfer of existing perspective convolutional neural network models to the omnidirectional case. We demonstrate the effectiveness of our method on the tasks of image classification and object detection, exploiting two newly created semi-synthetic and real-world omnidirectional datasets.
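To make the idea of adapting filter sampling locations more concrete, the following is a minimal sketch and not the authors' implementation. It assumes an equirectangular input and places a regular kernel grid on the plane tangent to the sphere at the filter centre, then maps it back to latitude and longitude with the standard inverse gnomonic projection. The function name `kernel_sampling_locations`, the parameter `delta` (angular step between neighbouring kernel taps), and the uniform `tan(delta)` grid spacing are all assumptions made for illustration.

```python
import numpy as np


def kernel_sampling_locations(lat0, lon0, delta, ksize=3):
    """Hypothetical helper: where a ksize x ksize filter centred at
    (lat0, lon0) would sample on the sphere, in radians.

    Assumption: the kernel is a regular grid on the tangent plane with
    spacing tan(delta), mapped to the sphere by inverse gnomonic projection.
    Returns two (ksize, ksize) arrays: latitudes and longitudes.
    """
    r = ksize // 2
    offsets = np.arange(-r, r + 1) * np.tan(delta)
    # x grows eastwards along columns; y grows northwards, so row 0 is north.
    x, y = np.meshgrid(offsets, offsets[::-1])

    rho = np.hypot(x, y)          # distance from the tangent point
    nu = np.arctan(rho)           # angular distance on the sphere
    safe_rho = np.where(rho == 0, 1.0, rho)  # avoid 0/0 at the kernel centre

    sin_lat = (np.cos(nu) * np.sin(lat0)
               + y * np.sin(nu) * np.cos(lat0) / safe_rho)
    lat = np.arcsin(np.clip(sin_lat, -1.0, 1.0))

    lon = lon0 + np.arctan2(
        x * np.sin(nu),
        rho * np.cos(lat0) * np.cos(nu) - y * np.sin(lat0) * np.sin(nu),
    )
    lon = np.where(rho == 0, lon0, lon)  # centre tap stays at the centre

    # Wrap longitudes into [-pi, pi) so taps continue across the image seam.
    lon = (lon + np.pi) % (2 * np.pi) - np.pi
    return lat, lon


if __name__ == "__main__":
    step = np.deg2rad(1.0)
    for lat_deg in (0.0, 80.0):
        lat, lon = kernel_sampling_locations(np.deg2rad(lat_deg), 0.0, step)
        print(f"centre latitude {lat_deg:.0f} deg, tap longitudes (deg):")
        print(np.round(np.rad2deg(lon), 2))
```

Under these assumptions, the taps near the equator stay close to a regular grid, while the taps near a pole spread out in longitude, which is the opposite of the stretching an equirectangular image applies there. The longitude wrap corresponds to the filters continuing across the left and right image borders. Sampling an actual image at these locations would additionally require converting the angles to pixel coordinates and interpolating, which is omitted here.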

Author(s):	Benjamin Coors and Alexandru Paul Condurache and Andreas Geiger
Book Title:	European Conference on Computer Vision (ECCV)
Year:	2018
Month:	September

Project(s):	SphereNet
Bibtex Type:	Conference Paper (conference)

Event Place:	Munich, Germany

Electronic Archiving:	grant_archive

Links:	pdf suppmat

BibTeX

@conference{Coors2018ECCV,
  title = {SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional Images},
  booktitle = {European Conference on Computer Vision (ECCV)},
  abstract = {Omnidirectional cameras offer great benefits over classical cameras wherever a wide field of view is essential, such as in virtual reality applications or in autonomous robots. Unfortunately, standard convolutional neural networks are not well suited for this scenario as the natural projection surface is a sphere which cannot be unwrapped to a plane without introducing significant distortions, particularly in the polar regions. In this work, we present SphereNet, a novel deep learning framework which encodes invariance against such distortions explicitly into convolutional neural networks. Towards this goal, SphereNet adapts the sampling locations of the convolutional filters, effectively reversing distortions, and wraps the filters around the sphere. By building on regular convolutions, SphereNet enables the transfer of existing perspective convolutional neural network models to the omnidirectional case. We demonstrate the effectiveness of our method on the tasks of image classification and object detection, exploiting two newly created semi-synthetic and real-world omnidirectional datasets.},
  month = sep,
  year = {2018},
  slug = {coors2018eccv},
  author = {Coors, Benjamin and Condurache, Alexandru Paul and Geiger, Andreas},
  month_numeric = {9}
}
