Perceiving Systems Conference Paper 2021

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Entangledfeatures

Semantic segmentation is a problem which is getting more and more attention in the computer vision community. Nowadays, deep learning methods represent the state of the art to solve this problem, and the trend is to use deeper networks to get higher performance. The drawback with such models is a higher computational cost, which makes it difficult to integrate them on mobile robot platforms. In this work we want to explore how to obtain lighter deep learning models without compromising performance. To do so we will consider the features used in the 3D Entangled Forests algorithm and we will study the best strategies to integrate these within FuseNet deep network. Such new features allow us to shrink the network size without loosing performance, obtaining hence a lighter model which achieves state-of-the-art performance on the semantic segmentation task and represents an interesting alternative for mobile robotics applications, where computational power and energy are limited.

Author(s): Terreran, Matteo and Bonetto, Elia and Ghidoni, Stefano
Book Title: 2020 25th International Conference on Pattern Recognition (ICPR 2020)
Pages: 4634--4641
Year: 2021
Month: January
Day: 10-15
Publisher: IEEE
Bibtex Type: Conference Paper (conference)
Address: Piscataway, NJ
DOI: 10.1109/ICPR48806.2021.9412787
Event Name: 25th International Conference on Pattern Recognition (ICPR 2020)
Event Place: Milan, Italy
State: Published
URL: https://ieeexplore.ieee.org/abstract/document/9412787/authors#authors
Digital: True
Electronic Archiving: grant_archive
ISBN: 978-1-7281-8808-9

BibTex

@conference{Bonetto_2021_CD,
  title = {Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests},
  booktitle = {2020 25th International Conference on Pattern Recognition (ICPR 2020)},
  abstract = {Semantic segmentation is a problem which is getting more and more attention in the computer vision community. Nowadays, deep learning methods represent the state of the art to solve this problem, and the trend is to use deeper networks to get higher performance. The drawback with such models is a higher computational cost, which makes it difficult to integrate them on mobile robot platforms. In this work we want to explore how to obtain lighter deep learning models without compromising performance. To do so we will consider the features used in the 3D Entangled Forests algorithm and we will study the best strategies to integrate these within FuseNet deep network. Such new features allow us to shrink the network size without loosing performance, obtaining hence a lighter model which achieves state-of-the-art performance on the semantic segmentation task and represents an interesting alternative for mobile robotics applications, where computational power and energy are limited.
  },
  pages = {4634--4641},
  publisher = {IEEE},
  address = {Piscataway, NJ},
  month = jan,
  year = {2021},
  slug = {bonetto_2021_cd},
  author = {Terreran, Matteo and Bonetto, Elia and Ghidoni, Stefano},
  url = {https://ieeexplore.ieee.org/abstract/document/9412787/authors#authors},
  month_numeric = {1}
}