Automatic semantic maps generation from lexical annotations

 

Authors
Rangel, José Carlos; Cazorla, Miguel; García Varea, Ismael; Romero González, Cristina; Martínez Gómez, Jesus
Format
Article
Status
publishedVersion
Description

The generation of semantic environment representations is still an open problem in robotics. Most of the current proposals are based on metric representations, and incorporate semantic information in a supervised fashion. The purpose of the robot is key in the generation of these representations, which has traditionally reduced the inter-usability of the maps created for different applications. We propose the use of information provided by lexical annotations to generate general-purpose semantic maps from RGB-D images. We exploit the availability of deep learning models suitable for describing any input image by means of lexical labels. Lexical annotations are more appropriate for computing the semantic similarity between images than the state-of-the-art visual descriptors. From these annotations, we perform a bottom-up clustering approach that associates each image with a different category. The use of RGB-D images allows the robot pose associated with each acquisition to be obtained, thus complementing the semantic with the metric information.
The generation of semantic environment representations is still an open problem in robotics. Most of the current proposals are based on metric representations, and incorporate semantic information in a supervised fashion. The purpose of the robot is key in the generation of these representations, which has traditionally reduced the inter-usability of the maps created for different applications. We propose the use of information provided by lexical annotations to generate general-purpose semantic maps from RGB-D images. We exploit the availability of deep learning models suitable for describing any input image by means of lexical labels. Lexical annotations are more appropriate for computing the semantic similarity between images than the state-of-the-art visual descriptors. From these annotations, we perform a bottom-up clustering approach that associates each image with a different category. The use of RGB-D images allows the robot pose associated with each acquisition to be obtained, thus complementing the semantic with the metric information.

Publication Year
2020
Language
eng
Topic
Semantic map
Lexical annotations
3D registration
RGB-D data
Deep learning
Semantic map
Lexical annotations
3D registration
RGB-D data
Deep learning
Repository
RI de Documento Digitales de Acceso Abierto de la UTP
Get full text
https://link.springer.com/article/10.1007/s10514-018-9723-8
https://ridda2.utp.ac.pa/handle/123456789/9442
Rights
embargoedAccess
License