Editorial Board

Editor-in-Chief
Krzysztof Janowicz

Managing Editors
Cogan Shimizu
Eva Blomqvist

Editorial Board
Mehwish Alam
Claudia d’Amato
Stefano Borgo
Boyan Brodaric
Philipp Cimiano
Oscar Corcho
Bernardo Cuenca-Grau
Elena Demidova
Jerome Euzenat
Mark Gahegan
Aldo Gangemi
Anna Lisa Gentile
Rafael Goncalves
Dagmar Gromann
Armin Haller
Pascal Hitzler
Aidan Hogan
Katja Hose
Eero Hyvönen
Sabrina Kirrane
Agnieszka Lawrynowicz
Freddy Lecue
Maria Maleshkova
Raghava Mutharaju
Axel Polleres
Guilin Qi
Marta Sabou
Harald Sack
Christoph Schlieder
Stefan Schlobach
Oshani Seneviratne
Cogan Shimizu
Ruben Verborgh
GQ Zhang

Former Editors-in-Chief
Pascal Hitzler

Editorial Assistants
Michael McCain

Syndicate

Studying the Impact of the Full-Network Embedding on Multimodal Pipelines

Submitted by Armand Vilalta on 09/26/2018 - 06:16

Tracking #: 2024-3237

Authors:

Armand Vilalta

Dario Garcia-Gasulla

Ferran Parés

Eduard Ayguade

Jesus Labarta

E Ulises Moya-Sánchez

Ulises Cortés

Responsible editor:

Guest Editors Semantic Deep Learning 2018

Submission type:

Full Paper

Abstract:

Abstract. The current state of the art for image annotation and image retrieval tasks is obtained through deep neural network multimodal pipelines, which combine an image representation and a text representation into a shared embedding space. In this paper we evaluate the impact of using the Full-Network embedding (FNE) in this setting, replacing the original image representation in four competitive multimodal embedding generation schemes. Unlike the one-layer image embeddings typically used by most approaches, the Full-Network embedding provides a multi-scale discrete representation of images, which results in richer characterisations. Extensive testing is performed on three different datasets comparing the performance of the studied variants and the impact of the FNE on a levelled playground, i.e., under equality of data used, source CNN models and hyper-parameter tuning. The results obtained indicate that the Full-Network embedding is consistently superior to the one-layer embedding. Furthermore, its impact on performance is superior to the improvement stemming from the other variants studied. These results motivate the integration of the Full-Network embedding on any multimodal embedding generation scheme.

Full PDF Version:

swj2024.pdf

Previous Version:

Studying the Impact of the Full-Network Embedding on Multimodal Pipelines

Tags:

Reviewed

Decision/Status:

Solicited Reviews:

Click to Expand/Collapse

Log in or register to post comments
8589 reads

Main menu

Editorial Board

Syndicate

Studying the Impact of the Full-Network Embedding on Multimodal Pipelines

Tracking #: 2024-3237

Reviewed Articles

Authors & Reviewers

Links

Recent blog posts

Accepted Articles

Search form

Main menu

Login

Editorial Board

Syndicate

Studying the Impact of the Full-Network Embedding on Multimodal Pipelines

Tracking #: 2024-3237

Reviewed Articles

Authors & Reviewers

Links

Recent blog posts

Accepted Articles