Dataciencia

Colección SciELO Chile

IMFD IMPRESEE at TRECVID 2019: Ad-hoc video search and video to text

Indexado

Scopus

SCOPUS_ID:85097831633

DOI

Año

2019

Tipo

Citas Totales

Autores Afiliación Chile

Instituciones Chile

% Participación
Internacional

Autores
Afiliación Extranjera

Instituciones
Extranjeras

Abstract

In this paper we present an overview of our participation in TRECVID 2019 [1]. We participated in the task Ad-hoc Video Search (AVS) and the subtasks Description Generation and Matching and Ranking of Video to Text (VTT) task. First, for the AVS Task, we develop a system architecture that we call “Word2AudioVisualVec++” (W2AVV++) based on Word2VisualVec++ (W2VV++) [11] that in addition to using deep visual features of videos, also uses deep audio features obtained from pre-trained networks. Second, for the VTT Matching and Ranking Task, we develop another deep learning model based on Word2VisualVec++, extracting temporal information of the video by using Dense Trajectories [16] and a clustering approach to encode them into a single vector representation. Third, for the VTT Description Generation Task, we develop an Encoder-Decoder model incorporating semantic states into the Encoder phase.

Disciplinas de Investigación

WOS
Sin Disciplinas

Scopus
Sin Disciplinas

SciELO
Sin Disciplinas

Muestra la distribución de disciplinas para esta publicación.

Publicaciones WoS (Ediciones: ISSHP, ISTP, AHCI, SSCI, SCI), Scopus, SciELO Chile.

Colaboración Institucional

Muestra la distribución de colaboración, tanto nacional como extranjera, generada en esta publicación.

Autores - Afiliación

Ord.	Autor	Género	Institución - País
1	Hernandez, Rodrigo	-	Universidad de Chile - Chile
2	Perez-Martin, Jesus	-	Universidad de Chile - Chile Instituto Milenio Fundamentos de los Datos - Chile
3	Bravo, Nicolas	-	Universidad de Chile - Chile Instituto Milenio Fundamentos de los Datos - Chile
4	Barrios, Juan Manuel	-	Universidad de Chile - Chile ORAND SA - Chile Impresee Inc. - Estados Unidos
5	Bustos, Benjamin	-	Universidad de Chile - Chile Instituto Milenio Fundamentos de los Datos - Chile

Muestra la afiliación y género (detectado) para los co-autores de la publicación.

Financiamiento

Fuente
National Institute of Standards and Technology
Universiteit van Amsterdam

Muestra la fuente de financiamiento declarada en la publicación.

Agradecimientos

Agradecimiento
The authors of this paper would like to thank NIST and all the coordinators of TRECVID for organizing this event. Special thanks to Cees Snoek, Pascal Mettes and the University of Amsterdam MediaMill team for sharing their pre-trained ResNeXt-101 model with us.