Improving Audiovisual Content Annotation Through a Semi-automated Process Based on Deep Learning

Paula Viana; Maria Teresa Andrade; Pedro Miguel Carvalho; Vilaça,L

Improving Audiovisual Content Annotation Through a Semi-automated Process Based on Deep Learning

Files

P-00Q-GEZ.pdf (1.33 MB)

Date

2018

Authors

Paula Viana

Maria Teresa Andrade

Pedro Miguel Carvalho

Vilaça,L

Abstract

Over the last years, Deep Learning has become one of the most popular research fields of Artificial Intelligence. Several approaches have been developed to address conventional challenges of AI. In computer vision, these methods provide the means to solve tasks like image classification, object identification and extraction of features. In this paper, some approaches to face detection and recognition are presented and analyzed, in order to identify the one with the best performance. The main objective is to automate the annotation of a large dataset and to avoid the costy and time-consuming process of content annotation. The approach follows the concept of incremental learning and a R-CNN model was implemented. Tests were conducted with the objective of detecting and recognizing one personality within image and video content. Results coming from this initial automatic process are then made available to an auxiliary tool that enables further validation of the annotations prior to uploading them to the archive. Tests show that, even with a small size dataset, the results obtained are satisfactory. © 2020, Springer Nature Switzerland AG.

URI

http://repositorio.inesctec.pt/handle/123456789/12178
http://dx.doi.org/10.1007/978-3-030-17065-3_7

Collections

CTM - Indexed Articles in Conferences

Full item page