Publication
ICASSP 2005
Conference paper

A generalized multiple instance learning algorithm for large scale modeling of multimedia semantics

View publication

Abstract

Statistical learning techniques provide a robust framework for learning representations of semantic concepts from multimedia features [1]. The bottleneck is the number of training samples needed to construct robust models. This is particularly expensive when the annotation needs to happen at finer granularity. We present a novel approach where the annotations may be entered at coarser spatial granularity while the concept may still be learnt at finer granularity. This can speed up annotation significantly. Using the multiple instance learning paradigm, we show that it is possible to learn representations of concepts occurring at the regional level by using annotations for several images. We present a generalized multiple instance learning algorithm that can scale to a large number of training samples as well as a large number of instances per bag. The algorithm also provides the ability to plug in different density modeling or regression techniques. Using the TREC 2001 Corpus we demonstrate the superior performance of the proposed algorithm over the existing diverse density algorithm [2]. © 2005 IEEE.

Date

Publication

ICASSP 2005

Authors

Share