Learning to search efficiently in high dimensions
Zhen Li, Huazhong Ning, et al.
NeurIPS 2011
Rapid growth in the capture and generation of images and videos is driving the need for more efficient and effective systems for analyzing, searching, and retrieving this data. Specific challenges include supporting automatic content indexing at a large scale and accurately extracting a sufficiently large number of relevant semantic concepts to enable effective search. In this paper, we describe the development of a system for massive-scale visual semantic concept extraction and learning for images and video. The system models the visual semantic space using a hierarchical faceted classification scheme across objects, scenes, people, activities, and events and utilizes a novel machine learning approach that creates ensemble classifiers from automatically extracted visual features. The ensemble learning and extraction processes are easily parallelizable for distributed processing using Hadoop® and IBM InfoSphere® Streams, which enable efficient processing of large data sets. We report on various applications and quantitative and qualitative results for different image and video data sets.
Zhen Li, Huazhong Ning, et al.
NeurIPS 2011
Philipp Tschandl, Christoph Rinner, et al.
Nature Medicine
Michele Merler, Hui Wu, et al.
MADiMa 2016
John R. Smith, Liangliang Cao
MM 2013