IBM®
Skip to main content
    United States change      Terms of use
 
 
Select a scope:    
     Home      Products      Services & industry solutions      Support & downloads      My account     
alphaWorks  >  Information management  >  

IBM Multimedia Analysis and Retrieval System

An automated desktop indexing and multimodal search system for digital image and video collections.


Date Posted: February 28, 2005
OverviewRequirements Download FAQs Forum Reviews

Update: May 9, 2008

Improved Indexing Tool (various video formats, additional functionality, and larger set of semantic classifiers) and Search Tool (dynamic Web-based search, visualization, and tagging system).

What is IBM Multimedia Analysis and Retrieval System?

IBM® Multimedia Analysis and Retrieval System is an automated content indexing and multimodal search system for digital image and video collections. This system addresses the problem of indexing, classifying, and searching large volumes of images and videos: It can visually analyze images and videos, categorize them based on appearance and associated metadata, and make them more searchable than they are when only associated text metadata are used.

Existing solutions require the user to describe visual content manually. Manual tagging is time-consuming and often subjective, leading to incomplete and inconsistent annotations of images and video and thus preventing efficient cataloging of the current deluge of multimedia data. IBM Multimedia Analysis and Retrieval System is unique in its approach to analyzing and fusing audio, visual, and text information in order to automatically annotate multimedia data.

This tool is being developed at IBM Research.

How does it work?

IBM Multimedia Analysis and Retrieval System automatically indexes unlabeled images and videos repositories with a set of classifiers, using machine learning approaches.

IBM Multimedia Analysis and Retrieval System provides a number of important multimedia browsing and search functions in its dynamic search interface that are based on content (such as color, texture, shape, and edges), content clusters, models (such as scenes, objects, and events), and text (such as speech, closed captions, textual metadata, and user-associated tags). Users can query image and video repositories using concept models, visual feature descriptors, and extracted metadata as needed for a particular query. Users can tag individual video shots or meaningful groups of shots created from the fused multi-modal index.


About the technology author(s):

This tool was developed by the IBM T. J. Watson Research Center Intelligent Information Analysis team: John R. Smith, Apostol (Paul) Natsev, Jelena Tešić, Lexing Xie, Rong Yan, and IBM interns Florian Letz, Christian Penz, Joachim Seidl, and Jun Yang.


IBM is a trademark of IBM Corporation in the United States, other countries, or both.
Other company, product, or service names may be trademarks or service marks of others.

Download now Download now
View demo View demo

Related technologies

For platform(s):
Windows

For topics:
digital media, MPEG, Multimedia, semantics, UIMA, video


Related resources

Semantics Research topic

Press Articles

 

    About IBM Privacy Contact