IBM®
Skip to main content
    United States change      Terms of use
 
 
Select a scope:    
     Home      Products      Services & industry solutions      Support & downloads      My account     
alphaWorks  >  Eclipse technology  >  

IBM Multimodal Annotation Tool

A tool that assists in annotating MPEG files with MPEG-7 meta data; both video and audio annotations can be created.


Date Posted: August 9, 2002
Overview

This technology has been retired.


About the technology author(s):
Bill Adams joined the T. J. Watson Research Center as a senior programmer in 1990. He received his Master's degree in Computer Science from the University of Illinois at Champaign in 1973. While at IBM, he contributed to a variety of projects, including the Power Visualization System, Air Traffic Control Using Speech Game, and Literacy Tutor Using Speech. Prior to joining IBM, Mr. Adams was manager of the Digital Equipment Corporation VMS workstation group and project leader of X-11 device-independent server. He has an extensive background in video games as a developer and manager. He was director of game development at Bally Midway. Among the video games that he developed is TRON, which was awarded "Coin-Operated Game of the Year" by Electronic Games Magazine. Mr. Adams has worked as a contractor, creating a number of embedded system products, and he developed applications for police and fire departments at the Motorola Communications Division.

Ching-Yung Lin joined IBM T. J. Watson Research Center as a research staff member in October 2000. He received his B.S. and M.S. from National Taiwan University in 1991 and 1993, respectively, both in Electrical Engineering. Dr. Lin received his Ph.D. in the Department of Electrical Engineering, Columbia University, in 2000. Dr. Lin was a wireless officer at R.O.C. Air Force, Taiwan, from 1993 to 1995, and he was an instructor of Network Communication Lab at National Taiwan University from 1995 to 1996. His current research interests include multimedia authentication and watermarking techniques for security; multimedia semantic analysis; indexing and query; multimedia transmission and networking; human-computer interaction; and multi-rate, multidimensional signal processing. Dr. Lin was the primary contributor in the design of the first successful semi-fragile content authentication system (SARI), which distinguishes JPEG/MPEG compression from malicious manipulation. He was also the primary contributor in the design of the first public/oblivious watermarking methods surviving the print-and-scan process.

Giridharan Iyengar joined IBM's T. J. Watson Research Center in 1999. He completed his Ph.D. at MIT Media Lab in 1999. Dr. Iyengar's thesis, "Characterization of Unstructured Video," looked at distributional clustering algorithms for video retrieval applied to cataloging of home videos. The focus of this work was to build a set of tools that analyze, characterize, and prepare casually-shot footage for interesting re-use, such as refrigerator magnets, postcards, etc. Dr. Iyengar is currently a member of the Human Language Technology Department; he is working on audio-visual speech technologies, mainly AV speech recognition and AV speech event detection.


Related technologies

For platform(s):
Windows 2000

For topics:
MPEG-7, MPEG, annotation, digital media, content creation, video, audio


Related resources

Press Articles

 

    About IBM Privacy Contact