MultiMedia Metadata Management: a Proposal for an Infrastructure (original) (raw)
Related papers
Semantic Integration and Retrieval of Multimedia Metadata
2005
The amount of digital media that has to be actually managed has already become unaffordable without fine-grained computeris ed support. This requires an extensive use of multimedia metadata. MPEG-7 is the greatest metadata framework created to date but it is based on XML Schemas. Therefore, its does not have formal semantics, which makes difficult to manage, extend and integrate it. Consequently, there have been a lot attempts to move MPEG-7 to the Semantic Web. Our approach contributes a complete and automatic mapping of the whole MPEG-7 standard to OWL. It is based on a generic XML Schema to OWL mapping. The previous mapping is complemented with an XML metadata instances to RDF mapping that completes a transparent transfer of metadata from the XML to the Semantic Web domain. Once in a semantic space, data integration, which is a crucial factor when several sources of information are available, is facilitated enormously. We have used the generated MPEG-7 OWL ontology as an "upper-ontology" for multimedia metadata, where three different music schemas have been linked. Thus, it has been possible to retrieve related information from instances of all the metadata sources. Furthermore, detecting and merging instances from different sources has allowed us to enhance the description of audio files, both content-based and editorial data.
Integration of Structural and Semantic Models for Multimedia Metadata Management
2007 International Workshop on Content-Based Multimedia Indexing, 2007
The management and exchange of multimedia data is challenging due to the variety of formats, standards and intended applications. In addition, production of multimedia data is rapidly increasing due to the availability of off-the-shelf, modern digital devices that can be used by even inexperienced users. It is likely that this volume of information will only increase in the future. A key goal of the MUSCLE (Multimedia Understanding through Semantics, Computation and Learning) network is to develop tools, technologies and standards to facilitate the interoperability of multimedia content and support the exchange of such data. One approach for achieving this was the creation of a specific "E-Team", composed of the authors, to discuss core questions and practical issues based on the participant's individual work. In this paper, we present the relevant points of view with regards to sharing experiences and to extracting and integrating multimedia data and metadata from different modes (text, images, video).
Multimedia Annotation using Semantic Web Technologies
Due to the progressively increasing amount of multimedia on the Web, the need for efficient metadata formats describing that content has become increasingly evident. This paper gives an overview of the different approaches and methods for creation and retrieval of semantic rich multimedia metadata. Semantic web and its most important technologies XML, RDF and ontologies used for multimedia annotation are defined. An overview of various multimedia metadata vocabularies and formats that vary in their size and purpose is provided. Multimedia metadata is a type of metadata used for describing different aspects of multimedia content. All formats of multimedia metadata are not compatible with each other and most of it do not provide enough semantics. New Semantic Web technologies provide well-defined information meaning so different multimedia metadata can be more easily processed by computers.
Adding multimedia to the semantic web-building an mpeg-7 ontology
International Semantic Web Working Symposium (SWWS), 2001
For the past two years the Moving Pictures Expert Group (MPEG), a working group of ISO/IEC, have been developing MPEG-7 [1], the "Multimedia Content Description Interface", a standard for describing multimedia content. The goal of this standard is to develop a rich set of standardized tools to enable both humans and machines to generate and understand audiovisual descriptions which can be used to enable fast efficient retrieval from digital archives (pull applications) as well as filtering of streamed audiovisual broadcasts on the Internet (push applications). MPEG-7 is intended to describe audiovisual information regardless of storage, coding, display, transmission, medium, or technology. It will address a wide variety of media types including: still pictures, graphics, 3D models, audio, speech, video, and combinations of these (e.g., multimedia presentations). MPEG-7 is due for completion in October 2001. At this stage MPEG-7 definitions (description schemes and descriptors) are expressed solely in XML Schema [2-4]. XML Schema has been ideal for expressing the syntax, structural, cardinality and datatyping constraints required by MPEG-7. However it has become increasingly clear that in order to make MPEG-7 accessible, re-usable and interoperable with other domains then the semantics of the MPEG-7 metadata terms also need to be expressed in an ontology using a machine-understandable language. This paper describes the trials and tribulations of building such an ontology represented in RDF Schema [5] and demonstrates how this ontology can be exploited and reused by other communities on the semantic web (such as TV-Anytime [6], MPEG-21 [7], NewsML [8], museum, educational and geospatial domains) to enable the inclusion and exchange of multimedia content through a common understanding of the associated MPEG-7 multimedia content descriptions.
Adding Multimedia to the SemanticWeb: Building and Applying an MPEG-7 Ontology
Methods, Standards and Tools, 2005
For the past two years the Moving Pictures Expert Group (MPEG), a working group of ISO/IEC, have been developing MPEG-7 [1], the "Multimedia Content Description Interface", a standard for describing multimedia content. The goal of this standard is to develop a rich set of standardized tools to enable both humans and machines to generate and understand audiovisual descriptions which can be used to enable fast efficient retrieval from digital archives (pull applications) as well as filtering of streamed audiovisual broadcasts on the Internet (push applications). MPEG-7 is intended to describe audiovisual information regardless of storage, coding, display, transmission, medium, or technology. It will address a wide variety of media types including: still pictures, graphics, 3D models, audio, speech, video, and combinations of these (e.g., multimedia presentations). MPEG-7 is due for completion in October 2001. At this stage MPEG-7 definitions (description schemes and descriptors) are expressed solely in XML Schema [2-4]. XML Schema has been ideal for expressing the syntax, structural, cardinality and datatyping constraints required by MPEG-7. However it has become increasingly clear that in order to make MPEG-7 accessible, re-usable and interoperable with other domains then the semantics of the MPEG-7 metadata terms also need to be expressed in an ontology using a machine-understandable language. This paper describes the trials and tribulations of building such an ontology represented in RDF Schema [5] and demonstrates how this ontology can be exploited and reused by other communities on the semantic web (such as TV-Anytime [6], MPEG-21 [7], NewsML [8], museum, educational and geospatial domains) to enable the inclusion and exchange of multimedia content through a common understanding of the associated MPEG-7 multimedia content descriptions.
Towards a Unified Multimedia Metadata Management Solution
With increasing use of multimedia in various domains, several metadata standards appeared these last decades in order to facilitate the manipulation of multimedia contents. These standards help consumers to search content they desire and to adapt the retrieved content according to consumers' profiles and preferences. However, in order to extract information from a given standard, user must have a pre-knowledge about this latest. This condition is not easy to satisfy due to the increasing number of available standards. In this book chapter we introduce some of the main de facto multimedia standards which cover the description, by means of metadata, of the content and of the use context (profiles, devices, networks…). We discuss then the benefits of proposing an integrated vision of multimedia metadata standards through the usage of a generic multimedia metadata integration system and we expose the challenges of its implementation.
Using a Multimedia Ontology Infrastructure for Semantic Annotation of Multimedia Content
2005
In this paper we discuss the use of knowledge for the automatic extraction of semantic metadata from multimedia content. For the representation of knowledge we extended and enriched current general-purpose ontologies to include low-level visual features. More specifically, we implemented a tool that links MPEG-7 visual descriptors to high-level, domain-specific concepts. For the exploitation of this knowledge infrastructure we developed an experimentation platform, that allows us to analyze multimedia content and automatically create the associated semantic metadata, as well as to test, validate and refine the ontologies built. We pursued a tight and functional integration of the knowledge base and the analysis modules putting them in a loop of constant interaction instead of being the one just a pre-or post-processing step of the other.
Semantic Modeling of Digital Multimedia
Citeseer
- The requirement for a commonly accepted efficient mapping between multimedia metadata standards and semantic web-ontology standards is a major issue recognized by semantic multimedia research community. Though there have been several attempts to translate MPEG-7 ...