Cooperative content and metadata generation in music archives (original) (raw)

IEEE 1599: a Multi-layer Approach to Music Description

Journal of Multimedia, 2009

IEEE 1599 is a new XML-based format to describe heterogeneous music contents comprehensively. In a single file, music symbols, printed scores, audio tracks, computer-driven performances, catalogue metadata, text and graphic contents related to a single music piece are linked and mutually synchronized within the same framework. Heterogeneous contents are organized in a multilayered structure that supports different encoding formats and

Music representation of score, sound, MIDI, structure and metadata all integrated in a single multilayer environment based on XML

2007

In this chapter, we will analyze the heterogeneous contents involved in a comprehensive description of music, organizing them according to a multilayer structure. Each layer we can identify corresponds to a different degree of abstraction in music information. In particular, our approach arranges music contents in six layers: General, Music Logic, Structural, Notational, Performance, and Audio. In order to reflect such organization, we will introduce a new XML-based format, called MX, which is currently undergoing the IEEE standardization process (IEEE SA PAR1599). In an MX file, music symbols, printed scores, audio tracks, computer-driven performances, catalogue metadata, and graphic contents related to a single music piece can be linked and mutually synchronized within the same encoding. The aforementioned multilayer structure allows us to gather and organize heterogeneous contents, leaving them encoded in well-known and commonly used formats aimed at music description.

Metadata for music and sounds: The cuidado project

2001

The IST project CUIDADO is the result of two years of concertation at the European level in the context of the CUIDAD Working Group (Esprit 28793). CUIDADO-led by Ircam, started in January 2000 and will end in December 2003-aims at developing content-based audio modules and applications using the MPEG-7 media representation standard. The project covers the analysis process (extraction of descriptors), the navigation process (retrieval methods and interfaces implemented in a leading database system with Web interoperability), up to the creative process (consuming and authoring tools) involving content creators and consumers at each stage. The project addresses both the audio (samples) and the music (titles) domains with the assumption that high-level descriptors for music should rely on robust lower level audio descriptors (pitch, energy or spectral features) in order to cover a wide range of applications. This approach matches the needs of record labels and copyright societies for Information management methods for both marketing and protecting their contents. CUIDADO is also a first attempt to go beyond content retrieval by providing an Authoring system using content features for professional musicians and studios.

Representing Modifiable and Reusable Musical Content on the Web With Constrained Multi-Hierarchical Structures

IEEE Transactions on Multimedia, 2020

The most commonly used formats for exchanging musical information today are limited in that they represent music as flat and rigid streams of events or as raw audio signals without any structural information about the content. Such files can only be listened to in a linear way and reused and manipulated in manners determined by a target application such as a Digital Audio Workstation. The publisher has no means to incorporate their intentions or understanding of the content. This paper introduces an extension of the music formalism CHARM for the representation of modifiable and reusable musical content on the Web. It discusses how various kinds of multi-hierarchical graph structures together with logical constraints can be useful to model different musical situations. In particular, we focus on presenting solutions on how to interpret, navigate and schedule such structures in order for them to be played back. We evaluate the versatility of the representation in a number of practical examples created with a Web-based implementation based on Semantic Web technologies.

Music representation issues, techniques, and systems

Computer Music Journal, 1993

Musicians deal with many levels of abstraction in music. If a conductor says, ''play the downbeat with more conviction,'' he or she is referencing music structure (a downbeat) and emotional content in the same sentence. It is convenient to think of musical representations at different levels, ranging from the highly symbolic and abstract level denoted by printed music to the non-symbolic and concrete level of an audio signal. Performance information is an intermediate level. We must consider these levels because there is rarely a unique conversion between any two of each of them. Each level contains at least some information not available in other levels. In general, there is great interest and value in performing automatic (partial) conversions between levels [Katayose 89], such as in optical music recognition or music transcription.

The MX Formalism for Semantic Web Compatible Representation of Music Metadata

Lecture Notes in Computer Science, 2005

Music description is nowadays considered an important matter in Information and Communication Technology. The encoding formats commonly accepted and employed are often characterized by a partial view of the whole problem: they describe music data or metadata for score, audio tracks, computer performances of music pieces, but they seldom encode all these aspects together. In this paper, we present the MX formalism that aims to address this limitation of the existing formats, by providing a Semantic Web compatible representation of music information in terms of structural and semantic features, by means of XML and OWL.

An XML-based format for advanced music fruition

2006

This paper describes an XML-based format that allows an advanced fruition of music contents. Thanks to such format, namely MX (IEEE PAR1599), and to the implementation of ad hoc interfaces, users can enjoy music from different points of view: the same piece can be described through different scores, video and audio performances, mutually synchronized. The purpose of this paper is pointing out the basic concepts of our XML encoding and presenting the process required to create rich multimedia descriptions of a music piece in MX format. Finally, a working application to play and view MX files will be presented.

Development and validation of a concept for layered audio descriptions

2021

The increasing availability of online video content, partially fueled by the Covid-19 pandemic and the growing presence of social media, adds to the importance of providing audio descriptions as a media alternative to video content for blind and visually impaired people. In order to address concerns as to what can be sufficiently described and how such descriptions can be delivered to users, a concept has been developed providing audio descriptions in multiple levels of detail. Relevant information is incorporated into an XML-based data structure. The concept also includes a process to provide optional explanations to terms and abbreviations, helping users without specific knowledge or people with cognitive concerns in comprehending complex videos. These features are implemented into a prototype based on the Able Player software. By conducting a user test, the benefits of multi-layered audio descriptions and optional explanatory content are evaluated. Findings suggest that the choic...

An XML-based Synchronization of Audio and Graphical Representations of Music Scores

Eighth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS '07), 2007

This paper presents an overview on a future IEEE standard aimed at providing an overall description of music. This format, known as IEEE PAR1599, is based on the XML meta-language. Its purpose is taking into account the heterogeneous multimedia representations of music, such as audio tracks, video clips and graphical instances of score. The key characteristics of the format are the possibility to enjoy such heterogeneous contents in a synchronized way and the possibility to switch from a particular representation to another in real-time.