SCITEPRESS (original) (raw)

Paper

Paper Unlock

ColEnViSon: Color Enhanced Visual Sonifier - A Polyphonic Audio Texture and Salient Scene Analysis

Topics: Cognitive & Biologically Inspired Vision; Visual Navigation

Codruta Orniana Ancuti ; Cosmin Ancuti and Philippe Bekaert

Affiliation: Hasselt University, Belgium

Keyword(s): Blind Navigation,Visual Saliency, Color Transformation.

RelatedOntology Subjects/Areas/Topics: Computer Vision, Visualization and Computer Graphics ; Motion, Tracking and Stereo Vision ; Visual Navigation

Abstract: In this work we introduce a color based image-audio system that enhances the perception of the visually impaired users. Traditional sound-vision substitution systems mainly translate gray scale images into corresponding audio frequencies. However, these algorithms deprive the user from the color information, an critical factor in object recognition and also for attracting visual attention. We propose an algorithm that translates the scene into sound based on some classical computer vision algorithms. The most salient visual regions are extracted by a hybrid approach that blends the computed salient map with the segmented image. The selected image region is simplified based on a reference color map dictionary. The centroid of the color space are translated into audio by different musical instruments. We chose to encode the audio file by polyphonic music composition reasoning that humans are capable to distinguish more than one instrument in the same time but also to reduce the playing duration. Testing the prototype demonstrate that non-proficient blindfold participants can easily interpret sequence of colored patterns and also to distinguish by example the quantity of a specific color contained by a given image. (More)

In this work we introduce a color based image-audio system that enhances the perception of the visually impaired users. Traditional sound-vision substitution systems mainly translate gray scale images into corresponding audio frequencies. However, these algorithms deprive the user from the color information, an critical factor in object recognition and also for attracting visual attention. We propose an algorithm that translates the scene into sound based on some classical computer vision algorithms. The most salient visual regions are extracted by a hybrid approach that blends the computed salient map with the segmented image. The selected image region is simplified based on a reference color map dictionary. The centroid of the color space are translated into audio by different musical instruments. We chose to encode the audio file by polyphonic music composition reasoning that humans are capable to distinguish more than one instrument in the same time but also to reduce the playing duration. Testing the prototype demonstrate that non-proficient blindfold participants can easily interpret sequence of colored patterns and also to distinguish by example the quantity of a specific color contained by a given image.

CC BY-NC-ND 4.0

Sign In

Guests can use SciTePress Digital Library without having a SciTePress account. However, guests have limited access to downloading full text versions of papers and no access to special options.

Guests can use SciTePress Digital Library without having a SciTePress account. However, guests have limited access to downloading full text versions of papers and no access to special options.

Guest:Register as new SciTePress user now for free.

Sign In

Download limit per month - 500 recent papers or 4000 papers more than 2 years old.

SciTePress user: please login.

You are not signed in, therefore limits apply to your IP address 8.228.100.111

In the current month:

Recent papers: 100 available of 100 total

2+ years older papers: 200 available of 200 total

Paper citation in several formats:

Ancuti, C. O., Ancuti, C. and Bekaert, P. (2009). ColEnViSon: Color Enhanced Visual Sonifier - A Polyphonic Audio Texture and Salient Scene Analysis. In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications (VISIGRAPP 2009) - Volume 2: VISAPP; ISBN 978-989-8111-69-2; ISSN 2184-4321, SciTePress, pages 566-572. DOI: 10.5220/0001805105660572

@conference{visapp09,
author={Codruta Orniana Ancuti and Cosmin Ancuti and Philippe Bekaert},
title={ColEnViSon: Color Enhanced Visual Sonifier - A Polyphonic Audio Texture and Salient Scene Analysis},
booktitle={Proceedings of the Fourth International Conference on Computer Vision Theory and Applications (VISIGRAPP 2009) - Volume 2: VISAPP},
year={2009},
pages={566-572},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001805105660572},
isbn={978-989-8111-69-2},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the Fourth International Conference on Computer Vision Theory and Applications (VISIGRAPP 2009) - Volume 2: VISAPP
TI - ColEnViSon: Color Enhanced Visual Sonifier - A Polyphonic Audio Texture and Salient Scene Analysis
SN - 978-989-8111-69-2
IS - 2184-4321
AU - Ancuti, C.
AU - Ancuti, C.
AU - Bekaert, P.
PY - 2009
SP - 566
EP - 572
DO - 10.5220/0001805105660572
PB - SciTePress