T. Coianiz - Academia.edu

Papers by T. Coianiz

Improving robot's indoor navigation capabilities by integrating visual, sonar, and odometric measurements

SPIE Proceedings, 1993

In the framework of autonomous navigation, an application of Kalman filtering to the problem of multi-sensor information processing is presented. In particular, estimation of model parameters is considered when a mobile robot equipped with a set of sonars and a standard TV camera moves along corridors, and the environment is affected by sharp discontinuities due to the presence of recesses or protrusions. Experiments performed in real-world situations are also presented and discussed.

Estimating the crowding level with a neuro-fuzzy classifier

Journal of Electronic Imaging, 1997

This paper introduces a neuro-fuzzy system for the estimation of the crowding level in a scene. Monitoring the number of people present in a given indoor environment is a requirement in a variety of surveillance applications. In the present work, crowding has to be estimated from the image processing of visual scenes collected via a TV camera. A suitable preprocessing

Problematiche tecnologiche nell'aggiornamento della banca dati territoriale di un comune (Technological issues in updating a municipality's territorial database)

Using non-topological data, such as a CAD project, to update a geographic database in which strong topological constraints are defined is not straightforward. It requires the definition of proper conversion procedures to deal with conflicts that may arise. In this work a workaround is proposed, which is based on an overlay working structure, and which was successfully tested in the framework of a local administration.

Geometric Layout Analysis Techniques for Document Image Understanding: a Review

Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked for decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are briefly described and placed in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. The current status of the field and the open problems are critically discussed. Some considerations about logical layout analysis are also reported.

WebGIS quale strumento di e-government per l'amministrazione comunale (WebGIS as an e-government tool for municipal administration)

Le problematiche di visualizzazione e rappresentazione dei DB topografico multiscala (Issues in the visualization and representation of multiscale topographic DBs)

Analysis and encoding of lip movements

Lecture Notes in Computer Science, 1997

Model-based image coding has recently attracted much attention as a basis for the next generation of communication services. This article proposes a model-based image coding for the mouth, which is aimed at capturing visual information related to speech, in order to make the decoded video sequence suitable for lip-reading. Such a coding system is basically composed of an analysis process

A fuzzy classifier for visual crowding estimates

Proceedings of International Conference on Neural Networks (ICNN'96), 1996

A trainable vision-based system is presented, which is able to perform reliable, real-time estimates of the crowding level present on the platforms of underground stations. Taking standard B/W images of the scene as input, a classification of the crowding level is performed in terms of five qualitative crowding classes, ranging from no people to overcrowding. Visual feature extraction

2D Deformable Models for Visual Speech Analysis

A scheme for describing the mouth of a speaker in color image sequences is proposed, which is based on a parametric 2D model of the lips. At each frame, key information is extracted from chrominance analysis, and optimal parameters for the model are found by maximizing a suitable score function. A detailed description of the techniques employed is given, and some preliminary results shown. 1 Introduction. The mouth is a part of the human body which presents high interpersonal variability....

Use of simulated data for robust telephone speech recognition

Proc. 6th European Conf. on …, 1999

The collection of telephone databases for training speech recognisers is time-consuming and costly work. In this paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy ...

Geometric Layout Analysis Techniques for Document Image Understanding: a Review. TR 9703-09

Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked for decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are briefly described and placed in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. The current status of the field and the open problems are critically discussed. Some considerations about logical layout analysis are also reported.
