T. Coianiz - Academia.edu (original) (raw)
Papers by T. Coianiz
SPIE Proceedings, 1993
In the framework of autonomous navigation, an application of Kalman filtering to the problem of m... more In the framework of autonomous navigation, an application of Kalman filtering to the problem of multi-sensor information processing is presented. In particular, estimation of model parameters is considered when a mobile robot equipped with a set of sonars and a standard TV camera moves along corridors, and the environment is affected by sharp discontinuities due to the presence of recesses or protrusions. Experiments performed in real-world situations are also presented and discussed.
Journal of Electronic Imaging, 1997
This paper introduces a neuro-fuzzy system for the estimation of the crowding level in a scene. M... more This paper introduces a neuro-fuzzy system for the estimation of the crowding level in a scene. Monitoring the number of people present in a given indoor environment is a requirement in a variety of surveillance applications. In the present work, crowding has to be estimated from the image processing of visual scenes collected via a TV camera. A suitable preprocessing
Using non-topological data, like a CAD project, in order to update a geographic database in which... more Using non-topological data, like a CAD project, in order to update a geographic database in which strong topological constraints are defined, is not straightforward. It requires indeed the definition of proper conversion procedures, in order to deal with potential arising conflicts. In this work a workaround is proposed, which is based on an overlay working structure, and which was positively tested in the framework of a local administration.
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
Lecture Notes in Computer Science, 1997
Model-based image coding has recently attracted much attention as a basis for the next generation... more Model-based image coding has recently attracted much attention as a basis for the next generation of communication services. This article proposes a model-based image coding for the mouth, which is aimed at capturing visual information related to speech, in order to make the decoded video sequence suitable for lip-reading. Such a coding system is basically composed of an analysis process
Proceedings of International Conference on Neural Networks (ICNN'96), 1996
A trainable vision-based system is presented, which is able to perform reliable, real time estima... more A trainable vision-based system is presented, which is able to perform reliable, real time estimates of the crowding level present on the platforms of underground stations. Taking as input standard the B/W images of the scene, a classification of the crowding level is performed in terms of five qualitative crowding classes, ranging from no people to overcrowding. Visual feature extraction
. A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is ba... more . A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is based on a parametric 2D model of the lips.At each frame, key information is extracted from chrominance analysis, andoptimal parameters for the model are found by maximizing a suitable scorefunction. A detailed description of the techniques employed is given, andsome preliminary results shown.1 IntroductionThe mouth is a part of the human body which presents high interpersonalvariability....
Proc. 6th European Conf. on …, 1999
The collection of telephone databases, for training speech recognisers, is a time consuming and c... more The collection of telephone databases, for training speech recognisers, is a time consuming and costly work. In the paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy ...
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
Lecture Notes in Computer Science, 1997
Model-based image coding has recently attracted much attention as a basis for the next generation... more Model-based image coding has recently attracted much attention as a basis for the next generation of communication services. This article proposes a model-based image coding for the mouth, which is aimed at capturing visual information related to speech, in order to make the decoded video sequence suitable for lip-reading. Such a coding system is basically composed of an analysis process
Proceedings of International Conference on Neural Networks (ICNN'96), 1996
A trainable vision-based system is presented, which is able to perform reliable, real time estima... more A trainable vision-based system is presented, which is able to perform reliable, real time estimates of the crowding level present on the platforms of underground stations. Taking as input standard the B/W images of the scene, a classification of the crowding level is performed in terms of five qualitative crowding classes, ranging from no people to overcrowding. Visual feature extraction
. A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is ba... more . A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is based on a parametric 2D model of the lips.At each frame, key information is extracted from chrominance analysis, andoptimal parameters for the model are found by maximizing a suitable scorefunction. A detailed description of the techniques employed is given, andsome preliminary results shown.1 IntroductionThe mouth is a part of the human body which presents high interpersonalvariability....
Proc. 6th European Conf. on …, 1999
The collection of telephone databases, for training speech recognisers, is a time consuming and c... more The collection of telephone databases, for training speech recognisers, is a time consuming and costly work. In the paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy ...
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
SPIE Proceedings, 1993
In the framework of autonomous navigation, an application of Kalman filtering to the problem of m... more In the framework of autonomous navigation, an application of Kalman filtering to the problem of multi-sensor information processing is presented. In particular, estimation of model parameters is considered when a mobile robot equipped with a set of sonars and a standard TV camera moves along corridors, and the environment is affected by sharp discontinuities due to the presence of recesses or protrusions. Experiments performed in real-world situations are also presented and discussed.
Journal of Electronic Imaging, 1997
This paper introduces a neuro-fuzzy system for the estimation of the crowding level in a scene. M... more This paper introduces a neuro-fuzzy system for the estimation of the crowding level in a scene. Monitoring the number of people present in a given indoor environment is a requirement in a variety of surveillance applications. In the present work, crowding has to be estimated from the image processing of visual scenes collected via a TV camera. A suitable preprocessing
Using non-topological data, like a CAD project, in order to update a geographic database in which... more Using non-topological data, like a CAD project, in order to update a geographic database in which strong topological constraints are defined, is not straightforward. It requires indeed the definition of proper conversion procedures, in order to deal with potential arising conflicts. In this work a workaround is proposed, which is based on an overlay working structure, and which was positively tested in the framework of a local administration.
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
Lecture Notes in Computer Science, 1997
Model-based image coding has recently attracted much attention as a basis for the next generation... more Model-based image coding has recently attracted much attention as a basis for the next generation of communication services. This article proposes a model-based image coding for the mouth, which is aimed at capturing visual information related to speech, in order to make the decoded video sequence suitable for lip-reading. Such a coding system is basically composed of an analysis process
Proceedings of International Conference on Neural Networks (ICNN'96), 1996
A trainable vision-based system is presented, which is able to perform reliable, real time estima... more A trainable vision-based system is presented, which is able to perform reliable, real time estimates of the crowding level present on the platforms of underground stations. Taking as input standard the B/W images of the scene, a classification of the crowding level is performed in terms of five qualitative crowding classes, ranging from no people to overcrowding. Visual feature extraction
. A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is ba... more . A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is based on a parametric 2D model of the lips.At each frame, key information is extracted from chrominance analysis, andoptimal parameters for the model are found by maximizing a suitable scorefunction. A detailed description of the techniques employed is given, andsome preliminary results shown.1 IntroductionThe mouth is a part of the human body which presents high interpersonalvariability....
Proc. 6th European Conf. on …, 1999
The collection of telephone databases, for training speech recognisers, is a time consuming and c... more The collection of telephone databases, for training speech recognisers, is a time consuming and costly work. In the paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy ...
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
Lecture Notes in Computer Science, 1997
Model-based image coding has recently attracted much attention as a basis for the next generation... more Model-based image coding has recently attracted much attention as a basis for the next generation of communication services. This article proposes a model-based image coding for the mouth, which is aimed at capturing visual information related to speech, in order to make the decoded video sequence suitable for lip-reading. Such a coding system is basically composed of an analysis process
Proceedings of International Conference on Neural Networks (ICNN'96), 1996
A trainable vision-based system is presented, which is able to perform reliable, real time estima... more A trainable vision-based system is presented, which is able to perform reliable, real time estimates of the crowding level present on the platforms of underground stations. Taking as input standard the B/W images of the scene, a classification of the crowding level is performed in terms of five qualitative crowding classes, ranging from no people to overcrowding. Visual feature extraction
. A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is ba... more . A scheme for describing the mouth of a speaker in color imagesequences is proposed, which is based on a parametric 2D model of the lips.At each frame, key information is extracted from chrominance analysis, andoptimal parameters for the model are found by maximizing a suitable scorefunction. A detailed description of the techniques employed is given, andsome preliminary results shown.1 IntroductionThe mouth is a part of the human body which presents high interpersonalvariability....
Proc. 6th European Conf. on …, 1999
The collection of telephone databases, for training speech recognisers, is a time consuming and c... more The collection of telephone databases, for training speech recognisers, is a time consuming and costly work. In the paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy ...
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.
Document Image Understanding (DIU) is an interesting research area with a large variety of challe... more Document Image Understanding (DIU) is an interesting research area with a large variety of challenging applications. Researchers have worked from decades on this topic, as witnessed by the scientific literature. The main purpose of the present report is to describe the current status of DIU with particular attention to two subprocesses: document skew angle estimation and page decomposition. Several algorithms proposed in the literature are synthetically described. They are included in a novel classification scheme. Some methods proposed for the evaluation of page decomposition algorithms are described. Critical discussions are reported about the current status of the field and about the open problems. Some considerations about the logical layout analysis are also reported.