Hideki Kasuya - Academia.edu (original) (raw)
Papers by Hideki Kasuya
The Journal of the Acoustical Society of America, Nov 1, 1986
In order to evaluate noise components included in pathologic voice signals, a novel acoustic meas... more In order to evaluate noise components included in pathologic voice signals, a novel acoustic measure, normalized noise energy (NNE), is proposed and its effectiveness for the detection of laryngeal pathologies is investigated with 250 vowel samples spoken by 64 control (normal) subjects and 186 patients with various laryngeal diseases. The NNE is automatically computed from the voice signals using an adaptive comb filtering method performed in the frequency domain. Experiments with the voice sample s show that the NNE is especially effective for detecting glottic cancer, recurrent nerve paralysis, and vocal cord nodules. Specifically, when glottic cancer is represented in terms of the T classification adopted by the UICC (Union Internationale Contre le Cancer), glottic T2-T4 cancer can be perfectly discriminated from normal samples, but 22.6% of patients with glottic T1 cancer are incorrectly classified as normal, with an error rate of 9.4% for normal subjects.
Lecture Notes in Computer Science, 2000
The Utsunomiya University (UU) Spoken Dialogue Database for Paralinguistic Information Studies, n... more The Utsunomiya University (UU) Spoken Dialogue Database for Paralinguistic Information Studies, now available to the public, is introduced. The UU database is intended mainly for use in understanding the usage, structure and effect of paralinguistic information in expressive Japanese conversational speech. This paper describes the outline, design, building, and key properties of the UU database, to show how the corpus
Electronics and Communications in Japan (Part I Communications)
ABSTRACT
Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
ABSTRACT
We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neu... more We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neurologic disease, in order to elucidate its physiological and acoustical mechanism, and to develop aids for diagnosis and training, etc. In this report, acoustical characteristics of various kinds of dysarthrias are measured. As a result, shrinking of the F0 range as well as vowel space are observed in dysarthric speech. We performed a perceptual experiment to clarify how such parameters affect so-called "monotonous" impression, and found that abnormality in the F0 range affects the monotonous impression.
Japan Journal of Logopedics and Phoniatrics
Japanese Journal of Clinical Oncology
An acoustic screening method for indicating the possible presence of laryngeal cancer was investi... more An acoustic screening method for indicating the possible presence of laryngeal cancer was investigated. Three acoustic parameters, comprising perturbations in pitch-period and peak-amplitude sequences, and vocal noise, were measured from a sustained vowel, e, spoken by the subjects taking part in the investigation. Experiments to discriminate between normal and cancer groups were performed with voice samples taken from 64 normal control subjects and 57 patients with laryngeal cancer. Experiments were also carried out to test the perceptual significance of the three parameters. From the results, we have been able to conclude that their combined use will enable us to build an acoustic screening system for laryngeal cancer.
We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neu... more We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neurologic disease, in order to elucidate its physiological and acoustical mechanism, and to develop aids for diagnosis and training, etc. In this report, acoustical characteristics of various kinds of dysarthrias are measured. As a result, shrinking of the F 0 range as well as vowel space are observed in dysarthric speech. Also, from the comparison of F 0 range and vowel formant frequencies it is suggested that speech effort to produce wider F 0 range can influence vowel quality as well.
The International journal of biological markers
Tryptophan degradation metabolites are known to suppress T-cell function, which is a mechanism of... more Tryptophan degradation metabolites are known to suppress T-cell function, which is a mechanism of resistance of tumor cells against immune surveillance. The aim of this study was to evaluate tryptophan degradation along with serum neopterin levels in benign and malignant breast disease. Serum tryptophan and kynurenine levels and neopterin concentrations of 30 patients with malignant and 27 patients with benign breast disease were determined by HPLC and ELISA, respectively. The slight increase in tryptophan degradation in a subgroup of cancer patients with higher grade tumors was not statistically significant, but the increased degradation was correlated with higher neopterin concentrations. Neopterin levels in patients with malignant breast disease were significantly higher than in the benign group (p<0.05). Tryptophan degradation positively correlates with the aggressiveness of the tumor because it changes with tumor grade rather than disease stage.
Electronics and Communications in Japan (Part III: Fundamental Electronic Science), 1999
ABSTRACT An analysis-conversion-synthesis system taking account of cycle-to-cycle perturbation an... more ABSTRACT An analysis-conversion-synthesis system taking account of cycle-to-cycle perturbation and designed for speech research was constructed. This system can analyze, convert, and synthesize acoustic characteristics related to voice quality, such as the perturbation of the fundamental period, effective value and spectrum, mean fundamental frequency, average spectrum, and laryngeal noise. Analysis–synthesis experiments show that the spectral envelopes and perturbations can be reconstructed from a few parameters. The analyzed–synthesized speech signals retain the spectrum of the original speech signals. The perceptual difference between the original and the analyzed–synthesized signals is very small. In the future, we will investigate the relationships between acoustic characteristics and perceptual impressions and the relationships between the naturalness of synthesized voices and the characteristics of perturbation. © 1999 Scripta Technica, Electron Comm Jpn Pt 3, 82(12): 1–12, 1999
The Journal of the Acoustical Society of America, Nov 1, 1986
In order to evaluate noise components included in pathologic voice signals, a novel acoustic meas... more In order to evaluate noise components included in pathologic voice signals, a novel acoustic measure, normalized noise energy (NNE), is proposed and its effectiveness for the detection of laryngeal pathologies is investigated with 250 vowel samples spoken by 64 control (normal) subjects and 186 patients with various laryngeal diseases. The NNE is automatically computed from the voice signals using an adaptive comb filtering method performed in the frequency domain. Experiments with the voice sample s show that the NNE is especially effective for detecting glottic cancer, recurrent nerve paralysis, and vocal cord nodules. Specifically, when glottic cancer is represented in terms of the T classification adopted by the UICC (Union Internationale Contre le Cancer), glottic T2-T4 cancer can be perfectly discriminated from normal samples, but 22.6% of patients with glottic T1 cancer are incorrectly classified as normal, with an error rate of 9.4% for normal subjects.
Lecture Notes in Computer Science, 2000
The Utsunomiya University (UU) Spoken Dialogue Database for Paralinguistic Information Studies, n... more The Utsunomiya University (UU) Spoken Dialogue Database for Paralinguistic Information Studies, now available to the public, is introduced. The UU database is intended mainly for use in understanding the usage, structure and effect of paralinguistic information in expressive Japanese conversational speech. This paper describes the outline, design, building, and key properties of the UU database, to show how the corpus
Electronics and Communications in Japan (Part I Communications)
ABSTRACT
Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
ABSTRACT
We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neu... more We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neurologic disease, in order to elucidate its physiological and acoustical mechanism, and to develop aids for diagnosis and training, etc. In this report, acoustical characteristics of various kinds of dysarthrias are measured. As a result, shrinking of the F0 range as well as vowel space are observed in dysarthric speech. We performed a perceptual experiment to clarify how such parameters affect so-called "monotonous" impression, and found that abnormality in the F0 range affects the monotonous impression.
Japan Journal of Logopedics and Phoniatrics
Japanese Journal of Clinical Oncology
An acoustic screening method for indicating the possible presence of laryngeal cancer was investi... more An acoustic screening method for indicating the possible presence of laryngeal cancer was investigated. Three acoustic parameters, comprising perturbations in pitch-period and peak-amplitude sequences, and vocal noise, were measured from a sustained vowel, e, spoken by the subjects taking part in the investigation. Experiments to discriminate between normal and cancer groups were performed with voice samples taken from 64 normal control subjects and 57 patients with laryngeal cancer. Experiments were also carried out to test the perceptual significance of the three parameters. From the results, we have been able to conclude that their combined use will enable us to build an acoustic screening system for laryngeal cancer.
We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neu... more We are investigating acoustical analysis for dysarthric speech, which appears as a symptom of neurologic disease, in order to elucidate its physiological and acoustical mechanism, and to develop aids for diagnosis and training, etc. In this report, acoustical characteristics of various kinds of dysarthrias are measured. As a result, shrinking of the F 0 range as well as vowel space are observed in dysarthric speech. Also, from the comparison of F 0 range and vowel formant frequencies it is suggested that speech effort to produce wider F 0 range can influence vowel quality as well.
The International journal of biological markers
Tryptophan degradation metabolites are known to suppress T-cell function, which is a mechanism of... more Tryptophan degradation metabolites are known to suppress T-cell function, which is a mechanism of resistance of tumor cells against immune surveillance. The aim of this study was to evaluate tryptophan degradation along with serum neopterin levels in benign and malignant breast disease. Serum tryptophan and kynurenine levels and neopterin concentrations of 30 patients with malignant and 27 patients with benign breast disease were determined by HPLC and ELISA, respectively. The slight increase in tryptophan degradation in a subgroup of cancer patients with higher grade tumors was not statistically significant, but the increased degradation was correlated with higher neopterin concentrations. Neopterin levels in patients with malignant breast disease were significantly higher than in the benign group (p<0.05). Tryptophan degradation positively correlates with the aggressiveness of the tumor because it changes with tumor grade rather than disease stage.
Electronics and Communications in Japan (Part III: Fundamental Electronic Science), 1999
ABSTRACT An analysis-conversion-synthesis system taking account of cycle-to-cycle perturbation an... more ABSTRACT An analysis-conversion-synthesis system taking account of cycle-to-cycle perturbation and designed for speech research was constructed. This system can analyze, convert, and synthesize acoustic characteristics related to voice quality, such as the perturbation of the fundamental period, effective value and spectrum, mean fundamental frequency, average spectrum, and laryngeal noise. Analysis–synthesis experiments show that the spectral envelopes and perturbations can be reconstructed from a few parameters. The analyzed–synthesized speech signals retain the spectrum of the original speech signals. The perceptual difference between the original and the analyzed–synthesized signals is very small. In the future, we will investigate the relationships between acoustic characteristics and perceptual impressions and the relationships between the naturalness of synthesized voices and the characteristics of perturbation. © 1999 Scripta Technica, Electron Comm Jpn Pt 3, 82(12): 1–12, 1999