The Pitch Rise Paradigm: A New Task for Real-Time Endoscopy of Non-Stationary Phonation (original) (raw)
Related papers
Simulated Laryngeal High-Speed Videos for the Study of Normal and Dysphonic Vocal Fold Vibration
Journal of Speech, Language, and Hearing Research
Purpose: Laryngeal high-speed videoendoscopy (LHSV) has been recognized as a highly valuable modality for the scientific investigations of vocal fold (VF) vibrations. In contrast to stroboscopic imaging, LHSV enables visualizing aperiodic VF vibrations. However, the technique is less well established in the clinical care of disordered voices, partly because the properties of aperiodic vibration patterns are not yet described comprehensively. To address this, a computer model for simulation of VF vibration patterns observed in a variety of different phonation types is proposed. Method: A previously published kinematic model of mucosal wave phenomena is generalized to be capable of left–right asymmetry and to simulate endoscopic videos instead of only kymograms of VF vibrations at single sagittal positions. The most influential control parameters are the glottal halfwidths, the oscillation frequencies, the amplitudes, and the phase delays. Results: The presented videos demonstrate zip...
Comparative Analysis of Vocal Fold Vibration Using High-Speed Videoendoscopy and Digital Kymography
Journal of Voice, 2014
To compare high-speed videoendoscopy (HSV) and digital kymography (DKG) in the vocal fold vibration analysis of normophonic women with no vocal fold abnormalities. Study Design. Prospective study comparing quantitative parameters extracted by HSV and DKG. Methods. Eighteen normophonic women whose age ranged from 18 to 45 years participated in the study. The procedures comprised HSV and DKG of the medial line of the vocal folds. The parameters evaluated were fundamental frequency (F 0), open quotient (OQ), and duration of vibration cycle phases (open phase, closed phase, opening phase, and closing phase). Results. The F 0 results of HSVand DKG were similar. However, significant differences were found in both duration of vibration cycle phases and OQ, indicating a longer open phase in the vocal fold vibration when this phase was measured by HSV. Conclusions. The results emphasize the need to set up different normative threshold values for both HSV and DKG.
Graphical Evaluation of Vocal Fold Vibratory Patterns by High-Speed Videolaryngoscopy
Journal of Voice, 2014
Objective. To characterize the voice and vocal fold function of an individual, it is essential to evaluate vocal fold vibration. The most widely used method for this purpose has been videolaryngoscopy. Methods. This article proposes a digital image processing algorithm to estimate the glottal area (ie, the space between the vocal folds) and produce graphs of the opening and closing phases of the glottal cycle. In eight subjects without voice disorders, vocal fold movements were recorded by high-speed videolaryngoscopy at 4000 frames per second. The video data were processed by a combination of image segmentation techniques that estimate the glottal area. The segmented area was used to construct the glottal waveform. Results. The graphs revealed important properties of vocal fold vibration, including amplitude, velocity, and other characteristics that have a major influence on voice quality. Conclusions. The combination of the high-speed technology with the proposed method improves the vocal fold analysis given a numerical feedback through graphical representation of the real vibratory patterns of the folds.
Journal of Voice, 2016
To quantify vibratory characteristics of various laryngeal disorders seen by high-speed digital imaging (HSDI). Methods. HSDI was performed on 78 patients with various laryngeal disorders (20 with polyp, 16 with carcinoma, 13 with leukoplakia, 6 with vocal fold nodule, and 33 with others) and 29 vocally healthy subjects. Obtained data were quantitatively evaluated by frame-by-frame analysis, laryngotopography, digital kymography, and glottal area waveform. Results. Overall, patients with laryngeal pathologies showed greater asymmetry in amplitude, mucosal wave and phase, smaller mucosal wave, and poorer glottal closure than vocally healthy subjects. Furthermore, disease-specific vibratory disturbances that generally agreed with the findings in the literature were quantified: comparing polyp with nodule, differences were noted in longitudinal phase difference, amplitude, and mucosal wave. In comparison with leukoplakia and cancer, nonvibrating area was more frequently noted in cancer. Conclusions. The HSDI analysis of various voice disorders using multiple methods can help phonosurgeons to properly diagnose various laryngeal pathologies and to estimate the degree of their vocal disturbances.
Vocal Fold Vibrations: High-Speed Imaging, Kymography, and Acoustic Analysis: A Preliminary Report
The Laryngoscope, 2000
To evaluate a new analysis system, High-Speed Tool Box (H. Larsson, custom-made program for image analysis, version 1.1, Department of Logopedics and Phoniatrics, Huddinge University Hospital, Huddinge, Sweden, 1998) for studying vocal fold vibrations using a high-speed camera and to relate findings from these analyses to sound characteristics. A Weinberger Speedcam + 500 system (Weinberger AG, Dietikon, Switzerland) was used with a frame rate of 1,904 frames per second. Images were stored and analyzed digitally. Analysis included automatic glottal edge detection and calculation of glottal area variations, as well as kymography. These signals were compared with acoustic waveforms using the Soundswell program (Hitech Development AB, Stockholm, Sweden). The High-Speed Tool Box was applied on two types of high-speed recordings: a diplophonic phonation and a tremor voice. Relations between glottal vibratory patterns and the sound waveform were analyzed. In the diplophonic phonation, the glottal area waveform, as well as the kymogram, showed a specific pattern of repetitive glottal closures, which was also seen in the acoustic waveform. In the tremor voice, fundamental frequency (F0) fluctuations in the acoustic waveform were reflected in slow variations in amplitude in the glottal area waveform. For studying details of mucosal movements during these kinds of abnormal vibrations, the glottal area waveform was particularly useful. Our results suggest that this combined high-speed acoustic-kymographic analysis package is a promising aid for separating and specifying different voice qualities such as diplophonia and voice tremor. Apart from clinical use, this finding should be of help for specification of the terminology of different voice qualities.
Videokymography: a New High-Speed Method for the Examination of Vocal-Fold Vibrations
Otorinolaryngologie a Foniatrie
Videokymography is a new optical high-speed method for investigation of vibrations which was developed especially for examination of vocal-fold vibrations. Videokymography is based on a modified CCD video camera, which is able to work in two different modes: standard and high-speed. In the standard mode the camera works as a normal commercial video camera providing 25 images (50 interlaced fields) per second. In the high-speed mode the camera delivers images from a single selected line with a frequency of almost 8000 line images/s. The successive line images are put below each other, creating a new videokymographic image monitoring vibration of the selected part of the vocal folds in time. A foot switch makes it possible to change instantaneously between the standard and high-speed modes. Ordinary videolaryngoscopic equipment with a powerful endoscopic continuous-light source can be used for the videokymographic examination of the vocal folds. Both the standard as well as high-speed images can be recorded by means of a normal video recorder, which makes the technique cost-friendly. The method is able to objectively evaluate important parameters of the vocal-fold vibration, such as the open, opening, closing and closed phases of the vibratory cycle, propagation of mucosal waves, left-right differences in phase or amplitude, etc. Videokymography provides more detailed information on voice disorders and considerably enriches laryngostroboscopy. There is no problem for videokymography to evaluate hoarse or unstable voices in which laryngostroboscopy fails. Also, the method is able to reveal structural irregularities on the medial surface of the vocal fold (e.g., sulcus glottidis) which can easily be overlooked in laryngostroboscopy. It is demonstrated how combination of a few laryngostroboscopic and videokymographic images can shortly and efficiently summarize important anatomical, physiological/pathological and vibrational properties of the laryngeal tissues in various patients.
Glottal flow waveforms estimated by inverse filtering acoustic speech pressure signals were compared to glottal area functions obtained by digital high-speed imaging of the vocal fold vibration. Speech data consisted of breathy, normal and pressed phonations produced by two male and one female subjects. The results yield both qualitative and quantitative information about the relationship between the glottal flow and the corresponding area function. It was shown, for example, that a distinct knee in the glottal flow waveform in the opening phase corresponds to the abrupt opening of the vocal folds in normal and pressed phonation. In addition, the obtained quantitative data corroborates known theoretical considerations according to which the shape of the glottal flow is more asymmetric than the corresponding area function.
Assessment of vocal fold Phonatory function A Review
This article attempts to review the various im-aging modalities that can be used to assess vo-cal fold function including development of indi-rect laryngoscopy, direct laryngoscopy, video laryngoscopy, stroboscopy etc. Focus of this article is primarily on imaging approaches for observing / documenting vibratory behaviour of vocal folds during phonation.
IEEE Transactions on Medical Imaging, 2000
Endoscopic high-speed laryngoscopy in combination with image analysis strategies is the most promising approach to investigate the interrelation between vocal fold vibrations and voice disorders. So far, due to the lack of an objective and standardized analysis procedure a unique characterization of vocal fold vibrations has not been achieved yet. We present a visualization and analysis strategy which transforms the segmented edges of vibrating vocal folds into a single 2-D image, denoted Phonovibrogram (PVG). Within a PVG the individual type of vocal fold vibration becomes uniquely characterized by specific geometric patterns. The PVG geometries give an intuitive access on the type and degree of the laryngeal asymmetry and can be quantified using an image segmentation approach. The PVG analysis was applied to 14 representative recordings derived from a high-speed database comprising normal and pathological voices. We demonstrate that PVGs are capable to differentiate and quantify different types of normal and pathological vocal fold vibrations. The objective and precise quantification of the PVG geometry may have the potential to realize a novel classification of vocal fold vibrations.