UMIT GUZ | Işik University (original) (raw)
Papers by UMIT GUZ
Springer eBooks, Aug 18, 2010
A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results U... more A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results Umit Guz1, Hakan Gurkan1, and B. Siddik Yarman2 1 Isik University, Engineering Faculty, Department of Electronics Engineering, Sile, Istanbul, Turkey 2 Istanbul University, Engineering ...
2018 IEEE Spoken Language Technology Workshop (SLT), 2018
The objective of this work is to develop effective multiview semi-supervised machine learning str... more The objective of this work is to develop effective multiview semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The prop... more In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The proposed method is based on the generation Classified Signature and Envelope Vector Sets (CSEVS) by using an effective k-means clustering algorithm. In this work, on a frame basis, any EEG signal is modeled by multiplying three parameters as called the Classified Signature Vector, Classified Envelope Vector, and Frame-Scaling Coefficient. In this case, EEG signal for each frame is described in terms of the two indices R and K of CSEVS and the frame-scaling coefficient. The proposed method is assessed through the use of root-mean-square error (RMSE) and visual inspection measures. The proposed method achieves good compression ratios with low level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.
Signal, Image and Video Processing, 2018
In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition devi... more In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100$$\%$$% (45 out of 45), and 98.33$$\%$$% (59 out of 60) for 30, 45, and 60 subjects, respectively.
IEEE Transactions on Audio, Speech, and Language Processing, 2010
A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks, 2011
In this paper, a novel image compression method based on generation of the so-called classified e... more In this paper, a novel image compression method based on generation of the so-called classified energy and pattern blocks (CEPB) is introduced and evaluation results are presented. The CEPB is constructed using the training images and then located at both the transmitter and receiver sides of the communication system. Then the energy and pattern blocks of input images to be reconstructed are determined by the same way in the construction of the CEPB. This process is also associated with a matching
procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.
Digital Signal Processing
Pattern Recognition Letters
The primary objective of sentence segmentation process is to determine the sentence boundaries of... more The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
Signal, Image and Video Processing
Computer Speech Language, Apr 1, 2010
There are many speech and language processing problems which require cascaded classification task... more There are many speech and language processing problems which require cascaded classification tasks. While model adaptation has been shown to be useful in isolated speech and language processing tasks, it is not clear what constitutes system adaptation for such complex systems. This paper studies the following questions: In cases where a sequence of classification tasks is employed, how important is to adapt the earlier or latter systems? Is the performance improvement obtained in the earlier stages via adaptation carried on to later stages in cases where the later stages perform adaptation using similar data and/or methods? In this study, as part of a larger scale multiparty meeting understanding system, we analyze various methods for adapting dialog act segmentation and tagging models trained on conversational telephone speech (CTS) to meeting style conversations. We investigate the effect of using adapted and unadapted models for dialog act segmentation with those of tagging, showing the effect of model adaptation for cascaded classification tasks. Our results indicate that we can achieve significantly better dialog act segmentation and tagging by adapting the out-of-domain models, especially when the amount of in-domain data is limited. Experimental results show that it is more effective to adapt the models in the latter classification tasks, in our case dialog act tagging, when dealing with a sequence of cascaded classification tasks.
International Symposium on Signals, Circuits and Systems ISSCS2013, 2013
In this work, a new biomedical image compression method is proposed based on the classified energ... more In this work, a new biomedical image compression method is proposed based on the classified energy and pattern blocks (CEPB). CEPB based compression method is specifically applied on the Computed Tomography (CT) images and the evaluation results are presented. Essentially, the CEPB is uniquely designed and structured codebook which is located on the both the transmitter and receiver part of a communication system in order to implement encoding and decoding processes. The encoding parameters are block scaling coefficient (BSC) and the index numbers of energy (IE) and pattern blocks (IP) determined for each block of the input images based on the CEPB. The evaluation results show that the newly proposed method provides considerable image compression ratios and image quality.
Springer eBooks, Aug 18, 2010
A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results U... more A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results Umit Guz1, Hakan Gurkan1, and B. Siddik Yarman2 1 Isik University, Engineering Faculty, Department of Electronics Engineering, Sile, Istanbul, Turkey 2 Istanbul University, Engineering ...
2018 IEEE Spoken Language Technology Workshop (SLT), 2018
The objective of this work is to develop effective multiview semi-supervised machine learning str... more The objective of this work is to develop effective multiview semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The prop... more In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The proposed method is based on the generation Classified Signature and Envelope Vector Sets (CSEVS) by using an effective k-means clustering algorithm. In this work, on a frame basis, any EEG signal is modeled by multiplying three parameters as called the Classified Signature Vector, Classified Envelope Vector, and Frame-Scaling Coefficient. In this case, EEG signal for each frame is described in terms of the two indices R and K of CSEVS and the frame-scaling coefficient. The proposed method is assessed through the use of root-mean-square error (RMSE) and visual inspection measures. The proposed method achieves good compression ratios with low level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.
Signal, Image and Video Processing, 2018
In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition devi... more In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100$$\%$$% (45 out of 45), and 98.33$$\%$$% (59 out of 60) for 30, 45, and 60 subjects, respectively.
IEEE Transactions on Audio, Speech, and Language Processing, 2010
A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks, 2011
In this paper, a novel image compression method based on generation of the so-called classified e... more In this paper, a novel image compression method based on generation of the so-called classified energy and pattern blocks (CEPB) is introduced and evaluation results are presented. The CEPB is constructed using the training images and then located at both the transmitter and receiver sides of the communication system. Then the energy and pattern blocks of input images to be reconstructed are determined by the same way in the construction of the CEPB. This process is also associated with a matching
procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.
Digital Signal Processing
Pattern Recognition Letters
The primary objective of sentence segmentation process is to determine the sentence boundaries of... more The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.
Signal, Image and Video Processing
Computer Speech Language, Apr 1, 2010
There are many speech and language processing problems which require cascaded classification task... more There are many speech and language processing problems which require cascaded classification tasks. While model adaptation has been shown to be useful in isolated speech and language processing tasks, it is not clear what constitutes system adaptation for such complex systems. This paper studies the following questions: In cases where a sequence of classification tasks is employed, how important is to adapt the earlier or latter systems? Is the performance improvement obtained in the earlier stages via adaptation carried on to later stages in cases where the later stages perform adaptation using similar data and/or methods? In this study, as part of a larger scale multiparty meeting understanding system, we analyze various methods for adapting dialog act segmentation and tagging models trained on conversational telephone speech (CTS) to meeting style conversations. We investigate the effect of using adapted and unadapted models for dialog act segmentation with those of tagging, showing the effect of model adaptation for cascaded classification tasks. Our results indicate that we can achieve significantly better dialog act segmentation and tagging by adapting the out-of-domain models, especially when the amount of in-domain data is limited. Experimental results show that it is more effective to adapt the models in the latter classification tasks, in our case dialog act tagging, when dealing with a sequence of cascaded classification tasks.
International Symposium on Signals, Circuits and Systems ISSCS2013, 2013
In this work, a new biomedical image compression method is proposed based on the classified energ... more In this work, a new biomedical image compression method is proposed based on the classified energy and pattern blocks (CEPB). CEPB based compression method is specifically applied on the Computed Tomography (CT) images and the evaluation results are presented. Essentially, the CEPB is uniquely designed and structured codebook which is located on the both the transmitter and receiver part of a communication system in order to implement encoding and decoding processes. The encoding parameters are block scaling coefficient (BSC) and the index numbers of energy (IE) and pattern blocks (IP) determined for each block of the input images based on the CEPB. The evaluation results show that the newly proposed method provides considerable image compression ratios and image quality.