UMIT GUZ | Işik University (original) (raw)

Papers by UMIT GUZ

Research paper thumbnail of A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results

Springer eBooks, Aug 18, 2010

A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results U... more A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results Umit Guz1, Hakan Gurkan1, and B. Siddik Yarman2 1 Isik University, Engineering Faculty, Department of Electronics Engineering, Sile, Istanbul, Turkey 2 Istanbul University, Engineering ...

Research paper thumbnail of Bürünsel, sözcüksel ve biçimbilgisel bilgiyi kullanan co-training ile Türkçe konuşma dilinin otomatik cümle bölütlemesi

Research paper thumbnail of Extension of Conventional Co-Training Learning Strategies to Three-View and Committee-Based Learning Strategies for Effective Automatic Sentence Segmentation

2018 IEEE Spoken Language Technology Workshop (SLT), 2018

The objective of this work is to develop effective multiview semi-supervised machine learning str... more The objective of this work is to develop effective multiview semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.

Research paper thumbnail of Elektroensefalogram (EEG) İşaretlerinin Sikiştirilmasinda Özgün Bir Yaklaşim A Novel Electroencephalogram (EEG) Data Compression Technique

In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The prop... more In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The proposed method is based on the generation Classified Signature and Envelope Vector Sets (CSEVS) by using an effective k-means clustering algorithm. In this work, on a frame basis, any EEG signal is modeled by multiplying three parameters as called the Classified Signature Vector, Classified Envelope Vector, and Frame-Scaling Coefficient. In this case, EEG signal for each frame is described in terms of the two indices R and K of CSEVS and the frame-scaling coefficient. The proposed method is assessed through the use of root-mean-square error (RMSE) and visual inspection measures. The proposed method achieves good compression ratios with low level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.

Research paper thumbnail of Biometric identification using fingertip electrocardiogram signals

Signal, Image and Video Processing, 2018

In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition devi... more In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100$$\%$$% (45 out of 45), and 98.33$$\%$$% (59 out of 60) for 30, 45, and 60 subjects, respectively.

Research paper thumbnail of Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech

IEEE Transactions on Audio, Speech, and Language Processing, 2010

Research paper thumbnail of A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks

A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks, 2011

In this paper, a novel image compression method based on generation of the so-called classified e... more In this paper, a novel image compression method based on generation of the so-called classified energy and pattern blocks (CEPB) is introduced and evaluation results are presented. The CEPB is constructed using the training images and then located at both the transmitter and receiver sides of the communication system. Then the energy and pattern blocks of input images to be reconstructed are determined by the same way in the construction of the CEPB. This process is also associated with a matching
procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.

Research paper thumbnail of A novel biometric identification system based on fingertip electrocardiogram and speech signals

Digital Signal Processing

Research paper thumbnail of The 24th International Symposium on Computer and Information Sciences, ISCIS 2009, is organized by Middle East Technical University, in the METU-Northern Cyprus Campus. There are 6 main tracks in ISCIS 2009: 1. Computer Vision, Pattern Recognition and Image Processing, chaired by Enis Çetin, Bilk...

Research paper thumbnail of Effective semi-supervised learning strategies for automatic sentence segmentation

Pattern Recognition Letters

The primary objective of sentence segmentation process is to determine the sentence boundaries of... more The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.

Research paper thumbnail of Compression of the biomedical images using quadtree-based partitioned universally classified energy and pattern blocks

Signal, Image and Video Processing

Research paper thumbnail of Cascaded model adaptation for dialog act segmentation and tagging

Computer Speech Language, Apr 1, 2010

There are many speech and language processing problems which require cascaded classification task... more There are many speech and language processing problems which require cascaded classification tasks. While model adaptation has been shown to be useful in isolated speech and language processing tasks, it is not clear what constitutes system adaptation for such complex systems. This paper studies the following questions: In cases where a sequence of classification tasks is employed, how important is to adapt the earlier or latter systems? Is the performance improvement obtained in the earlier stages via adaptation carried on to later stages in cases where the later stages perform adaptation using similar data and/or methods? In this study, as part of a larger scale multiparty meeting understanding system, we analyze various methods for adapting dialog act segmentation and tagging models trained on conversational telephone speech (CTS) to meeting style conversations. We investigate the effect of using adapted and unadapted models for dialog act segmentation with those of tagging, showing the effect of model adaptation for cascaded classification tasks. Our results indicate that we can achieve significantly better dialog act segmentation and tagging by adapting the out-of-domain models, especially when the amount of in-domain data is limited. Experimental results show that it is more effective to adapt the models in the latter classification tasks, in our case dialog act tagging, when dealing with a sequence of cascaded classification tasks.

Research paper thumbnail of Farkli Ses Kaynaklarindan Üreti̇len Temel Tanim Di̇zi̇leri̇ İle Konuşma İşaretleri̇ni̇n Modellenmesi̇

Research paper thumbnail of A New Modelling Method of the ECG Signals Based on the use of an Optimized Predefined Functional Database

Research paper thumbnail of A novel computed tomography image compression method based on classified energy and pattern blocks

International Symposium on Signals, Circuits and Systems ISSCS2013, 2013

In this work, a new biomedical image compression method is proposed based on the classified energ... more In this work, a new biomedical image compression method is proposed based on the classified energy and pattern blocks (CEPB). CEPB based compression method is specifically applied on the Computed Tomography (CT) images and the evaluation results are presented. Essentially, the CEPB is uniquely designed and structured codebook which is located on the both the transmitter and receiver part of a communication system in order to implement encoding and decoding processes. The encoding parameters are block scaling coefficient (BSC) and the index numbers of energy (IE) and pattern blocks (IP) determined for each block of the input images based on the CEPB. The evaluation results show that the newly proposed method provides considerable image compression ratios and image quality.

Research paper thumbnail of Konu��ma ����aretler��n��n Konu��macidan Ba��imsiz Genelle��t��r��lm���� ��nceden Tanimli Temel Ses Ve Zarf Vekt��r Bankalari ��le Opt��mum Modellenmes��

Research paper thumbnail of Medikal G��r��nt��lerin S��k����t��r��lmas��na Yeni Bir Yakla����m: SYMPES

Research paper thumbnail of Konu��ma ����aretler��n��n D��lden Ve Konu��macidan Ba��imsiz ��nceden Tanimli Temel Tanim Ve Zarf Fonks��yon Setler�� ��le Modellenmes��ne ��l����k��n Yen�� B��r Y��ntem

Research paper thumbnail of Elektrokard��yogram (Ekg) ����aretler��n��n Genelle��t��r��lm���� Temel Tanim Ve Zarf Vekt��r Bankalari ��le Modellenmes��

Research paper thumbnail of Temel Tanim Ve Zarf Vekt��r Bankalari ��le Elektrokard��yogram (Ekg) ����aretler��n��n Siki��tirilmasi

Research paper thumbnail of A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results

Springer eBooks, Aug 18, 2010

A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results U... more A Novel Image Compression Method Based on Classified Energy and Pattern Blocks: Initial Results Umit Guz1, Hakan Gurkan1, and B. Siddik Yarman2 1 Isik University, Engineering Faculty, Department of Electronics Engineering, Sile, Istanbul, Turkey 2 Istanbul University, Engineering ...

Research paper thumbnail of Bürünsel, sözcüksel ve biçimbilgisel bilgiyi kullanan co-training ile Türkçe konuşma dilinin otomatik cümle bölütlemesi

Research paper thumbnail of Extension of Conventional Co-Training Learning Strategies to Three-View and Committee-Based Learning Strategies for Effective Automatic Sentence Segmentation

2018 IEEE Spoken Language Technology Workshop (SLT), 2018

The objective of this work is to develop effective multiview semi-supervised machine learning str... more The objective of this work is to develop effective multiview semi-supervised machine learning strategies for sentence boundary classification problem when only small sets of sentence boundary labeled data are available. We propose three-view and committee-based learning strategies incorporating with co-training algorithms with agreement, disagreement, and self-combined learning strategies using prosodic, lexical and morphological information. We compare experimental results of proposed three-view and committee-based learning strategies to other semi-supervised learning strategies in the literature namely, self-training and co-training with agreement, disagreement, and self-combined strategies. The experiment results show that sentence segmentation performance can be highly improved using multi-view learning strategies that we propose since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average performance when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.

Research paper thumbnail of Elektroensefalogram (EEG) İşaretlerinin Sikiştirilmasinda Özgün Bir Yaklaşim A Novel Electroencephalogram (EEG) Data Compression Technique

In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The prop... more In this paper, a novel method to compress ElectroEncephaloGram (EEG) Signal is proposed. The proposed method is based on the generation Classified Signature and Envelope Vector Sets (CSEVS) by using an effective k-means clustering algorithm. In this work, on a frame basis, any EEG signal is modeled by multiplying three parameters as called the Classified Signature Vector, Classified Envelope Vector, and Frame-Scaling Coefficient. In this case, EEG signal for each frame is described in terms of the two indices R and K of CSEVS and the frame-scaling coefficient. The proposed method is assessed through the use of root-mean-square error (RMSE) and visual inspection measures. The proposed method achieves good compression ratios with low level reconstruction error while preserving diagnostic information in the reconstructed EEG signal.

Research paper thumbnail of Biometric identification using fingertip electrocardiogram signals

Signal, Image and Video Processing, 2018

In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition devi... more In this research work, we present a newly fingertip electrocardiogram (ECG) data acquisition device capable of recording the lead-1 ECG signal through the right- and left-hand thumb fingers. The proposed device is high-sensitive, dry-contact, portable, user-friendly, inexpensive, and does not require using conventional components which are cumbersome and irritating such as wet adhesive Ag/AgCl electrodes. One of the other advantages of this device is to make it possible to record and use the lead-1 ECG signal easily in any condition and anywhere incorporating with any platform to use for advanced applications such as biometric recognition and clinical diagnostics. Furthermore, we proposed a biometric identification method based on combining autocorrelation and discrete cosine transform-based features, cepstral features, and QRS beat information. The proposed method was evaluated on three fingertip ECG signal databases recorded by utilizing the proposed device. The experimental results demonstrate that the proposed biometric identification method achieves person recognition rate values of 100% (30 out of 30), 100$$\%$$% (45 out of 45), and 98.33$$\%$$% (59 out of 60) for 30, 45, and 60 subjects, respectively.

Research paper thumbnail of Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech

IEEE Transactions on Audio, Speech, and Language Processing, 2010

Research paper thumbnail of A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks

A Novel Image Compression Method Based on Classified Energy and Pattern Building Blocks, 2011

In this paper, a novel image compression method based on generation of the so-called classified e... more In this paper, a novel image compression method based on generation of the so-called classified energy and pattern blocks (CEPB) is introduced and evaluation results are presented. The CEPB is constructed using the training images and then located at both the transmitter and receiver sides of the communication system. Then the energy and pattern blocks of input images to be reconstructed are determined by the same way in the construction of the CEPB. This process is also associated with a matching
procedure to determine the index numbers of the classified energy and pattern blocks in the CEPB which best represents (matches) the energy and pattern blocks of the input images. Encoding parameters are block scaling coefficient and index numbers of energy and pattern blocks determined for each block of the input images. These parameters are sent from the transmitter part to the receiver part and the classified energy and pattern blocks associated with the index numbers are pulled from the CEPB. Then the input image is reconstructed block by block in the receiver part using a mathematical model that is proposed. Evaluation results show that the method provides considerable image compression ratios and image quality even at low bit rates.

Research paper thumbnail of A novel biometric identification system based on fingertip electrocardiogram and speech signals

Digital Signal Processing

Research paper thumbnail of The 24th International Symposium on Computer and Information Sciences, ISCIS 2009, is organized by Middle East Technical University, in the METU-Northern Cyprus Campus. There are 6 main tracks in ISCIS 2009: 1. Computer Vision, Pattern Recognition and Image Processing, chaired by Enis Çetin, Bilk...

Research paper thumbnail of Effective semi-supervised learning strategies for automatic sentence segmentation

Pattern Recognition Letters

The primary objective of sentence segmentation process is to determine the sentence boundaries of... more The primary objective of sentence segmentation process is to determine the sentence boundaries of a stream of words output by the automatic speech recognizers. Statistical methods developed for sentence segmentation requires a significant amount of labeled data which is time-consuming, labor intensive and expensive. In this work, we propose new multi-view semi-supervised learning strategies for sentence boundary classification problem using lexical, prosodic, and morphological information. The aim is to find effective semi-supervised machine learning strategies when only small sets of sentence boundary labeled data are available. We primarily investigate two semi-supervised learning approaches, called self-training and co-training. Different example selection strategies were also used for co-training, namely, agreement, disagreement and self-combined. Furthermore, we propose three-view and committee-based algorithms incorporating with agreement, disagreement and self-combined strategies using three disjoint feature sets. We present comparative results of different learning strategies on the sentence segmentation task. The experimental results show that the sentence segmentation performance can be highly improved using multi-view learning strategies that we proposed since data sets can be represented by three redundantly sufficient and disjoint feature sets. We show that the proposed strategies substantially improve the average baseline F-measure of 67.66% to 75.15% and 64.84% to 66.32% when only a small set of manually labeled data is available for Turkish and English spoken languages, respectively.

Research paper thumbnail of Compression of the biomedical images using quadtree-based partitioned universally classified energy and pattern blocks

Signal, Image and Video Processing

Research paper thumbnail of Cascaded model adaptation for dialog act segmentation and tagging

Computer Speech Language, Apr 1, 2010

There are many speech and language processing problems which require cascaded classification task... more There are many speech and language processing problems which require cascaded classification tasks. While model adaptation has been shown to be useful in isolated speech and language processing tasks, it is not clear what constitutes system adaptation for such complex systems. This paper studies the following questions: In cases where a sequence of classification tasks is employed, how important is to adapt the earlier or latter systems? Is the performance improvement obtained in the earlier stages via adaptation carried on to later stages in cases where the later stages perform adaptation using similar data and/or methods? In this study, as part of a larger scale multiparty meeting understanding system, we analyze various methods for adapting dialog act segmentation and tagging models trained on conversational telephone speech (CTS) to meeting style conversations. We investigate the effect of using adapted and unadapted models for dialog act segmentation with those of tagging, showing the effect of model adaptation for cascaded classification tasks. Our results indicate that we can achieve significantly better dialog act segmentation and tagging by adapting the out-of-domain models, especially when the amount of in-domain data is limited. Experimental results show that it is more effective to adapt the models in the latter classification tasks, in our case dialog act tagging, when dealing with a sequence of cascaded classification tasks.

Research paper thumbnail of Farkli Ses Kaynaklarindan Üreti̇len Temel Tanim Di̇zi̇leri̇ İle Konuşma İşaretleri̇ni̇n Modellenmesi̇

Research paper thumbnail of A New Modelling Method of the ECG Signals Based on the use of an Optimized Predefined Functional Database

Research paper thumbnail of A novel computed tomography image compression method based on classified energy and pattern blocks

International Symposium on Signals, Circuits and Systems ISSCS2013, 2013

In this work, a new biomedical image compression method is proposed based on the classified energ... more In this work, a new biomedical image compression method is proposed based on the classified energy and pattern blocks (CEPB). CEPB based compression method is specifically applied on the Computed Tomography (CT) images and the evaluation results are presented. Essentially, the CEPB is uniquely designed and structured codebook which is located on the both the transmitter and receiver part of a communication system in order to implement encoding and decoding processes. The encoding parameters are block scaling coefficient (BSC) and the index numbers of energy (IE) and pattern blocks (IP) determined for each block of the input images based on the CEPB. The evaluation results show that the newly proposed method provides considerable image compression ratios and image quality.

Research paper thumbnail of Konu��ma ����aretler��n��n Konu��macidan Ba��imsiz Genelle��t��r��lm���� ��nceden Tanimli Temel Ses Ve Zarf Vekt��r Bankalari ��le Opt��mum Modellenmes��

Research paper thumbnail of Medikal G��r��nt��lerin S��k����t��r��lmas��na Yeni Bir Yakla����m: SYMPES

Research paper thumbnail of Konu��ma ����aretler��n��n D��lden Ve Konu��macidan Ba��imsiz ��nceden Tanimli Temel Tanim Ve Zarf Fonks��yon Setler�� ��le Modellenmes��ne ��l����k��n Yen�� B��r Y��ntem

Research paper thumbnail of Elektrokard��yogram (Ekg) ����aretler��n��n Genelle��t��r��lm���� Temel Tanim Ve Zarf Vekt��r Bankalari ��le Modellenmes��

Research paper thumbnail of Temel Tanim Ve Zarf Vekt��r Bankalari ��le Elektrokard��yogram (Ekg) ����aretler��n��n Siki��tirilmasi