Klara Ceberio Berger - Academia.edu
Papers by Klara Ceberio Berger
Gramatika Jaietan: Patxi Goenagaren Omenez, ISBN 978-84-9860-085-8, pp. 153-172, 2008
Gogoa Euskal Herriko Unibersitateko Hizkuntza Ezagutza Komunikazio Eta Ekintzari Buruzko Aldizkaria, 2005
... will be taken into account. To give a brief definition, an anaphor is an element that refers to something mentioned earlier in the discourse, used in some cases to keep the text from becoming repetitive. This work Lengoaia ...
Applied Linguistics Now [Recurso electrónico]: Understanding Language and Mind / La Lingüística Aplicada Actual: Comprendiendo el Lenguaje y la Mente, ISBN 978-84-692-1479-4, pp. 1319-1332, 2009
Lecture Notes in Computer Science, 2010
In this paper we present the first machine learning approach to resolving pronominal anaphora in Basque. We consider different classifiers in order to find the system that best fits the characteristics of the language. We do not restrict our study to the classifiers typically used for this task; we also consider others, such as Random Forest and VFI, in order to make a general comparison. We determine the feature vector obtained with our linguistic processing system and analyze the contribution of different subsets of features, as well as the weight of each feature used in the task.
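The idea of measuring the contribution of feature subsets can be sketched as a leave-one-feature-out ablation. The abstract does not list the actual features, classifiers, or evaluation protocol, so everything below is an assumption for illustration: the three feature names are hypothetical, the toy data is invented, and a lookup-table majority classifier scored on its own training data stands in for the real classifiers (Random Forest, VFI, and so on).

```python
from collections import Counter, defaultdict

# Hypothetical binary features of a (pronoun, candidate antecedent) pair.
FEATURES = ("agree", "same_sentence", "is_subject")

# Invented toy data: one row per candidate pair, label 1 = true antecedent.
ROWS = [
    (1, 1, 1), (1, 1, 0), (1, 0, 1), (1, 0, 0),
    (0, 1, 1), (0, 1, 0), (0, 0, 1), (0, 0, 0),
]
LABELS = [1, 1, 1, 0, 0, 0, 0, 0]


def table_accuracy(rows, labels, keep):
    """Training accuracy of a majority-vote lookup table over the kept features."""
    groups = defaultdict(Counter)
    for row, label in zip(rows, labels):
        key = tuple(row[i] for i in keep)
        groups[key][label] += 1
    # Each distinct feature combination predicts its most frequent label.
    correct = sum(max(counts.values()) for counts in groups.values())
    return correct / len(rows)


def ablation(rows, labels):
    """Return the full-feature accuracy and the drop caused by removing each feature."""
    all_idx = list(range(len(FEATURES)))
    full = table_accuracy(rows, labels, all_idx)
    drops = {}
    for i, name in enumerate(FEATURES):
        keep = [j for j in all_idx if j != i]
        drops[name] = full - table_accuracy(rows, labels, keep)
    return full, drops


full, drops = ablation(ROWS, LABELS)
print(full)   # 1.0
print(drops)  # {'agree': 0.375, 'same_sentence': 0.125, 'is_subject': 0.125}
```

On this toy data, removing the hypothetical agreement feature costs the most accuracy, which is the kind of ranking an ablation is meant to expose. A real study would score each subset on held-out data rather than on the training set.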
We present a new morphological processor for Biscayan, a dialect of Basque, built on the morphological description of standard Basque. The database for the standard morphology has been extended to cover dialects, and foma, an open-source tool for morphological description, is used to build the processor. XuxenB, a spelling checker/corrector for this dialect, is the first application of this work.
This paper describes the process followed in annotating pronominal anaphora in the Eus3LB corpus of Basque. Our aim is to use this annotation as the basis for later computational treatment of our language. We present the linguistic analysis carried out, the criteria defined for the tagging, and some relevant linguistic conclusions about the features of the antecedents needed to link them correctly to their anaphoric elements.
El Valor de la Diversidad Lingüística: Actas del VIII Congreso de Lingüística General, ISBN 978-84-691-4124-3, p. 26, 2008
Basque is a highly inflected, agglutinative language. Two-level morphology has been applied successfully to languages of this kind, and two-level descriptions exist for very different languages. Once the morphological description of a language is done, it is easy to develop a spelling checker/corrector for it. However, what happens if we want to use the speller in the "free world" (OpenOffice, Mozilla, emacs, LaTeX, ...)? Ispell and similar tools (aspell, hunspell, myspell) are the usual mechanisms for these purposes, but they do not fit the two-level model. In the absence of mechanisms based on two-level morphology, this paper describes an automatic conversion from a two-level description to hunspell.
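To make the target of such a conversion concrete, the sketch below writes a tiny stem-plus-suffix lexicon in hunspell's affix (.aff) and dictionary (.dic) text formats. It is not the conversion procedure from the paper: it assumes a flat suffix list with no phonological rules, which is exactly the simplification a two-level description does not make. The function names `to_hunspell` and `expand` are hypothetical; the Basque forms etxe, etxea, and etxeak ("house", "the house", "the houses") serve as the example.

```python
def to_hunspell(stems, suffixes, flag="A"):
    """Render stems and suffixes as hunspell .aff and .dic file contents."""
    # Header line: SFX <flag> <cross-product Y/N> <number of rules>.
    aff = [f"SFX {flag} Y {len(suffixes)}"]
    # Rule line: SFX <flag> <strip> <add> <condition>; "0" strips nothing
    # and "." matches any stem.
    aff += [f"SFX {flag} 0 {suffix} ." for suffix in suffixes]
    # The .dic file starts with the entry count, then one stem/flag per line.
    dic = [str(len(stems))] + [f"{stem}/{flag}" for stem in stems]
    return "\n".join(aff), "\n".join(dic)


def expand(stems, suffixes):
    """List the word forms the generated rules accept (bare stem included)."""
    return sorted({stem + s for stem in stems for s in [""] + list(suffixes)})


aff_text, dic_text = to_hunspell(["etxe"], ["a", "ak"])
print(aff_text)
print(dic_text)
print(expand(["etxe"], ["a", "ak"]))  # ['etxe', 'etxea', 'etxeak']
```

Affix rules of this kind attach surface strings directly to stems, so any alternation at a morpheme boundary has to be spelled out through the strip and condition fields. That mismatch with the two-level model is why an automatic conversion is needed.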
Lecture Notes in Computer Science, 2010
In this paper we present a machine learning approach to resolving pronominal anaphora in Basque. We consider different classifiers in order to find the system that best fits the characteristics of the language. We apply a combination of classifiers, which improves on the results obtained with single classifiers. The main contribution of the paper is the use of bagging with a non-soft base classifier for anaphora resolution in Basque.
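Bagging in general can be sketched as follows: train several copies of a base classifier on bootstrap resamples of the training set and combine their predictions by majority vote. The abstract does not name the base classifier, so this sketch assumes a one-feature threshold rule (a decision stump) that outputs hard labels, and reads "non-soft" as "label-only, without class probabilities". The feature layout and the toy data are invented for illustration.

```python
import random
from collections import Counter

# Invented toy data: (number_agreement, sentence_distance) per candidate pair,
# label 1 = true antecedent. Both feature names are hypothetical.
ROWS = [(1, 0), (1, 1), (1, 2), (1, 0), (0, 1), (0, 2), (0, 3), (0, 1)]
LABELS = [1, 1, 1, 1, 0, 0, 0, 0]


def train_stump(rows, labels):
    """Fit the single-feature threshold rule with the fewest training errors."""
    best = None
    for f in range(len(rows[0])):
        for threshold in sorted({r[f] for r in rows}):
            for above in (0, 1):  # label assigned when value >= threshold
                preds = [above if r[f] >= threshold else 1 - above for r in rows]
                errors = sum(p != y for p, y in zip(preds, labels))
                if best is None or errors < best[0]:
                    best = (errors, f, threshold, above)
    _, f, threshold, above = best
    # The returned model outputs a hard label, never a probability.
    return lambda row: above if row[f] >= threshold else 1 - above


def bagging_fit(rows, labels, n_models=15, seed=0):
    """Train one stump per bootstrap resample (sampling with replacement)."""
    rng = random.Random(seed)
    n = len(rows)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(train_stump([rows[i] for i in idx], [labels[i] for i in idx]))
    return models


def bagging_predict(models, row):
    """Combine the hard labels of all models by majority vote."""
    votes = Counter(model(row) for model in models)
    return votes.most_common(1)[0][0]


models = bagging_fit(ROWS, LABELS)
print([bagging_predict(models, row) for row in ROWS])
```

With an odd number of models and two classes the vote cannot tie. Because the base classifier returns only labels, the ensemble combines votes rather than averaging probabilities.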