Rafael Glauber | UFBA - Federal University of Bahia (original) (raw)

Rafael Glauber

Uploads

Papers by Rafael Glauber

A quantidade de textos em linguagem natural disponível na Internet vem aumentando os desafios do ... more A quantidade de textos em linguagem natural disponível na Internet vem aumentando os desafios do seu processamento automatizado. Diversas abordagens de Extração da Informação estão sendo propostas, principalmente no que cerne as entidades e as suas relações. Extrair relações abertas, ou seja, sem um conhecimento pré-determinado, tem sido um dos importantes desafios do processamento de textos na Internet. A abordagem aberta pode ser dividida em duas etapas: (i) extração e (ii) classificação. Porém, os trabalhos relacionados apresentam a dependência do idioma em ambas as etapas. Assim, este trabalho propõe um conjunto de features usado na etapa de classificação que não utilizam termos presentes em um idioma específico, tornado o método de classificação independente de idioma. Experimentos foram realizados em três diferentes corpora com sentenças extraídas da Web, Wikipédia e do New York Times (em Inglês) e os resultados apresentados neste artigo foram promissores para o direcionamento da pesquisa.

Open Information Extraction (Open IE) enables the extraction of facts in large quantities of text... more Open Information Extraction (Open IE) enables the extraction of facts in large quantities of texts written in natural language. Despite the fact that almost research has been doing in English texts, methods and techniques for other languages have been less frequent. However, those languages other than English correspond to 48% of content available on websites around the world. In this work, we propose a method for extracting facts in Portuguese without predetermining the types of the facts. Additionally, we increased the quantity of those extracted facts by the use of an inference approach. Our inference method is composed of two issues: a transitive and a symmetric mechanism. To the best of our knowledge, this is the first time that inference approach is used to extract facts in Portuguese texts. Our proposal allowed an increase of 36% in quantity of valid facts extracted in a Portuguese Open IE system, and it is compatible in the quality of facts with English approaches.

Recommender systems are data filtering systems that sug- gest data items of interest by predictin... more Recommender systems are data filtering systems that sug- gest data items of interest by predicting user preferences. In this pa- per, we describe the recommender system developed by the team named uefs.br for the offline competition of the15th ECML PKDD Discovery Challenge 2013 on building a recommendation system for given names. The proposed system is a hybrid recommender system that

Rafael Glauber | UFBA - Federal University of Bahia (original) (raw)

Uploads

Papers by Rafael Glauber

Log In