Daniela Claro - Academia.edu
Papers by Daniela Claro
Proceedings of the 24th International Conference on Enterprise Information Systems
This article presents the computerization of the Linguistic Atlas of Brazil Project (Projeto Atlas Linguístico do Brasil, ALiB), which introduced management of the data collected in the field and made those data available, with the aim of disseminating the information gathered through interviews with informants within the ALiB Project, drawing on the theoretical contributions of Dialectology and Computer Science. This process has been taking shape gradually because of the importance of the data, which were collected manually over almost two decades. As for methodological procedures, two stages were clearly defined: modeling the database and developing the ALiBWeb system, which is in its second version and under validation before being made freely available on the Internet. This version stands out for the generation of linguistic maps in the system by linguists, which will be made publicly available. The sharing of this information, nationally and internationally, is one of the...
Proceedings of the XV Brazilian Symposium on Information Systems, 2019
Interoperability is the ability of heterogeneous systems to communicate with one another transparently. It is usually classified into syntactic, semantic, and pragmatic levels. The syntactic level concerns the grammar and vocabulary of the messages exchanged, the semantic level concerns the meaning of the data, and the pragmatic level concerns the understanding of the messages sent and received. A set of systems is pragmatically interoperable when they share the same expectations about the effect of the messages exchanged between them. Because definitions vary widely and there is no consensus, providing a pragmatic interoperability solution is a challenge. In this paper, we propose a conceptual framework that aims to contribute to unifying the concept of pragmatic interoperability and the common elements necessary for its realization. To this end, a unified definition and a conceptual framework are presented. The framework was applied in three different scenarios to demonstrate its applicability and, consequently, to validate the unified concept.
Proceedings of the Brazilian Symposium on Multimedia and the Web, 2021
Nowadays, many organizations store and publish their data and services based on the Cloud Computing paradigm. In this scenario, cloud consumers access these resources anytime and anywhere. Software as a Service (SaaS) and Data as a Service (DaaS) are examples of cloud services. While DaaS delivers and manages data on demand, SaaS is a delivery model for applications in a cloud environment. However, the vast amount of social data and applications gives rise to different DaaS formats, such as unstructured (e.g., text), semi-structured (e.g., JSON), and structured (e.g., relational databases). The lack of standardization, and the resulting lack of interoperability among providers, makes users dependent on a single system. Interoperability is the ability of heterogeneous systems to communicate transparently, and it is classified into syntactic, semantic, and pragmatic levels. Middleware for SaaS and DaaS (MIDAS) is a solution that provides interoperability among cloud services. Although the latest version of MIDAS promotes a semantic approach, pragmatic aspects are not addressed. This paper enhances MIDAS to provide pragmatic interoperability in a cloud environment. Our approach presents the elements that MIDAS must consider to provide pragmatic interoperability among cloud services. We conduct a set of experiments to validate our pragmatic MIDAS, evaluating the overhead of our approach, the correctness of the new MIDAS, and the effort required to implement the MIDAS middleware with dynamic pragmatic information. The results indicate that our approach is a step towards pragmatic interoperability among cloud services.
IET Software, 2017
Leptospirosis is a potentially life-threatening disease primarily affecting low-income populations, with an estimated annual incidence of 1.03 million infections worldwide. Its symptoms are often confused with those of other febrile syndromes, such as dengue fever, influenza and viral hepatitis, which makes diagnosis challenging. Improving the accuracy of early diagnosis of patients with leptospirosis will speed up the delivery of appropriate antibiotic treatment, and both will improve clinical outcomes for this potentially fatal disease. The authors analysed clinically and epidemiologically defined leptospirosis cases to predict the disease using data mining classification algorithms. They conducted four sets of experiments to evaluate the performance of the algorithms, assessing their predictive accuracy using different training and test datasets. The JRIP algorithm achieved 84% sensitivity using a dataset of only confirmed leptospirosis cases, and a specificity of 99% using a dataset of only confirmed dengue cases. Therefore, the approach successfully predicted leptospirosis cases, differentiated them from similar febrile illnesses, and may represent a new tool to assist health professionals, particularly in endemic areas for leptospirosis, accelerating targeted treatment and minimising disease exacerbation and mortality.
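The sensitivity and specificity figures quoted in this abstract follow the standard confusion-matrix definitions. The following is a minimal sketch of those two formulas only; the counts are hypothetical values chosen for illustration and are not taken from the study.

```python
def sensitivity(tp: int, fn: int) -> float:
    """True positive rate: share of actual disease cases the classifier flags."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """True negative rate: share of non-cases the classifier correctly rejects."""
    return tn / (tn + fp)


# Hypothetical counts, for illustration only (not the paper's data).
tp, fn = 30, 10   # confirmed cases: correctly flagged vs. missed
tn, fp = 45, 5    # other febrile illnesses: correctly rejected vs. false alarms

print(f"sensitivity = {sensitivity(tp, fn):.2f}")
print(f"specificity = {specificity(tn, fp):.2f}")
```

A dataset that contains only confirmed cases can yield sensitivity but not specificity (there are no negatives to reject), which is consistent with the abstract reporting the two figures on separate datasets.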
2016 11th Iberian Conference on Information Systems and Technologies (CISTI), 2016
Leptospirosis is a disease that mainly affects low-income populations, with an incidence of 500,000 cases per year worldwide[1]. Its symptoms are often confused with those of other febrile syndromes, such as dengue, influenza and viral hepatitis. Improved diagnosis of patients with leptospirosis is very important for health professionals, for epidemiological surveillance and, above all, for the rapid evaluation and appropriate treatment of patients. In this work, data mining classification techniques were analyzed by evaluating algorithms based on Decision Trees, Classification Rules and Bayesian Classification. Of these, JRip was the model with the best performance, yielding 85% sensitivity and 81% specificity. The algorithms successfully predicted the disease and may represent a new tool to assist health professionals in the daily hospital routine, especially in endemic areas for leptospirosis, accelerating targeted treatment and minimizing disease exacerbation and mortality.
Anais Estendidos do XXVIII Simpósio Brasileiro de Sistemas Multimídia e Web (WebMedia 2022)
Making annotated corpora available is an important task in Open Information Extraction (Open IE). However, it is a difficult task because it demands manual work from annotators. It becomes even harder for Portuguese, given the language's complexity and the lack of an existing framework for annotation tasks in this language. Tools that can speed up this process are of great value for building knowledge in this area. This work proposed a tool to assist in building annotated corpora through the annotation and identification of new relational triples in sentences. For validation, two groups were defined: an expert group, composed of three specialists in the task, and a control group, composed of individuals with no knowledge of the process, for usability testing of the tool. The tool was used to annotate a corpus in Portuguese, but no impediment was identified to the use of...
Estudos Linguísticos e Literários, 2021
This article presents a comparative analysis, with a quantitative and diatopic approach, of terms recorded in the Atlas Linguístico Galego against those collected from Twitter. Specifically, it aims to analyze the vitality of the terms in the ALGa (volume V), checking whether those terms are still used to communicate in tweets. To that end, a specific methodology was developed and tested with the selected data. The results show that it is possible to analyze the vitality of some terms, but that some methodological adjustments are needed to achieve the objective with the ALGa terms.
Grouping by similarity is a significant step in strategies for Web Service discovery and composition. Many clustering methods process service descriptions written in natural language to estimate the degree of correlation between them. However, relying on knowledge bases in specific languages limits the applicability of these methods. In this paper we analyze language-independent methods for grouping similar Web Services using their natural language descriptions. In particular, we applied Latent Semantic Indexing (LSI), a language-independent Information Retrieval (IR) method. Moreover, an experimental analysis was performed with three similarity measures to determine which is best suited to detecting duplicate Web Services from service descriptions in two languages.
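LSI projects term vectors into a lower-dimensional space with a truncated SVD and then compares documents using a similarity measure. The abstract does not name the three measures that were tested, so the sketch below assumes cosine similarity, a common choice, and applies it to raw term-frequency vectors; the SVD projection is deliberately omitted, and the service descriptions are invented examples.

```python
import math
from collections import Counter


def cosine(desc_a: str, desc_b: str) -> float:
    """Cosine similarity between two descriptions, using whitespace tokens."""
    va = Counter(desc_a.lower().split())
    vb = Counter(desc_b.lower().split())
    # Dot product over the terms the two descriptions share.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty description is similar to nothing
    return dot / (norm_a * norm_b)


# Invented service descriptions, for illustration only.
s1 = "returns the weather forecast for a city"
s2 = "returns the weather forecast for a given city"
s3 = "books a flight ticket"

print(cosine(s1, s2))  # high: near-duplicate descriptions
print(cosine(s1, s3))  # low: only the word "a" is shared
```

Because the tokens are compared literally, this simplified version cannot match descriptions written in different languages; relating terms that co-occur across documents is what the LSI projection would add.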
Feature interaction is an undesirable interaction between the services of a composition that may violate the user's functional and non-functional requirements. Due to the dynamic nature, heterogeneity and openness of web services, solving feature interaction is a complex task: it is difficult to control services developed by different vendors, and there is no access to their implementations. The great challenge is feature interaction prevention. It can be done in online or offline mode; however, no existing work performs prevention in online mode. In this article, an autonomic mechanism based on neural networks and genetic algorithms is proposed to prevent feature interaction in web service composition. The results demonstrate a reaction time and accuracy appropriate for monitoring and detecting the causes of feature interaction in order to facilitate prevention.
I GranDSI-BR: Grandes Desafios da Pesquisa em Sistemas de Informação no Brasil para o período de 2016 a 2026, 2017
Dependency Parsers (DP) analyze the dependencies between words in a sentence. Currently, dependency parser evaluation is a problem whose solutions are not well defined in the scientific community. Although intrinsic DP metrics are the foremost choice for evaluation, extrinsic evaluation offers a different assessment based on a downstream task. DP results can vary according to the domain task. Thus, this work applies an Open Information Extraction (OIE) method for Portuguese to provide an extrinsic evaluation of a set of CoNLL Dependency Parsers. Our results demonstrate that the evaluation of Dependency Parsers differs when a particular task is considered. CCS Concepts: • Computing methodologies → Natural language processing.
Proceedings of the Brazilian Symposium on Multimedia and the Web, 2020
A SaaS (Software as a Service) can transparently consume a DaaS (Data as a Service). However, heterogeneous DaaS and its evolution can disrupt the SaaS execution. In such cases, a middleware can provide interoperability and monitor the DaaS trackback and its evolution by retrieving its metadata. For instance, the middleware MIDAS provides such interoperability manually. Considering the Web and the number of web pages and DaaS available, this task may be time-consuming and unfeasible. To automate it, it is first necessary to distinguish a DaaS from a typical web page. Thus, this work aims to develop a model to identify DaaS on the Web. We collected a set of features from DaaS and non-DaaS pages to train our model, and we discuss some issues and strengths of our approach. We evaluate precision and recall, and we also measure performance because this model will be embedded into a crawler in future versions of MIDAS. Our model achieves high precision and low execution time, which positions our work well for the evolution of MIDAS.
The Internet of Things (IoT) connects many devices together in the same environment every day. Each device may follow the set of rules of a static environment. A static environment is usually controlled by an expert who knows all the rules necessary to maintain it. The violation of one rule can cause a feature interaction, which occurs when two or more devices generate instability in an environment. In a dynamic environment like IoT, the inclusion and exclusion of devices make it impossible for an expert to keep all these rules up to date. An automatic solution is needed to avoid violating these rules and to maintain the environment's good performance. Thus, this work introduces a new approach to automatically detect feature interactions in dynamic environments. Almost all previous work provides static rules defined by an expert in a controlled environment to detect an interaction. However, this is not possible in dynamic environments ...
Proceedings of the 19th International Conference on Enterprise Information Systems, 2017
Open Information Extraction (Open IE) enables the extraction of facts from large quantities of text written in natural language. Although almost all research has been done on English texts, methods and techniques for other languages have been less frequent. However, languages other than English account for 48% of the content available on websites around the world. In this work, we propose a method for extracting facts in Portuguese without predetermining the types of the facts. Additionally, we increased the quantity of extracted facts by using an inference approach. Our inference method is composed of two mechanisms: a transitive one and a symmetric one. To the best of our knowledge, this is the first time that an inference approach has been used to extract facts from Portuguese texts. Our proposal allowed an increase of 36% in the quantity of valid facts extracted by a Portuguese Open IE system, and it is comparable in fact quality to English approaches.
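The abstract does not describe how the transitive and symmetric mechanisms are implemented. As a rough illustration of the general idea only, the sketch below expands a set of (subject, relation, object) triples until no new fact can be derived. The function name `infer`, the example triples, and the choice of which relations count as symmetric or transitive are all assumptions made for this example.

```python
from typing import Set, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def infer(facts: Set[Triple], symmetric: Set[str], transitive: Set[str]) -> Set[Triple]:
    """Return the facts plus everything derivable by symmetry and transitivity."""
    closed = set(facts)
    while True:
        derived = set()
        for subj, rel, obj in closed:
            if rel in symmetric:
                # (a, r, b) implies (b, r, a)
                derived.add((obj, rel, subj))
            if rel in transitive:
                # (a, r, b) and (b, r, c) imply (a, r, c); skip reflexive results
                for subj2, rel2, obj2 in closed:
                    if rel2 == rel and subj2 == obj and obj2 != subj:
                        derived.add((subj, rel, obj2))
        new_facts = derived - closed
        if not new_facts:
            return closed
        closed |= new_facts


# Hypothetical extracted facts, for illustration only.
facts = {
    ("Salvador", "fica em", "Bahia"),
    ("Bahia", "fica em", "Brasil"),
    ("Ana", "é colega de", "Rui"),
}
expanded = infer(facts, symmetric={"é colega de"}, transitive={"fica em"})
for triple in sorted(expanded - facts):
    print(triple)
```

In this sketch the relation classes are supplied by hand; how a real Open IE system decides that a relation is symmetric or transitive is outside what the abstract states.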
2007 IEEE Congress on Services (Services 2007), 2007
The number of web service formats on the Internet is growing. These services must often be composed to fulfill a user request. In this paper we propose a framework to cope with web service composition, from discovery to optimization. This framework, called SPOC, is composed of four successive phases: discovery, planning, quote execution and optimization. Only RFQ (Request for Quote) web services are treated in this first version of SPOC. The Discovery phase searches for web services in a repository. The Planning phase determines which services satisfy the user request. The Quote Execution phase processes a Request for Quote (RFQ) by calling web services. The last phase optimizes these results, yielding trade-off compositions. As a proof-of-concept example, we applied SPOC to public competition processes in Public Markets.
Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services - iiWAS '10, 2010
We propose an algorithm for checking the semantic compatibility of Web service parameters and for suggesting compatible parameter pairings between two Web services semi-automatically. This algorithm supports semi-automatic Web service composition in workflows where the order of the Web services is known. In our use case, we used the OWL-S semantic description of Web services. We automatically generated a Taverna-compatible