Stanley Oliveira - Academia.edu (original) (raw)

Uploads

Papers by Stanley Oliveira

Research paper thumbnail of Secure Association Rule Sharing

Lecture Notes in Computer Science, 2004

Research paper thumbnail of Detection of broiler heat stress by using the generalised sequential pattern algorithm

Research paper thumbnail of Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field

PloS one, 2018

Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small ... more Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop ...

Research paper thumbnail of Revisiting "Privacy Preserving Clustering by Data Transformation

Preserving the privacy of individuals when data are shared for clustering is a complex problem. T... more Preserving the privacy of individuals when data are shared for clustering is a complex problem. The challenge is how to protect the underlying data values subjected to clustering without jeopardizing the similarity between objects under analysis. In this short paper, we revisit a family of geometric data transformation methods (GDTMs) that distort numerical attributes by translations, scalings, rotations, or even by the combination of these geometric transformations. Such a method was designed to address privacy-preserving clustering, in scenarios where data owners must not only meet privacy requirements but also guarantee valid clustering results. We offer a detailed, comprehensive and up-to-date picture of methods for privacy-preserving clustering by data transformation.

Research paper thumbnail of using Bayesian classification

Research paper thumbnail of Predicting enzyme class from protein structure using Bayesian classification

Genetics and Molecular Research Gmr, Feb 1, 2006

Predicting enzyme class from protein structure parameters is a challenging problem in protein ana... more Predicting enzyme class from protein structure parameters is a challenging problem in protein analysis. We developed a method to predict enzyme class that combines the strengths of statistical and data-mining methods. This method has a strong mathematical foundation and is simple to implement, achieving an accuracy of 45%. A comparison with the methods found in the literature designed to predict enzyme class showed that our method outperforms the existing methods.

Research paper thumbnail of Protecting confidential knowledge by data sani-tation

Research paper thumbnail of Ainfo: a experi�ncia da Embrapa na disponibiliza��o e recupera��o de informa��o

Journal of Informetrics, 1998

Research paper thumbnail of Minera��o de dados para infer�ncia de rela��es solo-paisagem em mapeamentos digitais de solo

Research paper thumbnail of Data Perturbation by Rotation for Privacy-Preserving Clustering

Research paper thumbnail of PDB-Metrics: a web tool for exploring the PDB contents

Genetics and Molecular Research Gmr, Feb 1, 2006

PDB-Metrics (http://sms.cbi.cnptia.embrapa.br/SMS/ pdb_metrics/index.html) is a component of the ... more PDB-Metrics (http://sms.cbi.cnptia.embrapa.br/SMS/ pdb_metrics/index.html) is a component of the Diamond STING suite of programs for the analysis of protein sequence, structure and function. It summarizes the characteristics of the collection of protein structure descriptions deposited in the Protein Data Bank (PDB) and provides a Web interface to search and browse the PDB, using a variety of alternative criteria. PDB-Metrics is a powerful tool for bioinformaticians to examine the data span in the PDB from several perspectives. Although other Web sites offer some similar resources to explore the PDB contents, PDB-Metrics is among those with the most complete set of such facilities, integrated into a single Web site. This program has been developed using SQLite, a C library that provides all the query facilities of a database management system.

Research paper thumbnail of Geometric Data Transformation For

Research paper thumbnail of Privacy-preserving clustering by object similarity-based representation and dimensionality reduction transformation

Preserving privacy of individuals when data are shared for clustering is a challenging problem. D... more Preserving privacy of individuals when data are shared for clustering is a challenging problem. Data owners must not only meet privacy requirements but also guarantee valid clustering results. In this paper, we show that this dual goal can be achieved by transforming a database using two simple and effective data transformations: Object Similarity-Based Representation (OSBR) and Dimensionality Reduction-Based Transformation (DRBT). The former relies on the idea behind the similarity between objects, and the latter relies on the intuition behind random projection. The major features of our data transformations are: a) they are independent of distance-based clustering algorithms; b) they have a sound mathematical foundation; and c) they do not require CPU-intensive operations.

Research paper thumbnail of Data transformation for privacy-preserving data mining

... However, the sharing of data has also raised a number of ethical issues. ... Our investigatio... more ... However, the sharing of data has also raised a number of ethical issues. ... Our investigation concludes that privacy-preserving data mining is to some extent possible. ... demonstrate empirically and theoretically the practicality and feasibility of achieving privacy preservation in data ...

Research paper thumbnail of PROLEITE: sistema de análise e acompanhamento de rebanhos leiteiros

PROLEITE e um sistema de analise e acompanhamento de producao de rebanhos leiteiros, desenvolvido... more PROLEITE e um sistema de analise e acompanhamento de producao de rebanhos leiteiros, desenvolvido para ambiente Windows, com a finalidade de dotar os agentes executores do servico de controle leiteiro de uma infra-estrutura tecnologica capaz de dar suporte a execucaco do Programa de Teste de Progenie (PTP) e do Programa de Inseminacao Artificial (IA). Este sistema tem sido fortemente disseminado em todo Sistema Nacional de Pesquisa Agropecuaria como um instrumento essencial para auxiliar o produtor na analise e avaliacao de seus ...

Research paper thumbnail of Achieving Privacy Preservation When Sharing

In this paper, we address the problem of protecting the underlying attribute values when sharing ... more In this paper, we address the problem of protecting the underlying attribute values when sharing data for clustering. The challenge is how to meet privacy requirements and guarantee valid clustering results as well.

Research paper thumbnail of Data Perturbation by Rotation for

Preserving privacy of individuals when data are shared for clustering is a complex problem. The c... more Preserving privacy of individuals when data are shared for clustering is a complex problem. The challenge is how to protect the underlying attribute values subjected to clustering without jeopardizing the similarity between data objects under analysis. To address this problem, data owners must not only meet privacy requirements but also guarantee valid clustering results. To achieve this dual goal, we propose a novel spatial data transformation method called Rotation-Based Transformation (RBT). The major features of our data transformation are: a) it is independent of any clustering algorithm, b) it has a sound mathematical foundation; c) it is ecient and accurate; and d) it does not rely on intractability hypotheses from algebra and does not require CPU-intensive operations.

Research paper thumbnail of Adoção de TIC e oferta de software na agropecuária: breve relato dos resultados do estudo SWAgro

The paper aims to present the offer of information technology solutions applied to agriculture an... more The paper aims to present the offer of information technology solutions applied to agriculture and an overview of the adoption of Information and Communication Technologies (ICT) in the agricultural sector. The results were based on those obtained in the project “Study of the Brazilian Market of Software for agribusiness" (SWAgro); which was carried out - by Embrapa Agricultural Informatics and partner institutions from 2008 to 2010. The methodology employed in the project encompasses two steps: (i) literature review and (ii) mapping of agricultural software offer through a survey research. The results presented in this paper are: the characterization of agricultural software development companies by size and geographic location as well as the mapping of products offered by them according to 4 application groups, namely: dministration/ management, animal management, crops, and process control and / or rural activities.

Research paper thumbnail of Software para agropecuária: panorama do mercado brasileiro

This paper aims to report the results of scientific research on the Brazilian market for agricult... more This paper aims to report the results of scientific research on the Brazilian market for agricultural software, conducted by Embrapa Agricultural Informatics and partner institutions, from 2008 to 2010. The methodology included the realization of expert panels in Agricultural Informatics, mapping of agricultural software supply through a survey, identification of demands on ICT in agriculture with agricultural cooperatives and institutions for Technical Assistance and Rural Extension (TARE), and identification of opportunities and trends by using the scenario approach. The results showed the characterization of 162 private companies that develop software for agriculture, by geographic distribution, company size and their 402 software products. Of the 230 rural cooperatives that responded to the survey, 39% use some software for agribusiness. Their demands in software are for marketing of agricultural products, farm management and accounting. A number of 132 Institutions for TARE too...

Research paper thumbnail of Modelo matemático para classificação de culturas em Mato Grosso utilizando NDVI/MODIS

Research paper thumbnail of Secure Association Rule Sharing

Lecture Notes in Computer Science, 2004

Research paper thumbnail of Detection of broiler heat stress by using the generalised sequential pattern algorithm

Research paper thumbnail of Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field

PloS one, 2018

Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small ... more Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop ...

Research paper thumbnail of Revisiting "Privacy Preserving Clustering by Data Transformation

Preserving the privacy of individuals when data are shared for clustering is a complex problem. T... more Preserving the privacy of individuals when data are shared for clustering is a complex problem. The challenge is how to protect the underlying data values subjected to clustering without jeopardizing the similarity between objects under analysis. In this short paper, we revisit a family of geometric data transformation methods (GDTMs) that distort numerical attributes by translations, scalings, rotations, or even by the combination of these geometric transformations. Such a method was designed to address privacy-preserving clustering, in scenarios where data owners must not only meet privacy requirements but also guarantee valid clustering results. We offer a detailed, comprehensive and up-to-date picture of methods for privacy-preserving clustering by data transformation.

Research paper thumbnail of using Bayesian classification

Research paper thumbnail of Predicting enzyme class from protein structure using Bayesian classification

Genetics and Molecular Research Gmr, Feb 1, 2006

Predicting enzyme class from protein structure parameters is a challenging problem in protein ana... more Predicting enzyme class from protein structure parameters is a challenging problem in protein analysis. We developed a method to predict enzyme class that combines the strengths of statistical and data-mining methods. This method has a strong mathematical foundation and is simple to implement, achieving an accuracy of 45%. A comparison with the methods found in the literature designed to predict enzyme class showed that our method outperforms the existing methods.

Research paper thumbnail of Protecting confidential knowledge by data sani-tation

Research paper thumbnail of Ainfo: a experi�ncia da Embrapa na disponibiliza��o e recupera��o de informa��o

Journal of Informetrics, 1998

Research paper thumbnail of Minera��o de dados para infer�ncia de rela��es solo-paisagem em mapeamentos digitais de solo

Research paper thumbnail of Data Perturbation by Rotation for Privacy-Preserving Clustering

Research paper thumbnail of PDB-Metrics: a web tool for exploring the PDB contents

Genetics and Molecular Research Gmr, Feb 1, 2006

PDB-Metrics (http://sms.cbi.cnptia.embrapa.br/SMS/ pdb_metrics/index.html) is a component of the ... more PDB-Metrics (http://sms.cbi.cnptia.embrapa.br/SMS/ pdb_metrics/index.html) is a component of the Diamond STING suite of programs for the analysis of protein sequence, structure and function. It summarizes the characteristics of the collection of protein structure descriptions deposited in the Protein Data Bank (PDB) and provides a Web interface to search and browse the PDB, using a variety of alternative criteria. PDB-Metrics is a powerful tool for bioinformaticians to examine the data span in the PDB from several perspectives. Although other Web sites offer some similar resources to explore the PDB contents, PDB-Metrics is among those with the most complete set of such facilities, integrated into a single Web site. This program has been developed using SQLite, a C library that provides all the query facilities of a database management system.

Research paper thumbnail of Geometric Data Transformation For

Research paper thumbnail of Privacy-preserving clustering by object similarity-based representation and dimensionality reduction transformation

Preserving privacy of individuals when data are shared for clustering is a challenging problem. D... more Preserving privacy of individuals when data are shared for clustering is a challenging problem. Data owners must not only meet privacy requirements but also guarantee valid clustering results. In this paper, we show that this dual goal can be achieved by transforming a database using two simple and effective data transformations: Object Similarity-Based Representation (OSBR) and Dimensionality Reduction-Based Transformation (DRBT). The former relies on the idea behind the similarity between objects, and the latter relies on the intuition behind random projection. The major features of our data transformations are: a) they are independent of distance-based clustering algorithms; b) they have a sound mathematical foundation; and c) they do not require CPU-intensive operations.

Research paper thumbnail of Data transformation for privacy-preserving data mining

... However, the sharing of data has also raised a number of ethical issues. ... Our investigatio... more ... However, the sharing of data has also raised a number of ethical issues. ... Our investigation concludes that privacy-preserving data mining is to some extent possible. ... demonstrate empirically and theoretically the practicality and feasibility of achieving privacy preservation in data ...

Research paper thumbnail of PROLEITE: sistema de análise e acompanhamento de rebanhos leiteiros

PROLEITE e um sistema de analise e acompanhamento de producao de rebanhos leiteiros, desenvolvido... more PROLEITE e um sistema de analise e acompanhamento de producao de rebanhos leiteiros, desenvolvido para ambiente Windows, com a finalidade de dotar os agentes executores do servico de controle leiteiro de uma infra-estrutura tecnologica capaz de dar suporte a execucaco do Programa de Teste de Progenie (PTP) e do Programa de Inseminacao Artificial (IA). Este sistema tem sido fortemente disseminado em todo Sistema Nacional de Pesquisa Agropecuaria como um instrumento essencial para auxiliar o produtor na analise e avaliacao de seus ...

Research paper thumbnail of Achieving Privacy Preservation When Sharing

In this paper, we address the problem of protecting the underlying attribute values when sharing ... more In this paper, we address the problem of protecting the underlying attribute values when sharing data for clustering. The challenge is how to meet privacy requirements and guarantee valid clustering results as well.

Research paper thumbnail of Data Perturbation by Rotation for

Preserving privacy of individuals when data are shared for clustering is a complex problem. The c... more Preserving privacy of individuals when data are shared for clustering is a complex problem. The challenge is how to protect the underlying attribute values subjected to clustering without jeopardizing the similarity between data objects under analysis. To address this problem, data owners must not only meet privacy requirements but also guarantee valid clustering results. To achieve this dual goal, we propose a novel spatial data transformation method called Rotation-Based Transformation (RBT). The major features of our data transformation are: a) it is independent of any clustering algorithm, b) it has a sound mathematical foundation; c) it is ecient and accurate; and d) it does not rely on intractability hypotheses from algebra and does not require CPU-intensive operations.

Research paper thumbnail of Adoção de TIC e oferta de software na agropecuária: breve relato dos resultados do estudo SWAgro

The paper aims to present the offer of information technology solutions applied to agriculture an... more The paper aims to present the offer of information technology solutions applied to agriculture and an overview of the adoption of Information and Communication Technologies (ICT) in the agricultural sector. The results were based on those obtained in the project “Study of the Brazilian Market of Software for agribusiness" (SWAgro); which was carried out - by Embrapa Agricultural Informatics and partner institutions from 2008 to 2010. The methodology employed in the project encompasses two steps: (i) literature review and (ii) mapping of agricultural software offer through a survey research. The results presented in this paper are: the characterization of agricultural software development companies by size and geographic location as well as the mapping of products offered by them according to 4 application groups, namely: dministration/ management, animal management, crops, and process control and / or rural activities.

Research paper thumbnail of Software para agropecuária: panorama do mercado brasileiro

This paper aims to report the results of scientific research on the Brazilian market for agricult... more This paper aims to report the results of scientific research on the Brazilian market for agricultural software, conducted by Embrapa Agricultural Informatics and partner institutions, from 2008 to 2010. The methodology included the realization of expert panels in Agricultural Informatics, mapping of agricultural software supply through a survey, identification of demands on ICT in agriculture with agricultural cooperatives and institutions for Technical Assistance and Rural Extension (TARE), and identification of opportunities and trends by using the scenario approach. The results showed the characterization of 162 private companies that develop software for agriculture, by geographic distribution, company size and their 402 software products. Of the 230 rural cooperatives that responded to the survey, 39% use some software for agribusiness. Their demands in software are for marketing of agricultural products, farm management and accounting. A number of 132 Institutions for TARE too...

Research paper thumbnail of Modelo matemático para classificação de culturas em Mato Grosso utilizando NDVI/MODIS