LUCAS MONTEIRO CHAVES - Academia.edu

Papers by LUCAS MONTEIRO CHAVES

Distribuição bivariada Gama Beta II com uma aplicação em modelagem de precipitações pluviométricas [Bivariate Gamma Beta II distribution with an application to rainfall modeling] (pp.159)

Uma Nova Classe de Distribuições Generalizadas [A New Class of Generalized Distributions]

TEMA - Tendências em Matemática Aplicada e Computacional, 2012

Abstract. In this work, we introduce a family of distributions called confluent hypergeometric G, which includes the important models beta normal, beta Weibull, beta Gumbel, and beta Pareto, among others. New distributions are presented as members of this family, for example the confluent hypergeometric normal distribution and the confluent hypergeometric Weibull distribution. Parameter estimation for this new class of generalized distributions is studied using the Maximum Likelihood Method, and its potential is demonstrated by modeling a real data set of thirty-five children with growth hormone deficiency. Keywords. Generalized beta distribution, confluent hypergeometric distribution, Maximum Likelihood Method.

A Bayesian approach to shrinkage estimators

Estimators obtained by shrinking the least squares estimator have become widely used since the work of Stein, in the early 1960s, which presented an estimator for the mean of a multivariate normal distribution that dominates the sample mean, and the work of Hoerl and Kennard, in the early 1970s, on ridge estimators. In this work we present an approach using Bayesian and empirical Bayesian procedures to obtain some important shrinkage estimators.

Camaleão: Um Software Para Segurança Digital Utilizando Esteganografia [Camaleão: Software for Digital Security Using Steganography]

Authors: Anderson de Rezende Rocha¹, Heitor Augustus Xavier Costa (advisor)², Lucas Monteiro Chaves (co-advisor)². ¹ Instituto de Computação ...

Distribuição bivariada gama beta II: soma, produto e proporção das variáveis componentes [Bivariate gamma beta II distribution: sum, product, and proportion of the component variables]

TEMA - Tendências em Matemática Aplicada e Computacional, 2013

Violation of probabilistic inequality Clauser-Horne-Shimony-Holt in Quantum Mechanics

Cornell University - arXiv, Oct 31, 2016

... the violation of the Clauser-Horne-Shimony-Holt inequality in practice. After performing the experiment, they applied the data to the inequality and concluded that the inequality, obtained through probabilistic arguments, was violated, confirming John S. Bell's conclusion that a hidden-variable theory was not possible under the conditions proposed by Einstein, Podolsky, and Rosen. The objective of this work is to examine the validity of the formulas in the ...

Estimating Bounded Mean Vector in Multivariate Normal: The Geometry of Hartigan Estimator

The problem of estimating the mean of a multivariate normal distribution N_p(θ; I) when the mean vector is bounded has attracted both practical and theoretical interest. Under this hypothesis it is possible to obtain estimators that dominate the sample mean estimator with respect to squared loss. Generalizing previous results for the univariate normal, J. A. Hartigan obtained, for the multivariate normal with independent components, a Bayes estimator defined on a bounded closed convex set with non-empty interior that dominates the sample mean estimator. In this work, this result is presented in detail for the case where the restriction set is a sphere centered at the origin. A geometrical interpretation, useful for understanding the phenomenon, is presented. Other estimators based on Gatsonis et al. (1987) are proposed, and the risks of all these estimators are compared through simulations for dimensions p = 1 and p = 2.

Symmetries in Symbolic Sequences

ABSTRACT: Data are often indexed by a set of labels that reflect certain experimental conditions of interest. When these labels also have some symmetry in their structure, the methodology of symmetry studies (Viana, 2008) can be used to facilitate the analysis and interpretation of the data. In the present work, the symmetry-related properties derived for the study of data indexed by quaternary sequences of length three are studied in detail within that context.

Estimadores tipo James-Stein e suas propriedades via simulação computacional [James-Stein-type estimators and their properties via computer simulation]

Revista Brasileira de Biometria, 2017

A Bayesian Design to Test Dose-Response Sequential Trials Using Isotonic Regression

The present work aims to find a dose with a pre-specified toxicity rate in a target population. Using Bayesian methods and isotonic regression, it proposes a sequential design that allows researchers to update prior information. The theory behind the design is not easily assimilated by non-statisticians. What makes the approach attractive is its reduction to four easily handled steps, with no need for computational effort. Simulation procedures confirm the effectiveness of the proposed methodology.

Factor copula models for right-censored clustered survival data

Lifetime Data Analysis, 2021

In this article we extend the factor copula model to deal with right-censored event time data grouped in clusters. The new methodology allows clusters to have variable sizes, ranging from small to large, and intracluster dependence to be flexibly modeled by any parametric family of bivariate copulas, thus encompassing a wide range of dependence structures. Incorporation of covariates (possibly time dependent) in the margins is also supported. Three estimation procedures are proposed: one- and two-stage parametric methods and a two-stage semiparametric method in which marginal survival functions are estimated using a Cox proportional hazards model. We prove that the estimators are consistent and asymptotically normally distributed, and assess their finite sample behavior with simulation studies. Furthermore, we illustrate the proposed methods on a data set containing the time to first insemination after calving in dairy cattle clustered in herds of different sizes.

Camaleão: um software de esteganografia para proteção e segurança digital [Camaleão: steganography software for digital protection and security]

Camaleão: privacidade e segurança na internet por esteganografia em imagens [Camaleão: privacy and security on the internet through image steganography]

Steganography, the art and science of secret communication, includes a wide range of methods such as "invisible" inks, microdots, and character arrangement, among others. The main objectives of this work were to survey the principal current steganography techniques for digital images and to develop software capable of enabling secure communication over the internet.

The optimal number of partial least squares components in genomic selection for pork pH

Ciência Rural, 2017

ABSTRACT: The main application of genomic selection (GS) is the early identification of genetically superior animals for traits that are difficult to measure or evaluated late, such as meat pH (measured after slaughter). Because the number of markers in GS is generally larger than the number of genotyped animals and these markers are highly correlated owing to linkage disequilibrium, statistical methods based on dimensionality reduction have been proposed. Among them, the partial least squares (PLS) technique stands out because of its simplicity and high predictive accuracy. However, choosing the optimal number of components remains a relevant issue for PLS applications. Thus, we applied PLS (and principal component and traditional multiple regression) techniques to GS for pork pH traits (with pH measured at 45 min and 24 h after slaughter) and also identified the optimal number of PLS components based on the degree-of-freedom (DoF) and cross-validation (CV) methods. The PLS method out perf...

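Estimador não paramétrico para modelos de platô de resposta via regressão isotônica aplicado a dados de deposição de Zn no dedo médio de aves fêmeas da linhagem Hubbard [Nonparametric estimator for response plateau models via isotonic regression, applied to Zn deposition data in the middle toe of female birds of the Hubbard line]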

Three Simple Heuristics Mathematical Proofs on Lasso Theory

REVISTA BRASILEIRA DE BIOMETRIA

Three relevant facts about the least absolute shrinkage and selection operator (Lasso) are studied: the estimates follow piecewise linear curves as functions of the tuning parameter; the number of nonzero selected covariates is an unbiased estimator of the Lasso's degrees of freedom; and when the number of covariates p is greater than the number of observations n, at most n covariates are selected. These results are well known and described in the literature, but without simple proofs. Based on a geometrical approach, we present simple and intuitive heuristic proofs of these results.

Proposal of a Rao Ridge Type Estimator

REVISTA BRASILEIRA DE BIOMETRIA

Based on a geometrical interpretation of Ridge estimators, a new Rao Ridge type estimator is proposed. Its advantage is that it reaches the optimum value of the shrinkage parameter more quickly. The geometry, the predictive capacity, a computational example, an application to real data, and a comparison with the usual Ridge estimator are developed.

Explaining the Generalized Cross-Validation on Linear Models

Journal of Mathematics and Statistics

Cross-Validation is a model validation method widely used by the scientific community. The Generalized Cross-Validation (GCV) is an invariant version of the usual Cross-Validation method. This generalization was obtained using the less familiar theory of complex circulant matrices. In this work we intend to give a clear and complete exposition of the linear algebra assumptions required by the theory. The aim is to make this text accessible to a wide audience of statisticians and non-statisticians who use the Cross-Validation method in their research activities. It is also intended to fill the absence of a basic reference on this topic in the literature.

On the Prediction Error

REVISTA BRASILEIRA DE BIOMETRIA

The theory of model prediction error is presented in detail from the point of view of geometric constructions. It is expected that this approach can serve as a pedagogical tool in the treatment of the subject. Although the focus is essentially conceptual, all algebraic steps are developed in order to facilitate the reader's understanding. Two elementary examples are presented.

Isotonic regression analysis of Guzerá cattle growth curves

Revista Ceres

The objective of this study was to apply data transformation via isotonic regression in growth curve studies of Guzerá cattle whose data presented disturbances characterized by decreased body weight in certain age groups. Weight-age data were collected on newly weaned Guzerá males according to the methodology of weight gain tests (WGT) defined by the Brazilian Association of Zebu Breeders (ABCZ). The Logistic, Von Bertalanffy, and Gompertz models were fitted to the weight-age data using the generalized least squares method for non-linear regression models through the Gauss-Newton algorithm. The proposed transformation based on isotonic regression theory proved to be efficient, and the Logistic model was the best at describing the growth of the animals, with a high percentage of convergence (100%) and goodness of fit assessed by the mean squared error (MSE) and the coefficient of determination (R²).
