Prediction of A CRS Frontier Function and A Transformation Function for A CCR DEA Using EMBEDED PCA (original) (raw)

Pca as a Tool for the Selection of Inputs and Outputs in Data Envelopment Analysis

DEA model selection is problematic. The estimated efficiency for any DMU depends on the inputs and outputs included in the model. It also depends on the number of outputs plus inputs. It is clearly important to select parsimonious specifications and to avoid as far as possible models that assign full high efficiency ratings to DMUs that operate in unusual ways (mavericks). A new method for model selection is proposed in this paper. Efficiencies are calculated for all possible DEA model specifications. The results are analysed using Principal Components Analysis. It is shown that model equivalence or dissimilarity can be easily assessed using this approach. The reasons why particular DMUs achieve a certain level of efficiency with a given model specification become clear. The methodology has the additional advantage of producing DMU rankings.

Sensitivity and Stability of the Classifications of Returns to Scale in Data Envelopment Analysis

J Prod Anal, 1999

Sensitivity of the returns to scale (RTS) classifications in data envelopment analysis is studied by means of linear programming problems. The stability region for an observation preserving its current RTS classification (constant, increasing or decreasing returns to scale) can be easily investigated by the optimal values to a set of particular DEA-type formulations. Necessary and sufficient conditions are determined for preserving the RTS classifications when input or output data perturbations are non-proportional. It is shown that the sensitivity analysis method under proportional data perturbations can also be used to estimate the RTS classifications and discover the identical RTS regions yielded by the input-based and the output-based DEA methods. Thus, our approach provides information on both the RTS classifications and the stability of the classifications. This sensitivity analysis method can easily be applied via existing DEA codes.

Searching the Efficient Frontier in Data Envelopment Analysis

International Series in Operations Research & Management Science, 2002

In this paper, we deal with the problem of searching the efficient frontier in Data Envelopment Analysis (DEA). Our aim is to show that the free search approach developed to make a search on the efficient frontier in multiple objective programming can also be used in DEA. This kind of analysis is needed when among others a) a radial projection is not acceptable, b) there are restrictions on some input and output values, or c) a decision maker (DM) would like to find a decision making unit (DMU) with the most preferred input and output values. The search can be applied to CCR/BCC-models, input-oriented/output-oriented/combined models, and to the models with extra constraints. To make a free search on the efficient frontier, we recommend the use of Pareto Race (Korhonen and Wallenius [1988]) for this purpose. In Pareto Race, the DM may simply control the search with some function keys. The information is displayed to the DM as bar graphs and in numeric form. The search can be terminated at any time, the DM wishes. A numerical example is used to illustrate the approach.

An Alternative Approach to Reduce Dimensionality in Data Envelopment Analysis

Journal of Modern Applied Statistical Methods, 2013

Principal component analysis reduces dimensionality; however, uncorrelated components imply the existence of variables with weights of opposite signs. This complicates the application in data envelopment analysis. To overcome problems due to signs, a modification to the component axes is proposed and was verified using Monte Carlo simulations.

Improving discrimination in data envelopment analysis: PCA-DEA or variable reduction

European Journal of Operational Research, 2010

Within the data envelopment analysis context, problems of discrimination between efficient and inefficient decision-making units often arise, particularly if there are a relatively large number of variables with respect to observations. This paper applies Monte-Carlo simulation to generalize and compare two discrimination-improving methods; principal component analysis applied to data envelopment analysis (PCA-DEA) and variable reduction based on partial covariance (VR).

The Comparison of Principal Component Analysis and Data Envelopment Analysis in Ranking of Decision Making Units

DergiPark (Istanbul University), 2010

In this study, Data Envelopment Analysis (DEA) and Principal Component Analysis (PCA) were compared when these two methods are used for ranking Decision Making Units (DMU) with multiple inputs and outputs. DEA, a nonstatistical technique, is a methodology using a linear programming model for evaluating and ranking DMU's performance. PCA, a multivariate statistical method, uses new measures defined by DMU's inputs and outputs. The results of both methods were applied to a real data set that indicates the economic performances of European Union member countries and also, a simulation study was done for different sample sizes and for different numbers of input-output, and the results were examined. For both applications, consistent results were obtained. Spearman's correlation test is employed to compare the rankings obtained by PCA and DEA.

A new proposed model of restricted data envelopment analysis by correlation coefficients

Applied Mathematical Modelling, 2013

The concept of efficiency in data envelopment analysis (DEA) is defined as weighted sum of outputs/weighted sum of inputs. In order to calculate the maximum efficiency score, each decision making unit (DMU)'s inputs and outputs are assigned to different weights. Hence, the classical DEA allows the weight flexibility. Therefore, even if they are important, the inputs or outputs of some DMUs can be assigned zero (0) weights. Thus, these inputs or outputs are neglected in the evaluation. Also, some DMUs may be defined as efficient even if they are inefficient. This situation leads to unrealistic results. Also to eliminate the problem of weight flexibility, weight restrictions are made in DEA. In our study, we proposed a new model which has not been published in the literature. We describe it as the restricted Data Envelopment Analysis ((ARIII(COR))) model with correlation coefficients. The aim for developing this new model, is to take into account the relations between variables using correlation coefficients. Also, these relations were added as constraints to the CCR and BCC models. For this purpose, the correlation coefficients were used in the restrictions of input-output each one alone and their combination together. Inputs and outputs are related to the degree of correlation between each other in the production. Previous studies did not take into account the relationship between inputs/outputs variables. So, only with expert opinions or an objective method, weight restrictions have been made. In our study, the weights for input and output variables were determined, according to the correlations between input and output variables. The proposed new method is different from other methods in the literature, because the efficiency scores were calculated at the level of correlations between the input and/or output variables. j o u r n a l h o m e p a g e : w w w . e l s e v i e r . c o m / l o c a t e / a p m et al.

Finding the efficiency score and RTS characteristic of DMUs by means of identifying the efficient frontier in DEA

Applied Mathematics and Computation, 2005

Data envelopment analysis (DEA) is basically a linear programming based technique used for measuring the relative performance of organizational units, referred to as decision-making units (DMUs), where the presence of multiple inputs and outputs makes comparisons difficult. The ability of identifying frontier DMUs prior to the DEA calculation is of extreme importance to an effective and efficient DEA computation. In this paper, a method for identifying the efficient frontier is introduced. Then, the efficiency score and returns to scale (RTS) characteristic of DMUs will be produced by means of the equation of efficient frontier.

Measurement of returns to scale in DEA using the CCR model

In data envelopment analysis (DEA) literature, the returns to scale (RTS) of an inefficient decision making unit (DMU) is determined at its projected point on the efficient frontier. Under the occurrences of multiple projection points, however, this evaluation procedure is not precise and may lead to erroneous inferences as to the RTS possibilities of DMUs. To circumvent this, the current communication first defines the RTS of an inefficient DMU at its projected point that lies in the relative interior of the minimum face. Based on this definition, it proposes an algorithm by extending the latest developed method of measuring RTS via the CCR model. The main advantage of our proposed algorithm lies in its computational efficiency.

Selecting DEA specifications and ranking units via PCA

Journal of the Operational Research Society, 2004

Data envelopment analysis (DEA) model selection is problematic. The estimated efficiency for any DMU depends on the inputs and outputs included in the model. It also depends on the number of outputs plus inputs. It is clearly important to select parsimonious specifications and to avoid as far as possible models that assign full high-efficiency ratings to DMUs that operate in unusual ways (mavericks). A new method for model selection is proposed in this paper. Efficiencies are calculated for all possible DEA model specifications. The results are analysed using Principal Component Analysis. It is shown that model equivalence or dissimilarity can be easily assessed using this approach. The reasons why particular DMUs achieve a certain level of efficiency with a given model specification become clear. The methodology has the additional advantage of producing DMU rankings. These rankings can always be established independently of whether the model is estimated under constant or under variable returns to scale.