sharifah syed | International Islamic University Malaysia (original) (raw)

sharifah syed

Related Authors

Armando Marques-Guedes

Eric André  Poirier

Lucas R . Platero

Abir Salaaoui

Izawati Wook

Christine Sleeter

Mohd Nur Syufaat Jamiran

Mohd Ismail

Mohd Ismail

Institut Pendidikan Guru Kampus Dato' Razali Ismail

Guilherme Moerbeck

Guilherme Moerbeck

UERJ - Universidade do Estado do Rio de Janeiro / Rio de Janeiro State University

Uploads

Papers by sharifah syed

Research paper thumbnail of Identification of Outliers: A Simulation Study

This paper compares two approaches in identifying outliers in multivariate datasets; Mahalanobis ... more This paper compares two approaches in identifying outliers in multivariate datasets; Mahalanobis distance (MD) and robust distance (RD). MD has been known suffering from masking and swamping effects and RD is an approach that was developed to overcome problems that arise in MD. There are two purposes of this paper, first is to identify outliers using MD and RD and the second is to show that RD performs better than MD in identifying outliers. An observation is classified as an outlier if MD or RD is larger than a cut-off value. Outlier generating model is used to generate a set of data and MD and RD are computed from this set of data. The results showed that RD can identify outliers better than MD. However, in non-outliers data the performance for both approaches are similar. The results for RD also showed that RD can identify multivariate outliers much better when the number of dimension is large.

Research paper thumbnail of Identification of Outliers: A Simulation Study

This paper compares two approaches in identifying outliers in multivariate datasets; Mahalanobis ... more This paper compares two approaches in identifying outliers in multivariate datasets; Mahalanobis distance (MD) and robust distance (RD). MD has been known suffering from masking and swamping effects and RD is an approach that was developed to overcome problems that arise in MD. There are two purposes of this paper, first is to identify outliers using MD and RD and the second is to show that RD performs better than MD in identifying outliers. An observation is classified as an outlier if MD or RD is larger than a cut-off value. Outlier generating model is used to generate a set of data and MD and RD are computed from this set of data. The results showed that RD can identify outliers better than MD. However, in non-outliers data the performance for both approaches are similar. The results for RD also showed that RD can identify multivariate outliers much better when the number of dimension is large.

Log In