A new complexity measure for time series analysis and classification (original) (raw)
Abstract
Complexity measures are used in a number of applications including extraction of information from data such as ecological time series, detection of non-random structure in biomedical signals, testing of random number generators, language recognition and authorship attribution etc. Different complexity measures proposed in the literature like Shannon entropy, Relative entropy, Lempel-Ziv, Kolmogrov and Algorithmic complexity are mostly ineffective in analyzing short sequences that are further corrupted with noise. To address this problem, we propose a new complexity measure ETC and define it as the “Effort To Compress” the input sequence by a lossless compression algorithm. Here, we employ the lossless compression algorithm known as Non-Sequential Recursive Pair Substitution (NSRPS) and define ETC as the number of iterations needed for NSRPS to transform the input sequence to a constant sequence. We demonstrate the utility of ETC in two applications. ETC is shown to have better correlation with Lyapunov exponent than Shannon entropy even with relatively short and noisy time series. The measure also has a greater rate of success in automatic identification and classification of short noisy sequences, compared to entropy and a popular measure based on Lempel-Ziv compression (implemented by Gzip).
Access this article
Subscribe and save
- Get 10 units per month
- Download Article/Chapter or eBook
- 1 Unit = 1 Article or 1 Chapter
- Cancel anytime Subscribe now
Buy Now
Price excludes VAT (USA)
Tax calculation will be finalised during checkout.
Instant access to the full article PDF.
Similar content being viewed by others
References
- A. Lempel, J. Ziv, IEEE Trans. Inform. Theory IT-22, 75 (1976)
Article MathSciNet Google Scholar - M. Borowska, E. Oczeretko, A. Mazurek, A. Kitlas, P. Kuc, Annual Proc. Medical Sci. Suppl. 2, 50 (2005)
- L.E.V. Silva, L.O. Murta Jr., Chaos 22, 043105-1 (2012)
ADS Google Scholar - A. Menezes, P.C. van Oorschot, S. Vanstone, Handbook of Applied Cryptography (CRC Press, Boca Raton, Florida, 1996)
- A.N. Kolmogorov, IEEE Trans. Inf. Theory IT 14, 662 (1965)
Article MathSciNet Google Scholar - G.J. Chaitin, Algorithmic Information Theory (Cambridge Univ. Press, New York, 1987)
- C.E. Shannon, The Bell System Tech. J. 27, 379 (1948)
MathSciNet MATH Google Scholar - J. Ziv, A. Lempel, IEEE Trans. Info. Theory 23, 337 (1977)
Article MathSciNet MATH Google Scholar - K. Sayood, Introduction to Data Compression (Morgan Kaufmann Publ., 2009)
- T.M. Cover, J.A. Thomas, Elements of Information Theory (Wiley Interscience, 1991)
- H. Herzel, W. Ebeling, A.O. Schmitt, Phys. Rev. E. 50, 5061 (1994)
Article ADS Google Scholar - S.M. Pincus, Proc. Nati. Acad. Sci. USA 88, 2297 (1991)
Article MathSciNet ADS MATH Google Scholar - W. Ebeling, M.A. Jiménez-Montaño, Math. Biosci. 52, 53 (1980)
Article MATH Google Scholar - P. Grassberger [arXiv:physics/0207023] (2002)
- W. Ebeling, R. Steuer, M.R. Titchener, Stocha. Dyn. 1, 45 (2001)
Article MathSciNet MATH Google Scholar - A. Puglisi, D. Benedetto, E. Caglioti, V. Loreto, A. Vulpiani, Physica D 180, 92 (2003)
Article MathSciNet ADS MATH Google Scholar - L.M. Calcagnile, S Galatolo, G Menconi [arXiv:0809.1342v2] [cond-mat.stat-mech] (2009)
- Ö.U. Nalbantoglu, D.J. Russell, K. Sayood, Entropy 12, 34 (2010)
Article Google Scholar - M.A. Jiménez-Montaño, W. Ebeling, T. Pöschel [arXiv:cond-mat/0204134v1] [cond-mat.dis-nn] (2002)
- D. Benedetto, E. Caglioti, D. Gabrielli, J. Stat. Mech. P09011-1 (2006)
- M. Hasler, IEEE Trans. Cir. Sys.-I: Fund. Theory Appls. 44, 856 (1997)
Article MathSciNet Google Scholar - G. Alvarez, S. Li, Intl. J. Bif. Chaos 16, 2129 (2006)
Article MathSciNet MATH Google Scholar - D. Arroyo, G. Alvarez, V. Fernandez [arXiv:0805.4355v1] [nlin.CD] (2008)
- R. Bose, S. Pathak, IEEE Trans. Cir Sys-I 53, 848 (2006)
Article MathSciNet Google Scholar - P. Sah, J.P. Salve, S. Dey, J. Theor. Biol. 320, 113 (2012)
Article Google Scholar - H. Kantz, T. Schreiber, Nonlinear Time Series Analysis (Cambridge University Press, Cambridge, 1997)
- M.T. Rosenstein, J.J. Collins, C.J.D. Luca, Physica D 65, 117 (1993)
Article MathSciNet ADS MATH Google Scholar - S. Dey, A. Joshi, Science 312, 434 (2006)
Article ADS Google Scholar - K. Sakai, Y. Noguchi, S. Asada, Chaos, Solitons Fractals 38, 1274 (2008)
Article ADS Google Scholar - L. Becks, F.M. Hilker, H. Malchow, K. Jürgens, H. Arndt, Nature 435, 1226 (2005)
Article ADS Google Scholar - D. Benedetto, E. Caglioti, V. Loreto, A. Vulpiani, Phys. Rev. Lett. 88, 048702-1 (2002)
Article ADS Google Scholar
Author information
Authors and Affiliations
- Department of Electronics and Communications Engineering, Amrita School of Engineering, Amrita Vishwa Vidyapeetham, Amritapuri Campus, Kollam, Kerala, 690525, India
Nithin Nagaraj & Karthi Balasubramanian - Biology Division, Indian Institute of Science Education and Research, Pune, Maharashtra, 411021, India
Sutirth Dey - Adjunct Faculty, School of Natural Sciences and Engineering, National Institute of Advanced Studies, Indian Institute of Science Campus, Bangalore, 560012, India
Nithin Nagaraj
Authors
- Nithin Nagaraj
You can also search for this author inPubMed Google Scholar - Karthi Balasubramanian
You can also search for this author inPubMed Google Scholar - Sutirth Dey
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toNithin Nagaraj.
Rights and permissions
About this article
Cite this article
Nagaraj, N., Balasubramanian, K. & Dey, S. A new complexity measure for time series analysis and classification.Eur. Phys. J. Spec. Top. 222, 847–860 (2013). https://doi.org/10.1140/epjst/e2013-01888-9
- Received: 22 March 2013
- Revised: 03 May 2013
- Published: 11 July 2013
- Issue Date: July 2013
- DOI: https://doi.org/10.1140/epjst/e2013-01888-9