Invariant Object Recognition with Slow Feature Analysis (original) (raw)
Abstract
Primates are very good at recognizing objects independently of viewing angle or retinal position and outperform existing computer vision systems by far. But invariant object recognition is only one prerequisite for successful interaction with the environment. An animal also needs to assess an object’s position and relative rotational angle. We propose here a model that is able to extract object identity, position, and rotation angles, where each code is independent of all others. We demonstrate the model behavior on complex three-dimensional objects under translation and in-depth rotation on homogeneous backgrounds. A similar model has previously been shown to extract hippocampal spatial codes from quasi-natural videos. The rigorous mathematical analysis of this earlier application carries over to the scenario of invariant object recognition.
Preview
Unable to display preview. Download preview PDF.
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
References
- Rolls, E.T., Deco, G.: The Computational Neuroscience of Vision. Oxford University Press, New York (2002)
Google Scholar - Franzius, M., Sprekeler, H., Wiskott, L.: Slowness and sparseness lead to place, head-diretion and spatial-view cells. Public Library of Science (PLoS) Computational Biology 3(8), 166 (2007)
MathSciNet Google Scholar - Picard, R., Graczyk, C., Mann, S., Wachman, J., Picard, L., Campbell, L.: Vision texture (2002), http://vismod.media.mit.edu/vismod/imagery/VisionTexture/vistex.html
- Toucan Corporation: Toucan virtual museum (2005), http://toucan.web.infoseek.co.jp/3DCG/3ds/FishModelsE.html
- Wiskott, L., Sejnowski, T.: Slow feature analysis: Unsupervised learning of invariances. Neural Computation 14(4), 715–770 (2002)
Article MATH Google Scholar - Berkes, P., Wiskott, L.: Slow feature analysis yields a rich repertoire of complex cell properties. Journal of Vision 5(6), 579–602 (2005), http://journalofvision.org/5/6/9/
Article Google Scholar - Hashimoto, W.: Quadratic forms in natural images. Network: Computation in Neural Systems 14(4), 765–788 (2003)
Article Google Scholar - Sprekeler, H., Michaelis, C., Wiskott, L.: Slowness: An objective for spike-timing-plasticity? PLoS Computational Biology 3(6), 112 (2007)
Article MathSciNet Google Scholar - Wiskott, L.: Slow feature analysis: A theoretical analysis of optimal free responses. Neural Computation 15(9), 2147–2177 (2003)
Article MATH Google Scholar - Berkes, P., Zito, T.: Modular toolkit for data processing (version 2.0) (2005), http://mdp-toolkit.sourceforge.net
- Földiák, P.: Learning invariance from transformation sequences. Neural Computation 3, 194–200 (1991)
Article Google Scholar - Stone, J.V., Bray, A.: A learning rule for extracting spatio-temporal invariances. Network: Computation in Neural Systems 6, 429–436 (1995)
Article MATH Google Scholar - Kayser, C., Einhäuser, W., Dümmer, O., König, P., Körding, K.: Extracting slow subspaces from natural videos leads to complex cells. In: Dorffner, G., Bischof, H., Hornik, K. (eds.) ICANN 2001. LNCS, vol. 2130, pp. 1075–1080. Springer, Heidelberg (2001)
Chapter Google Scholar - Rolls, E.T.: Neurophysiological mechanisms underlying face processing within and beyond the temporal cortical visual areas. Philosophical Transactions of the Royal Society 335, 11–21 (1992)
Article Google Scholar - Wallis, G., Rolls, E.T.: Invariant face and object recognition in the visual system. Progress in Neurobiology 51(2), 167–194 (1997)
Article Google Scholar
Author information
Authors and Affiliations
- Institute for Theoretical Biology, Humboldt-Universität zu Berlin, Germany
Mathias Franzius, Niko Wilbert & Laurenz Wiskott
Authors
- Mathias Franzius
- Niko Wilbert
- Laurenz Wiskott
Editor information
Véra Kůrková Roman Neruda Jan Koutník
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Franzius, M., Wilbert, N., Wiskott, L. (2008). Invariant Object Recognition with Slow Feature Analysis. In: Kůrková, V., Neruda, R., Koutník, J. (eds) Artificial Neural Networks - ICANN 2008. ICANN 2008. Lecture Notes in Computer Science, vol 5163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87536-9\_98
Download citation
- .RIS
- .ENW
- .BIB
- DOI: https://doi.org/10.1007/978-3-540-87536-9\_98
- Publisher Name: Springer, Berlin, Heidelberg
- Print ISBN: 978-3-540-87535-2
- Online ISBN: 978-3-540-87536-9
- eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.