From Exploration to Planning (original) (raw)

References

  1. Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
    Google Scholar
  2. Ungless, M., Magill, P., Bolam, J.: Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli. Science 303, 2040–2042 (2004)
    Article Google Scholar
  3. Tobler, P., Fiorillo, C., Schultz, W.: Adaptive coding of reward value by dopamine neurons. Science 307(5715), 1642–1645 (2005)
    Article Google Scholar
  4. Foster, D., Dayan, P.: Structure in the space of value functions. Machine Learning 49, 325–346 (2002)
    Article MATH Google Scholar
  5. Davidson, P., Wolpert, D.: Widespread access to predictive models in the motor system: A short review. Journal of Neural Engineering 2, 8313–8319 (2005)
    Article Google Scholar
  6. Iacoboni, M., Wilson, S.: Beyond a single area: motor control and language within a neural architecture encompassing broca’s area. Cortex 42(4), 503–506 (2006)
    Article Google Scholar
  7. Miall, R.: Connecting mirror neurons and forward models. Neuroreport 14(16), 2135–2137 (2003)
    Article Google Scholar
  8. Oztop, E., Wolpert, D., Kawato, M.: Mirror neurons: Key for mental simulation? In: Twelfth annual computational neuroscience meeting CNS, p. 81 (2003)
    Google Scholar
  9. Churchland, P.: Self-representation in nervous systems. Science 296, 308–310 (2002)
    Article Google Scholar
  10. Plaut, D.C., Kello, C.T.: The emergence of phonology from the interplay of speech comprehension and production: A distributed connectionist approach. In: The emergence of language. B. MacWhinney (1998)
    Google Scholar
  11. Metta, G., Panerai, F., Manzotti, R., Sandini, G.: Babybot: an artificial developing robotic agent. In: SAB (2000)
    Google Scholar
  12. Dearden, A., Demiris, Y.: Learning forward models for robots. In: IJCAI, pp. 1440–1445 (2005)
    Google Scholar
  13. Weber, C.: Self-organization of orientation maps, lateral connections, and dynamic receptive fields in the primary visual cortex. In: Dorffner, G., Bischof, H., Hornik, K. (eds.) ICANN 2001. LNCS, vol. 2130, pp. 1147–1152. Springer, Heidelberg (2001)
    Chapter Google Scholar
  14. Dorigo, M., Birattari, M., Stützle, T.: Ant colony optimization. Computational Intelligence Magazine, IEEE 1(4), 28–39 (2006)
    Google Scholar
  15. Witkowski, M.: An action-selection calculus. Adaptive Behavior 15(1), 73–97 (2007)
    Article Google Scholar
  16. Schmidhuber, J.: Developmental robotics, optimal artificial curiosity, creativity, music, and the fine arts. Connection Science 18(2), 173–187 (1991)
    Article Google Scholar
  17. Herrmann, J., Pawelzik, K., Geisel, T.: Learning predictive representations. Neurocomputing 32-33, 785–791 (2000)
    Article Google Scholar
  18. Oudeyer, P., Kaplan, F., Hafner, V., Whyte, A.: The playground experiment: Task-independent development of a curious robot. In: AAAI Spring Symposium Workshop on Developmental Robotics (2005)
    Google Scholar
  19. Der, R., Martius, G.: From motor babbling to purposive actions: Emerging self-exploration in a dynamical systems approach to early robot development. In: SAB, pp. 406–421. Springer, Berlin (2006)
    Google Scholar
  20. Foster, D., Morris, R., Dayan, P.: A model of hippocampally dependent navigation, using the temporal difference learning rule. Hippocampus 10, 1–16 (2000)
    Article Google Scholar
  21. Van Rullen, R., Thorpe, S.: Rate coding versus temporal order coding: What the retinal ganglion cells tell the visual cortex. Neur. Comp. 13, 1255–1283 (2001)
    Article MATH Google Scholar
  22. Roelfsema, P., van Ooyen, A.: Attention-gated reinforcement learning of internal representations for classification. Neur. Comp. 17, 2176–2214 (2005)
    Article MATH Google Scholar
  23. McCallum, A.: Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, U. of Rochester (1995)
    Google Scholar

Download references