Skip to main content Skip to main navigation

Publications

Displaying results 2201 to 2210 of 13730.
  1. Daan Wierstra; Alexander Förster; Jan Peters; Jürgen Schmidhuber

    Recurrent policy gradients

    In: Logic Journal of the IGPL Oxford, Vol. 18, No. 5, Pages 620-634, Oxford University Press, 2010.

  2. Jeremy L. Wyatt; Peter Dayan; Ales Leonardis; Jan Peters

    Exploration and Curiosity in Robot Learning and Inference (Dagstuhl Seminar 11131)

    In: Dagstuhl Reports, Vol. 1, No. 3, Pages 67-95, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, 2011.

  3. Yevgeny Seldin; Nicolò Cesa-Bianchi; François Laviolette; Peter Auer; John Shawe-Taylor; Jan Peters

    PAC-Bayesian Analysis of the Exploration-Exploitation Trade-off

    In: Computing Research Repository eprint Journal (CoRR), Vol. abs/1105.4585, Pages 0-10, arXiv, 2011.

  4. Yevgeny Seldin; François Laviolette; John Shawe-Taylor; Jan Peters; Peter Auer

    PAC-Bayesian Analysis of Martingales and Multiarmed Bandits

    In: Computing Research Repository eprint Journal (CoRR), Vol. abs/1105.2416, Pages 0-10, arXiv, 2011.

  5. Abdeslam Boularias; Jens Kober; Jan Peters

    Relative Entropy Inverse Reinforcement Learning

    In: Geoffrey J. Gordon; David B. Dunson; Miroslav Dudík (Hrsg.). Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. International Conference on Artificial Intelligence and Statistics (AISTATS-2011), April 11-13, Fort Lauderdale, USA, Pages 182-189, JMLR Proceedings, Vol. 15, JMLR.org, 2011.

  6. Oliver Kroemer; Jan Peters

    A Non-Parametric Approach to Dynamic Programming

    In: John Shawe-Taylor; Richard S. Zemel; Peter L. Bartlett; Fernando C. N. Pereira; Kilian Q. Weinberger (Hrsg.). Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Neural Information Processing Systems (NeurIPS-2011), December 12-14, Granada, Spain, Pages 1719-1727, Curran Associates, Inc. 2011.

  7. Abdeslam Boularias; Oliver Kroemer; Jan Peters

    Learning robot grasping from 3-D images with Markov Random Fields

    In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), September 25-30, San Francisco, CA, USA, Pages 1548-1553, IEEE, 2011.

  8. Botond Bocsi; Duy Nguyen-Tuong; Lehel Csató; Bernhard Schölkopf; Jan Peters

    Learning inverse kinematics with structured prediction

    In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), September 25-30, San Francisco, CA, USA, Pages 698-703, IEEE, 2011.

  9. Jens Kober; Jan Peters

    Learning elementary movements jointly with a higher level task

    In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), September 25-30, San Francisco, CA, USA, Pages 338-343, IEEE, 2011.

  10. Zhikun Wang; Christoph H. Lampert; Katharina Mülling; Bernhard Schölkopf; Jan Peters

    Learning anticipation policies for robot table tennis

    In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2011), September 25-30, San Francisco, CA, USA, Pages 332-337, IEEE, 2011.