Skip to main content Skip to main navigation

Publications

 

Due to maintenance work, it is currently not possible to search for publications by author.

Displaying results 151 to 160 of 12981.
  1. Katharina Mülling; Jens Kober; Jan Peters

    Learning table tennis with a Mixture of Motor Primitives

    In: 10th IEEE-RAS International Conference on Humanoid Robots. IEEE-RAS International Conference on Humanoid Robots (Humanoids-2010), December 6-8, …

  2. Jan Peters; Katharina Mülling; Yasemin Altun

    Relative Entropy Policy Search

    In: Maria Fox; David Poole (Hrsg.). Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial …

  3. Tetsuro Morimura; Eiji Uchibe; Junichiro Yoshimoto; Jan Peters; Kenji Doya

    Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

    In: Neural Computation, Vol. 22, No. 2, Pages 342-376, MIT Press, 2010.

  4. Daan Wierstra; Alexander Förster; Jan Peters; Jürgen Schmidhuber

    Recurrent policy gradients

    In: Logic Journal of the IGPL Oxford, Vol. 18, No. 5, Pages 620-634, Oxford University Press, 2010.

  5. Jeremy L. Wyatt; Peter Dayan; Ales Leonardis; Jan Peters

    Exploration and Curiosity in Robot Learning and Inference (Dagstuhl Seminar 11131)

    In: Dagstuhl Reports, Vol. 1, No. 3, Pages 67-95, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, 2011.

  6. Yevgeny Seldin; Nicolò Cesa-Bianchi; François Laviolette; Peter Auer; John Shawe-Taylor; Jan Peters

    PAC-Bayesian Analysis of the Exploration-Exploitation Trade-off

    In: Computing Research Repository eprint Journal (CoRR), Vol. abs/1105.4585, Pages 0-10, arXiv, 2011.

  7. Yevgeny Seldin; François Laviolette; John Shawe-Taylor; Jan Peters; Peter Auer

    PAC-Bayesian Analysis of Martingales and Multiarmed Bandits

    In: Computing Research Repository eprint Journal (CoRR), Vol. abs/1105.2416, Pages 0-10, arXiv, 2011.

  8. Abdeslam Boularias; Jens Kober; Jan Peters

    Relative Entropy Inverse Reinforcement Learning

    In: Geoffrey J. Gordon; David B. Dunson; Miroslav Dudík (Hrsg.). Proceedings of the Fourteenth International Conference on Artificial Intelligence and …

  9. Oliver Kroemer; Jan Peters

    A Non-Parametric Approach to Dynamic Programming

    In: John Shawe-Taylor; Richard S. Zemel; Peter L. Bartlett; Fernando C. N. Pereira; Kilian Q. Weinberger (Hrsg.). Advances in Neural Information …

  10. Abdeslam Boularias; Oliver Kroemer; Jan Peters

    Learning robot grasping from 3-D images with Markov Random Fields

    In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems …