Abstract
Movement generation that is consistent with observed or demonstrated behaviour is an efficient way to seed movement planning in complex, high-dimensional movement systems such as humanoid robots. We present a method for learning potential-based policies from constrained motion data. In contrast to previous approaches to direct policy learning, our method can combine observations from a variety of contexts in which different constraints are in force, to learn the underlying unconstrained policy in the form of its potential function. This allows us to generalise and predict behaviour where novel constraints apply. We demonstrate our approach on systems of varying complexity, including kinematic data from the ASIMO humanoid robot with 22 degrees of freedom.
Notes
1For a review on DPL, please see (Billard et al. 2007) and references therein.
2It should be noted that, as with all DPL approaches, the choice of state-space is problem specific (Schaal et al. 2003) and, when used for imitation learning, depends on the correspondence between demonstrator and imitator. For example, if we wish to learn the policy a human demonstrator uses to wash a window, and transfer that behaviour to an imitator robot, an appropriate choice of x would be the Cartesian coordinates of the hand, which would correspond to the end-effector coordinates of the robot. Transfer of behaviour across non-isomorphic state spaces, for example if the demonstrator and imitator have different embodiments, is also possible by defining an appropriate state-action metric (Alissandrakis et al. 2007).
3A† denotes the (unweighted) Moore–Penrose pseudoinverse of the matrix A.
4It should be noted that, in general, the orientation of the constraint plane onto which the policy is projected may vary with both state position and time.
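The projection mentioned in notes 3 and 4 can be sketched in NumPy. This is a minimal illustration, not the paper's exact formulation: the constraint matrix A, the unconstrained policy output pi_x, and the standard null-space projector u = (I − A†A)·pi(x) are assumptions made for the example.

```python
import numpy as np

def project_policy(A, pi_x):
    """Project an unconstrained policy output pi(x) onto the null space
    of a constraint matrix A: u = (I - A^+ A) pi(x), where A^+ is the
    Moore-Penrose pseudoinverse (np.linalg.pinv)."""
    n = pi_x.shape[0]
    N = np.eye(n) - np.linalg.pinv(A) @ A  # null-space projector
    return N @ pi_x

# Example: a single row constraint removes the policy component
# along that row's direction, leaving the other components intact.
A = np.array([[1.0, 0.0, 0.0]])        # constrain motion along the first axis
pi_x = np.array([0.5, -0.2, 0.3])      # assumed unconstrained policy output
u = project_policy(A, pi_x)            # observed (constrained) action
```

Here `A @ u` is zero by construction, mirroring the fact that only the constraint-consistent component of the policy is observable; if A changes with state or time (note 4), the projector must be recomputed at every step.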
5It should be noted that these trajectories are not outliers in the sense of containing corrupt data, and could in fact be used for further training of the model. For example, one could take a hierarchical approach, where groups of strongly connected trajectories are aligned first to form models consisting of groups of trajectories with good alignment. We can then recursively repeat the process, aligning these larger (but more weakly connected) groups until all of the data has been included.
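The hierarchical scheme described in note 5 can be sketched as greedy agglomerative merging. The pairwise connection scores, the average-linkage rule, and the stopping threshold below are illustrative assumptions, not the paper's actual alignment criterion.

```python
import numpy as np

def hierarchical_grouping(scores, threshold=0.0):
    """Repeatedly merge the pair of trajectory groups with the strongest
    connection until no pair exceeds `threshold`. `scores[i][j]` is an
    assumed pairwise alignment quality between trajectories i and j."""
    groups = [[i] for i in range(len(scores))]

    def link(a, b):
        # Average connection strength between two groups of trajectories.
        return np.mean([scores[i][j] for i in a for j in b])

    while len(groups) > 1:
        pairs = [(link(groups[a], groups[b]), a, b)
                 for a in range(len(groups))
                 for b in range(a + 1, len(groups))]
        best, a, b = max(pairs)
        if best < threshold:
            break  # remaining groups are too weakly connected to merge
        groups[a] = groups[a] + groups[b]
        del groups[b]
    return groups
```

With three trajectories where the first two align well and the third only weakly, the sketch first aligns the strongly connected pair and then stops, leaving the weakly connected trajectory as its own group until a lower threshold admits it.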
6Since the goal of the experiments was to validate the proposed approach, we used policies known in closed form as a ground truth. In the follow-up paper we apply our method to human motion capture data.
7A detailed explanation of the error measures used can be found in Appendix B.
8Please note that we also discard the outliers for evaluating the error statistics—we can hardly expect to observe good performance in regions where the learnt model f(x) has seen no data.
93 DOFs per hand × 2 hands.