20
Views
0
CrossRef citations to date
0
Altmetric
Original Articles

Learning action probabilities from delayed reinforcement

&
Pages 2415-2421 | Received 13 May 1992, Published online: 16 May 2007

REFERENCES

  • BARTO , A. G. , SUTTON , R. S. , and ANDERSON , C. W. , 1983 , IEEE Trans. Systems, Man. Cyber. , 13 , 834 – 846 .
  • HERTZ , J. , KROGH , A. , and PORLNER , R. G. , 1991 , Introduction to the Theory of Neural Computing ( New York Addison-Wesley ).
  • HINTON , G. E. , 1989 , Artificial Intelligence , 40 , 185 .
  • LIN , C. S. , and KIM , H. , 1991 , IEEE Trans. Neural Networks , 2 , 530 .
  • MICHIE , D. , and CHAMBERS , R. A. , 1968 , BOXES An experiment in adaptive control , Machine Intelligence 2 , edited by E. DALE and D. MICHIE ( Edinburgh , U.K. Oliver and Boyd ), pp. 137 – 152 .
  • NARENDRA , K. S. , and THATHACHAR , M. A. L. , 1989 , Learning Automata - An Introduction ( New York Prentice Hall ).
  • THATHACHAR , M. A. L. , and SASTRY , P. S. , 1985 , IEEE Trans. Systems Man. Cyber. , 15 , 168 .
  • YAGER , R. R. , 1990 , IEEE Trans. Systems Man. Cyber. , 20 , 1229 – 1234 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.