90
Views
5
CrossRef citations to date
0
Altmetric
Original Articles

GA-based reinforcement learning for neural networks

, &
Pages 233-247 | Received 05 Jul 1997, Accepted 29 Aug 1997, Published online: 16 May 2007

References

  • ADLER , D. , 1993 , Genetic algorithms and simulated annealing a marriage proposal . Proceedings of the IEEE International Conference on Neural Networks , Vol. II . San Francisco , CA , pp. 1104 – 1109 .
  • ANDERSON , C. W. , 1986 , Learning and problem solving with multilayer connectionist systems . PhD thesis , University of Massachusetts ; 1987, Strategy learning with multilayer connectionist representations. Proceedings of the Fourth International Workshop on Machine Learning, Irvine, CA, pp. 103-114 .
  • BARTO , A. G. , and ANANDAN , P. , 1985 , Pattern-recognizing stochastic learning automata . IEEE Transactions on Systems, Man and Cybernetics . 15 . 360 – 375 .
  • BARTO , A, G. , and JORDAN , M. I. , 1987 , Gradient following without backpropagation in layered network . Proceedings of the Internationa! Joint Conference on Neural Networks , San Diego , CA , Vol. II , pp. 629 – 636 .
  • BARTO , A. G. , SUTTON , R. S. , and ANDERSON , C. W. , 1983 , Neuron-like adaptive elements that can solve difficult learning control problem . IEEE Transactions on Systems. Man and Cybernetics , 13 , 834 – 847 .
  • BERENJI , H. R. , and KHEDKAR , P. , 1992 , Learning and tuning fuzzy logic controllers through reinforcements . IEEE Transactions on Neural Networks , 3 , 724 – 740 .
  • DAVIS , L. , 1991 , Handbook of Genetic Algorithms ( New York Van Nostrand Reinhold ).
  • GOLDBERG , D. E. , 1989 . Genetic Algorithms in Search, Optimization and Machine Learning ( Reading . MA Addison-Wesley ).
  • HARP , S. , SAMAD , T. , and GUHA , A. , 1990 , Designing application-specific neural networks using the genetic algorithm . Neural Information Processing Systems . Vol. 2 ( San Mateo , CA Morgan Kaufman ).
  • HOLLAND , J. H. , 1962 , outline for a logical theory of adaptive systems . Journal of the Association for Computing Machinery , 3 , 297 – 314 ; 1975. Adaptation in Natural and Artificial System (Ann Arbor, MI University of Michigan) .
  • LAWLER , E. L. , 1976 , Combinatorial Optimization Networks and Matroids ( New York Holt, Rinehart and Winston ).
  • LUENBERGER , D. G. , 1976 , Linear and Nonlinear Programming ( Reading , MA Addison-Wesley ).
  • MICHALUWICZ , Z. , and KRAWEZYK , J. B. , 1992 , A modified genetic algorithm for optimal control problems . Computers and Mathematical Applications , 23 , 83 – 94 .
  • MONTANA , D. , and DAVIS , L. , 1989 , Training feedforward neural networks using genetic algorithms . Proceedings of the International Joint Conference on Artificial Intelligence , pp. 762 – 767 .
  • MORIARTY , D. E. , and MHKKULAINEN , R. , 1996 , Efficient reinforcement learning through symbiotic evolution . Machine Learning . 22 , 11 – 32 .
  • PETRIDIS , V. , KAZARLIS , S. , PAPAIKONOMOU , A. , and FILELIS , A. , 1992 , A hybrid genetic algorithm for training neural networks . Artificial Neural Networks 2 , edited by 1. Alcksander and J. Taylor , ( North-Holland ), pp. 953 – 956 .
  • RUMELHART , D. , HINTO , G. , and WILLIAMS , R. J. , 1986 , Learning internal representation by error propagation . Parallel Distributed Processing , edited by Rumelhart, D., and McClelland ( Cambridge , MA MIT Press ), pp. 318 – 362 .
  • SCHAFFER , J. D. , CARUANA , R. A. , and ESHELMAN , L. J. , 1990 , Using genetic search to exploit the emergent behavior of neural networks . Physica D , 42 , 244 – 248 .
  • SUTTON , R. S. , 1984 , Temporal credit assignment in reinforcement learning . PhD thesis , University of Massachusetts , Amherst , MA , USA ; 1988, Learning to predict by the methods of temporal difference. Machine Learning, 3, 9-44 .
  • TSINAS , L. , and DACHWALD , B. , 1994 , A combined neural and genetic learning algorithm . Proceedings of the IEEE International Conference on Neural Networks , vol. 1 , pp. 770 – 774 .
  • WERBOS , P. J. , 1990 , A menu of design for reinforcement learning over time . Neural Networks for Control , edited by W. T. Miller, 111, R. S. Sutton, and P. J. Werbos ( Cambridge MIT Press ), Chapter 3 .
  • WHITLEY , D. , DOMINIC , S. , DAS , R. , and ANDERSON , C. W. , 1993 , Genetic reinforcement learning for neurocontrol problems . Machine Learning , 13 , 259 – 284 .
  • WHITLEY , D. , STARKWEATHER , T. , and BOCART , C. , 1990 , Genetic algorithm and neural networks optimizing connections and connectivity . Parallel Computing , 14 , 347 – 361 .
  • WILLIAMS , R. J. , 1987 , A class of gradient-estimating algorithms for reinforcement learning in neural networks . Proceedings of the International Joint Conference on Neural Networks , San Diego , CA , Vol. II , pp. 601 – 608 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.