References
- ADLER , D. , 1993 , Genetic algorithms and simulated annealing a marriage proposal . Proceedings of the IEEE International Conference on Neural Networks , Vol. II . San Francisco , CA , pp. 1104 – 1109 .
- ANDERSON , C. W. , 1986 , Learning and problem solving with multilayer connectionist systems . PhD thesis , University of Massachusetts ; 1987, Strategy learning with multilayer connectionist representations. Proceedings of the Fourth International Workshop on Machine Learning, Irvine, CA, pp. 103-114 .
- BARTO , A. G. , and ANANDAN , P. , 1985 , Pattern-recognizing stochastic learning automata . IEEE Transactions on Systems, Man and Cybernetics . 15 . 360 – 375 .
- BARTO , A, G. , and JORDAN , M. I. , 1987 , Gradient following without backpropagation in layered network . Proceedings of the Internationa! Joint Conference on Neural Networks , San Diego , CA , Vol. II , pp. 629 – 636 .
- BARTO , A. G. , SUTTON , R. S. , and ANDERSON , C. W. , 1983 , Neuron-like adaptive elements that can solve difficult learning control problem . IEEE Transactions on Systems. Man and Cybernetics , 13 , 834 – 847 .
- BERENJI , H. R. , and KHEDKAR , P. , 1992 , Learning and tuning fuzzy logic controllers through reinforcements . IEEE Transactions on Neural Networks , 3 , 724 – 740 .
- DAVIS , L. , 1991 , Handbook of Genetic Algorithms ( New York Van Nostrand Reinhold ).
- GOLDBERG , D. E. , 1989 . Genetic Algorithms in Search, Optimization and Machine Learning ( Reading . MA Addison-Wesley ).
- HARP , S. , SAMAD , T. , and GUHA , A. , 1990 , Designing application-specific neural networks using the genetic algorithm . Neural Information Processing Systems . Vol. 2 ( San Mateo , CA Morgan Kaufman ).
- HOLLAND , J. H. , 1962 , outline for a logical theory of adaptive systems . Journal of the Association for Computing Machinery , 3 , 297 – 314 ; 1975. Adaptation in Natural and Artificial System (Ann Arbor, MI University of Michigan) .
- LAWLER , E. L. , 1976 , Combinatorial Optimization Networks and Matroids ( New York Holt, Rinehart and Winston ).
- LUENBERGER , D. G. , 1976 , Linear and Nonlinear Programming ( Reading , MA Addison-Wesley ).
- MICHALUWICZ , Z. , and KRAWEZYK , J. B. , 1992 , A modified genetic algorithm for optimal control problems . Computers and Mathematical Applications , 23 , 83 – 94 .
- MONTANA , D. , and DAVIS , L. , 1989 , Training feedforward neural networks using genetic algorithms . Proceedings of the International Joint Conference on Artificial Intelligence , pp. 762 – 767 .
- MORIARTY , D. E. , and MHKKULAINEN , R. , 1996 , Efficient reinforcement learning through symbiotic evolution . Machine Learning . 22 , 11 – 32 .
- PETRIDIS , V. , KAZARLIS , S. , PAPAIKONOMOU , A. , and FILELIS , A. , 1992 , A hybrid genetic algorithm for training neural networks . Artificial Neural Networks 2 , edited by 1. Alcksander and J. Taylor , ( North-Holland ), pp. 953 – 956 .
- RUMELHART , D. , HINTO , G. , and WILLIAMS , R. J. , 1986 , Learning internal representation by error propagation . Parallel Distributed Processing , edited by Rumelhart, D., and McClelland ( Cambridge , MA MIT Press ), pp. 318 – 362 .
- SCHAFFER , J. D. , CARUANA , R. A. , and ESHELMAN , L. J. , 1990 , Using genetic search to exploit the emergent behavior of neural networks . Physica D , 42 , 244 – 248 .
- SUTTON , R. S. , 1984 , Temporal credit assignment in reinforcement learning . PhD thesis , University of Massachusetts , Amherst , MA , USA ; 1988, Learning to predict by the methods of temporal difference. Machine Learning, 3, 9-44 .
- TSINAS , L. , and DACHWALD , B. , 1994 , A combined neural and genetic learning algorithm . Proceedings of the IEEE International Conference on Neural Networks , vol. 1 , pp. 770 – 774 .
- WERBOS , P. J. , 1990 , A menu of design for reinforcement learning over time . Neural Networks for Control , edited by W. T. Miller, 111, R. S. Sutton, and P. J. Werbos ( Cambridge MIT Press ), Chapter 3 .
- WHITLEY , D. , DOMINIC , S. , DAS , R. , and ANDERSON , C. W. , 1993 , Genetic reinforcement learning for neurocontrol problems . Machine Learning , 13 , 259 – 284 .
- WHITLEY , D. , STARKWEATHER , T. , and BOCART , C. , 1990 , Genetic algorithm and neural networks optimizing connections and connectivity . Parallel Computing , 14 , 347 – 361 .
- WILLIAMS , R. J. , 1987 , A class of gradient-estimating algorithms for reinforcement learning in neural networks . Proceedings of the International Joint Conference on Neural Networks , San Diego , CA , Vol. II , pp. 601 – 608 .