Machine Discovery and Machine Learning Based on Adaptation and Evolution

Research Project

Project/Area Number	05452356
Research Category	Grant-in-Aid for General Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	Intelligent informatics
Research Institution	TOKYO INSTITUTE OF TECHNOLOGY
Principal Investigator	KOBAYASHI Shigenobu Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Professor (40016697)
Co-Investigator(Kenkyū-buntansha)	YAMAMURA Masayuki Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, Research Associate (00220442)
Project Period (FY)	1993 – 1994
Project Status	Completed (Fiscal Year 1994)
Budget Amount	¥5,600,000 (Direct Cost: ¥5,600,000); Fiscal Year 1994: ¥2,100,000 (Direct Cost: ¥2,100,000); Fiscal Year 1993: ¥3,500,000 (Direct Cost: ¥3,500,000)
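Keywords	genetic algorithm / role of coding and crossover / combinatorial optimization / multiobjective optimization / reinforcement learning / profit sharing / k-certain exploration / perceptual aliasing / subtour exchange crossover / exploitation-oriented learning / environment-identification-oriented learning / Markov analysis / mathematical-ecological analysis / deceptive boundary theorem / Q-learning / discounted gradient method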
Research Abstract	The results of the two-year project are summarized as follows.
(1) Research results in evolutionary computation
1) Analysis of the role of crossover operators in genetic algorithms: We mathematically analyzed how crossover operators contribute to finding better solutions in genetic algorithms, and presented the "deceptive boundary theorem."
2) Proposal of evaluation criteria for coding-crossover pairs: We proposed four criteria for evaluating a model formulation: completeness, soundness, non-redundancy, and character preservingness.
3) Proposal and applications of the subtour exchange crossover as a character-preserving operation: We proposed a character-preserving operation, the subtour exchange crossover, for the domain of ordering problems.
4) Proposal of genetic algorithms for multiobjective optimization problems: We established a methodology by which genetic algorithms generate the Pareto optimal set, which is the rational solution to a multiobjective optimization problem.
(2) Research results in adaptive computation
1) Proposal of a framework and a categorization for reinforcement learning: We defined a transparent framework for reinforcement learning, and classified existing research according to the class of the environment and the orientation of the approach.
2) Analysis of the rationality of exploitation-oriented reinforcement learning: We mathematically analyzed the rationality of the sharing functions in the profit sharing method, and presented two rationality theorems.
3) Proposal and extension of the k-certain exploration method: We proposed an exploration-oriented action selection strategy, the k-certain exploration method, for efficiently identifying an unknown environment.
4) Proposal of a reinforcement learning method based on hill climbing of the expected rewards: We proposed an incremental reinforcement learning method that performs hill climbing along the gradient of the expected rewards.
5) Proposal of a learning method under perceptual aliasing: We proposed a prediction model that foresees state transitions under incomplete and deceptive perception, and developed a learning method that constructs the prediction model by trial and error.
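As an illustration of the Pareto optimal set mentioned under multiobjective optimization, the following is a minimal sketch of non-dominated filtering for a minimization problem. It is not the project's algorithm: the function names `dominates` and `pareto_front` and the sample data are assumptions introduced here, and a genetic algorithm would apply this kind of filter to the objective vectors of its population rather than to a fixed list.

```python
from typing import List, Sequence, Tuple


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a dominates b (minimization).

    a dominates b when a is no worse in every objective
    and strictly better in at least one.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical two-objective candidates (both objectives are minimized).
candidates = [(1.0, 5.0), (2.0, 3.0), (3.0, 4.0), (4.0, 1.0), (2.5, 3.5)]
front = pareto_front(candidates)
print(front)
```

Here (3.0, 4.0) and (2.5, 3.5) are dropped because (2.0, 3.0) is better in both objectives; the remaining points are mutually incomparable trade-offs.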

Report

(3 results)

1994 Annual Research Report / Final Research Report Summary
1993 Annual Research Report

Research Products

(32 results)

All Publications (32 results)

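[Publications] Kobayashi, S.: "The State of the Art of Genetic Algorithms" Journal of the Society of Instrument and Control Engineers. 32. 2-9 (1993)
- Description
  From the Summary of Research Results Report (Japanese version)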
- Related Report
  1994 Final Research Report Summary
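[Publications] Yamamura, M. and Kobayashi, S.: "Toward Application Methodology of Genetic Algorithms" Journal of Japanese Society for Artificial Intelligence. 9. 506-511 (1994)
- Description
  From the Summary of Research Results Report (Japanese version)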
- Related Report
  1994 Final Research Report Summary
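[Publications] Kobayashi, S. and Yamamura, M.: "Search and Learning by Genetic Algorithms" Journal of the Robotics Society of Japan. 13. 57-62 (1995)
- Description
  From the Summary of Research Results Report (Japanese version)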
- Related Report
  1994 Final Research Report Summary
[Publications] Yamamura, M., Satoh, H. and Kobayashi, S.: "An Analysis of Crossover's Effect in Genetic Algorithms" Proc. of 1st IEEE Conf. on Evolutionary Computation. 613-618 (1994)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  1994 Final Research Report Summary
[Publications] Sato, H., Yamamura, M. and Kobayashi, S.: "An Analysis of Genetic Algorithms with Population Dynamics" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 449-452 (1994)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  1994 Final Research Report Summary
[Publications] Miyazaki, K., Yamamura, M. and Kobayashi, S.: "On the Rationality of Profit Sharing in Reinforcement Learning" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 285-288 (1994)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  1994 Final Research Report Summary
[Publications] Kimura, H., Yamamura, M. and Kobayashi, S.: "Reinforcement Learning with Delayed Rewards on Continuous State Space" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 289-292 (1994)
- Description
  From the Summary of Research Results Report (Japanese version)
- Related Report
  1994 Final Research Report Summary
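[Publications] Kobayashi, S. and Yamamura, M. (contributing authors): "Genetic Algorithms" Sangyo Tosho, 328 (1993)
- Description
  From the Summary of Research Results Report (Japanese version)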
- Related Report
  1994 Final Research Report Summary
[Publications] Kobayashi, S.: "The State of the Art of Genetic Algorithms" Journal of the Society of Instrument and Control Engineers. vol.32. 2-9 (1993)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Yamamura, M. and Kobayashi, S.: "Toward Application Methodology of Genetic Algorithms" Journal of Japanese Society for Artificial Intelligence. vol.9. 506-511 (1994)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Kobayashi, S. and Yamamura, M.: "Search and Learning by Genetic Algorithms" Journal of the Robotics Society of Japan. vol.13. 57-62 (1995)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Yamamura, M., Satoh, H. and Kobayashi, S.: "An Analysis of Crossover's Effect in Genetic Algorithms" Proc. of 1st IEEE Conf. on Evolutionary Computation. 613-618 (1994)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Sato, H., Yamamura, M. and Kobayashi, S.: "An Analysis of Genetic Algorithms with Population Dynamics" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 449-452 (1994)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Miyazaki, K., Yamamura, M. and Kobayashi, S.: "On the Rationality of Profit Sharing in Reinforcement Learning" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 285-288 (1994)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Kimura, H., Yamamura, M. and Kobayashi, S.: "Reinforcement Learning with Delayed Rewards on Continuous State Space" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 289-292 (1994)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Kobayashi, S. and Yamamura, M.: "The Role of Coding and Crossover in Genetic Algorithms" in Kitano, H. ed., Genetic Algorithms. Sangyo Tosho, 43-59 (1993)
- Description
  From the Summary of Research Results Report (English version)
- Related Report
  1994 Final Research Report Summary
[Publications] Kobayashi, S.: "The State of the Art of Genetic Algorithms" Journal of the Society of Instrument and Control Engineers. 32. 2-9 (1993)
- Related Report
  1994 Annual Research Report
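[Publications] Yamamura, M. and Kobayashi, S.: "Toward Application Methodology of Genetic Algorithms" Journal of Japanese Society for Artificial Intelligence. 9. 506-511 (1994)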
- Related Report
  1994 Annual Research Report
[Publications] Kobayashi, S. and Yamamura, M.: "Search and Learning by Genetic Algorithms" Journal of the Robotics Society of Japan. 13. 57-62 (1995)
- Related Report
  1994 Annual Research Report
[Publications] Yamamura, M., Satoh, H. and Kobayashi, S.: "An Analysis of Crossover's Effect in Genetic Algorithms" Proc. of 1st IEEE Conf. on Evolutionary Computation. 613-618 (1994)
- Related Report
  1994 Annual Research Report
[Publications] Sato, H., Yamamura, M. and Kobayashi, S.: "An Analysis of Genetic Algorithms with Population Dynamics" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 449-452 (1994)
- Related Report
  1994 Annual Research Report
[Publications] Miyazaki, K., Yamamura, M. and Kobayashi, S.: "On the Rationality of Profit Sharing in Reinforcement Learning" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 285-288 (1994)
- Related Report
  1994 Annual Research Report
[Publications] Kimura, H., Yamamura, M. and Kobayashi, S.: "Reinforcement Learning with Delayed Rewards on Continuous State Space" Proc. of 3rd Int. Conf. on Fuzzy Logic, Neural Nets and Soft Computing. 289-292 (1994)
- Related Report
  1994 Annual Research Report
[Publications] Kobayashi, S. and Yamamura, M. (contributing authors): "Genetic Algorithms" Sangyo Tosho, 328 (1993)
- Related Report
  1994 Annual Research Report
[Publications] Kobayashi, S.: "The State of the Art of Genetic Algorithms" Journal of the Society of Instrument and Control Engineers. 32. 2-9 (1993)
- Related Report
  1993 Annual Research Report
[Publications] Kobayashi, S.: "Fundamentals and Applications of Genetic Algorithms" Operations Research (オペレーションズ・リサーチ). 38. 256-261 (1993)
- Related Report
  1993 Annual Research Report
[Publications] Kobayashi, S., Ono, I. and Yamamura, M.: "Performance Evaluation of Job-Shop Scheduling by Genetic Algorithms" Proc. of the Production Scheduling Symposium. 27-32 (1993)
- Related Report
  1993 Annual Research Report
[Publications] Yamamura, M.: "Reinforcement Learning" Journal of Japanese Society for Artificial Intelligence. 8. 833-834 (1993)
- Related Report
  1993 Annual Research Report
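[Publications] Kimura, H., Yamamura, M. and Kobayashi, S.: "Reinforcement Learning with a Continuous State Space and Delayed Reward Input" Proc. of the 5th Symposium on Autonomous Decentralized Systems. 9-14 (1994)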
- Related Report
  1993 Annual Research Report
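[Publications] Sato, H., Yamamura, M. and Kobayashi, S.: "A Mathematical-Ecological Analysis of Genetic Algorithms" Proc. of the 5th Symposium on Autonomous Decentralized Systems. 123-128 (1994)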
- Related Report
  1993 Annual Research Report
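[Publications] Miyazaki, K., Yamamura, M. and Kobayashi, S.: "A Theoretical Study of Reward Assignment in Reinforcement Learning" Journal of Japanese Society for Artificial Intelligence. 9 (to appear). (1994)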
- Related Report
  1993 Annual Research Report
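[Publications] Nakagawa, H., Yamamura, M. and Kobayashi, S.: "On Generation Alternation and Diversity Maintenance in Genetic Algorithms" 19th Intelligent Systems Symposium. (to be presented). (1994)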
- Related Report
  1993 Annual Research Report