Baldwinian Evolution of Task-Specialised High-Efficiency Learning in Neural Networks

Research Project

Project/Area Number	23K11262
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Review Section	Basic Section 61040:Soft computing-related
Research Institution	Shinshu University
Principal Investigator	Arnold Solvi 信州大学, 工学部, 准教授(特定雇用) (80764935)
Co-Investigator(Kenkyū-buntansha)	有田隆也名古屋大学, 情報学研究科, 教授 (40202759) 鈴木麗璽名古屋大学, 情報学研究科, 准教授 (20362296)
Project Period (FY)	2023-04-01 – 2026-03-31
Project Status	Granted (Fiscal Year 2023)
Budget Amount *help	¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000) Fiscal Year 2025: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2024: ¥2,990,000 (Direct Cost: ¥2,300,000、Indirect Cost: ¥690,000) Fiscal Year 2023: ¥520,000 (Direct Cost: ¥400,000、Indirect Cost: ¥120,000)
Keywords	neural networks / artificial intelligence / learning algorithms / evolution of learning / meta-learning / Baldwin effect / artificial life / 人工知能 / ニューラルネットワーク / 人工生命 / ボールドウィン効果 / 機械学習
Outline of Research at the Start	Learning is a key aspect of intelligence. In contrast to AI, humans can learn efficiently from limited experience. Human cognition is evolutionarily specialised to learn important tasks (e.g. learning to walk, acquiring language) rapidly. This is hypothesised to be a core factor in cognitive evolution. We computationally model the evolution of learning ability using neural networks, with a focus on such specialisation. Our goals are 1) to explore how we can make AI learning more human-like, and 2) to gain new insights in the evolution of intelligence in nature.
Outline of Annual Research Achievements	The main goal for this year was a proof-of-concept implementation of the hypothesised evolutionary scenario. We developed a model for evolution of neural networks with mechanisms for both reward-driven learning (an existing Reinforcement Learning algorithm) and direct synaptic weight modification via neuromodulation (a novel implementation of the neuromodulation concept that considers columnar neural structures). We designed 2D and 3D task domains consisting of navigation tasks that require individual learning to solve. We let neural network populations evolve on these domains, and analysed how learning abilities evolved. The resulting evolutionary dynamics are consistent with our theory: first reward-driven learning appears, then non-reward information is gradually integrated into the learning process via the neuromodulation mechanism, thereby improving the efficiency of the learning process. On the present task domains, evolution eventually eliminates the need for reward signals altogether, enabling reward-agnostic learning of the tasks. We performed a quantitative comparison with a representative non-evolutionary Reinforcement Learning algorithm, and found that the learning abilities evolved in our model learn over 300 times faster on the task domain. These results support our theory and indicate its potential for improving learning ability in neural networks. We prepared a conference paper discussing our theory and results.
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason The project is mostly proceeding as planned. We switched from a locomotion task to a navigation task for the initial proof of concept because the latter is computationally lightweight, allowing us to experiment more effectively with various neural network implementations during the early stages of the project. Results on the navigation task exceeded expectations. The originally planned locomotion task has also been developed, and we plan to run experiments on this task in FY2024.
Strategy for Future Research Activity	In FY2024 so far, we submitted a conference paper on the theory and our first results, and made this paper publicly available as a pre-print. The main research direction for FY2024 will be to diversify the tasks we apply the system to. From a theoretical point of view, this should help clarify the role of the hypothesised evolutionary dynamic in biological evolution. From a practical point of view, this will clarify what sort of tasks the system solves well and what sort of tasks will require further development. We also plan to release source code to allow others to experiment with the approach.