MINOH Michihiko, Kyoto University, Academic Center for Computing and Media Studies, Professor (70166099)
IKEDA Katsuo, Osaka Institute of Technology, Faculty of Information Science and Technology, Professor (30026009)
SUENAGA Yasuhito, Nagoya University, Graduate School of Engineering, Professor (60293643)
YOKOYA Naokazu, Nara Institute of Science and Technology, Graduate School of Information Science, Professor (10252834)
KAWASHIMA Toshio, Future University-Hakodate, Department of Media Architecture, Professor (20152952)
Pattern recognition and understanding (PRU) is a fundamental technology for realizing intelligent visual information media that can select, summarize, and augment visual information from the real world according to the needs of each individual. The purpose of this project is to restructure PRU technology, which was originally developed to build automatic machines that replace humans, into a key technology for multimedia fields, where information media are expected to support human activity and augment human ability. The critical difference between the two cases is the consumer of the PRU output: in multimedia systems it is a human, while in automated systems that substitute for humans it is a machine.
As common research targets of the project, we focused on three problems that are typical and fundamental in PRU for information media: 1) a mechanism to handle the actions and behavior of humans; 2) a mechanism to handle the ambiguous demands of humans; 3) a mechanism to handle a real world to which simple modeling is not applicable. These classic and difficult problems were solved successfully by fully exploiting the following three assumptions: I) the final consumer of recognition results is a human; II) real-time interaction between computers and humans is possible; III) visual data captured over long periods, with multiple cameras and wide fields of view, are available.
We have developed several visual information systems, including a distance learning system with intelligent camera control, a mixed-reality community space with eye-contact recovery, a personal episode recording system with a visual information summarization mechanism, and a video surveillance system using multiple omnidirectional image sensors.