Project/Area Number |
15K00241
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Multi-year Fund |
Section | General |
Research Field |
Perceptual information processing
|
Research Institution | Iwate Prefectural University |
Principal Investigator |
Yoshiaki Itoh Iwate Prefectural University, Faculty of Software and Information Science, Professor (90325928)
|
Co-Investigator(Kenkyū-buntansha) |
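李 時旭 National Institute of Advanced Industrial Science and Technology (AIST), Department of Information Technology and Human Factors, Senior Researcher (50415642)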
|
Co-Investigator(Renkei-kenkyūsha) |
Ogura Kanayo Iwate Prefectural University, Faculty of Software and Information Science, Lecturer (10432139)
|
Project Period (FY) |
2015-04-01 – 2018-03-31
|
Project Status |
Completed (Fiscal Year 2017)
|
Budget Amount |
¥4,550,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥1,050,000)
Fiscal Year 2017: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Fiscal Year 2016: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Fiscal Year 2015: ¥1,690,000 (Direct Cost: ¥1,300,000, Indirect Cost: ¥390,000)
|
Keywords | Spoken language processing / Spoken document retrieval / Spoken term detection / Deep learning / Deep Neural Network / Sparse vector / Deep Neural Net / Out-of-vocabulary words |
Outline of Final Research Achievements |
This research aimed to achieve high retrieval accuracy, fast search, and low resource usage for spoken term detection in video and audio data. To this end, it introduced deep learning in the form of a DNN (Deep Neural Network). In the first step, the developed method extracts candidates using a conventional spoken term detection method. In the second step, it performs detailed matching between the query and the small number of extracted candidates, which yields both high retrieval accuracy and fast search. Search was further accelerated, and resource usage reduced, by running retrieval in advance for all syllable bigrams. We also developed a spoken term detection system that achieves high retrieval accuracy, fast search, and low resource usage when given a spoken query.
|
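The two-step search described in the outline can be illustrated with a minimal sketch. It is not the project's implementation. It assumes that documents and queries are already available as syllable sequences, that the "pre-retrieval for all syllable bigrams" can be approximated by an inverted index from bigrams to positions, and that a plain edit distance can stand in for the DNN-based detailed matching. All function names (`build_bigram_index`, `extract_candidates`, `detect_term`) and the toy data are hypothetical.

```python
from collections import Counter, defaultdict


def build_bigram_index(docs):
    """Precompute, for every syllable bigram, the (doc_id, position) pairs where it occurs."""
    index = defaultdict(list)
    for doc_id, syllables in enumerate(docs):
        for pos in range(len(syllables) - 1):
            index[(syllables[pos], syllables[pos + 1])].append((doc_id, pos))
    return index


def extract_candidates(index, query, top_n=5):
    """Step 1 (assumed form): vote for likely start positions using the precomputed bigram hits."""
    votes = Counter()
    for offset in range(len(query) - 1):
        for doc_id, pos in index.get((query[offset], query[offset + 1]), []):
            start = pos - offset
            if start >= 0:
                votes[(doc_id, start)] += 1
    return [candidate for candidate, _ in votes.most_common(top_n)]


def edit_distance(a, b):
    """Levenshtein distance between two syllable sequences (stand-in for DNN-based matching)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def detect_term(docs, index, query, top_n=5):
    """Step 2: run the detailed match only on the few candidates from step 1."""
    scored = []
    for doc_id, start in extract_candidates(index, query, top_n):
        window = docs[doc_id][start:start + len(query)]
        scored.append((edit_distance(query, window), doc_id, start))
    return sorted(scored)  # smallest distance first


# Toy data: each document is a list of syllables (hypothetical example).
docs = [
    ["ko", "n", "ni", "chi", "wa"],
    ["o", "n", "se", "i", "ke", "n", "sa", "ku"],
    ["ke", "n", "sa", "ku", "go"],
]
index = build_bigram_index(docs)
print(detect_term(docs, index, ["ke", "n", "sa", "ku"]))
```

Each result is a `(distance, doc_id, start)` tuple. Because the index is built once in advance, a query costs only a few lookups plus the detailed match on at most `top_n` windows, which is the trade-off the outline describes.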