A Unified Bayesian Approach to Simultaneous Speech Recognition for Mixture Signals
Project/Area Number |
15K12063
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Perceptual information processing
|
Research Institution | Kyoto University |
Principal Investigator |
|
Co-Investigator(Kenkyū-buntansha) |
糸山 克寿 京都大学, 情報学研究科, 助教 (60614451)
|
Co-Investigator(Renkei-kenkyūsha) |
KAWAHARA Tatsuya 京都大学, 大学院情報学研究科, 教授 (00234104)
MOCHIHASHI Daichi 統計数理研究所, モデリング研究系, 准教授 (80418508)
|
Project Period (FY) |
2015-04-01 – 2017-03-31
|
Project Status |
Completed (Fiscal Year 2016)
|
Budget Amount *help |
¥3,640,000 (Direct Cost: ¥2,800,000、Indirect Cost: ¥840,000)
Fiscal Year 2016: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2015: ¥1,950,000 (Direct Cost: ¥1,500,000、Indirect Cost: ¥450,000)
|
Keywords | 音源分離 / 音声認識 / 確率モデル / ベイズモデル / MCMC |
Outline of Final Research Achievements |
We proposed a method that can simultaneously recognize multiple utterances by using a probabilictic model of source separation. Since there is uncertainty about source signals, we combined speech recognition with source separation by considering the posterior distributin of the source signals. This enabled us to obtain recognition results directly from mixture signals without uniquely determining the source signals. In addition, we proposed a source separation method based on an integrated model involving a source model and a superimposition model. Each model is represented as a mixture (LDA) or factor model (NMF) and the performance of each combination was evaluated.
|
Report
(3 results)
Research Products
(13 results)
-
-
[Journal Article] Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot2017
Author(s)
Yoshiaki Bando, Hiroshi Saruwatari, Nobutaka Ono, Shoji Makino, Katustoshi Itoyama1, Daichi Kitamura, Masaru Ishimura, Moe Takakusaki, Narumi Mae, Kouei Yamaoka, Yutaro Matsui, Yuichi Ambe, Masashi Konyo, Satoshi Tadokoro, Kazuyoshi Yoshii, Hiroshi G. Okuno
-
Journal Title
Journal of Robotics and Mechatronics
Volume: 29
Issue: 1
Pages: 198-212
DOI
NAID
ISSN
0915-3942, 1883-8049
Year and Date
2017-02-20
Related Report
Peer Reviewed / Open Access / Acknowledgement Compliant
-
-
-
[Presentation] Sound-Based Online Localization for an In-Pipe Snake Robot2016
Author(s)
Yoshiaki Bando, Hiroki Suhara, Motoyasu Tanaka, Tetsushi Kamegawa, Katsutoshi Itoyama, Kazuyoshi Yoshii, Fumitoshi Matsuno, Hiroshi G. Okuno
Organizer
IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)
Place of Presentation
Lausanne, Switzerland
Year and Date
2016-10-23
Related Report
Int'l Joint Research
-
-
-
-
-
-
-
-