• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Spoken term detection system with high retrieval accuracy, high speed and small resources using Deep Neural Network

Research Project

Project/Area Number 15K00241
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Perceptual information processing
Research InstitutionIwate Prefectural University

Principal Investigator

Yoshiaki Itoh  岩手県立大学, ソフトウェア情報学部, 教授 (90325928)

Co-Investigator(Kenkyū-buntansha) 李 時旭  国立研究開発法人産業技術総合研究所, 情報・人間工学領域, 主任研究員 (50415642)
Co-Investigator(Renkei-kenkyūsha) Ogura Kanayo  岩手県立大学, ソフトウェア情報学部, 講師 (10432139)
Project Period (FY) 2015-04-01 – 2018-03-31
Project Status Completed (Fiscal Year 2017)
Budget Amount *help
¥4,550,000 (Direct Cost: ¥3,500,000、Indirect Cost: ¥1,050,000)
Fiscal Year 2017: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2016: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2015: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Keywords音声言語処理 / 音声検索 / 音声中の検索語検出 / 深層学習 / Deep Neural Network / スパースベクトル / Deep Neural Net / 未知語
Outline of Final Research Achievements

This research aims the realization of high retrieval accuracy, speed up and small resources for spoken term detection among video data or voice data. The research introduced deep learning so called DNN (Deep Neural Network). The developed method utilizes the conventional retrieval method for spoken term detection and extracts candidates in the first step. It realized the high retrieval accuracy and speed up by performing detailed matching between a query and the small number of extracted candidates in the second step. Furthermore, we realized the speed up and small resources by the method of pre-retrieval for all syllable bigrams.When a spoken query is given, we developed the spoken term detection system that realized high retrieval accuracy, speed up and small resources.

Report

(4 results)
  • 2017 Annual Research Report   Final Research Report ( PDF )
  • 2016 Research-status Report
  • 2015 Research-status Report
  • Research Products

    (25 results)

All 2018 2017 2016 2015

All Journal Article (3 results) (of which Peer Reviewed: 3 results,  Open Access: 2 results,  Acknowledgement Compliant: 1 results) Presentation (22 results) (of which Int'l Joint Research: 6 results)

  • [Journal Article] A Construction Method of an Acoustic Distance Using Output Probability of Deep Neural Network for Spoken Term Detection2017

    • Author(s)
      紺野良太,小嶋和徳,李時旭,伊藤慶明
    • Journal Title

      電子情報通信学会論文誌D 情報・システム

      Volume: J100-D Issue: 8 Pages: 798-807

    • DOI

      10.14923/transinfj.2016JDP7122

    • ISSN
      1880-4535, 1881-0225
    • Year and Date
      2017-08-01
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] A Rescoring Method for Spoken Term Detection Using Output Probability of Deep Neural Network2017

    • Author(s)
      紺野良太,小嶋和徳,李時旭,伊藤慶明
    • Journal Title

      電子情報通信学会論文誌D 情報・システム

      Volume: J100-D Issue: 5 Pages: 595-604

    • DOI

      10.14923/transinfj.2016JDP7103

    • ISSN
      1880-4535, 1881-0225
    • Year and Date
      2017-05-01
    • Related Report
      2017 Annual Research Report 2016 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] 音声中の未知語の検索 語検出における音節バイグラムのインデックス化方式2016

    • Author(s)
      伊藤慶明,鳴海司朗,大内一揮,菅原翔太,李時旭
    • Journal Title

      電子情報通信学会論文誌 D

      Volume: D Vol. J99-D, 2 Pages: 178-187

    • Related Report
      2015 Research-status Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Presentation] 音声中の検索語検出における深層学習を用いた検索時間削減方式2018

    • Author(s)
      小原 真人,小嶋 和徳,伊藤 慶明,田中 和世,李 時旭
    • Organizer
      日本音響学会春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 音声中の検索語検出における最上位候補を含む講演及びその類似講演優先方式2018

    • Author(s)
      丹治 遥,小嶋 和徳,李 時旭,南條 浩輝,伊藤 慶明
    • Organizer
      日本音響学会春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 音声中の検索語検出におけるドキュメント間類似度を利用したリスコアリング方式2018

    • Author(s)
      清水 嘉乃,李 時旭,小嶋 和徳,伊藤 慶明
    • Organizer
      情報処理学会第80回全国大会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 音声検索語検出の距離値における事後確率の統合2018

    • Author(s)
      李 時旭,田中 和世,伊藤 慶明
    • Organizer
      日本音響学会春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 音声中の検索語検出の上位候補に対するSVMを用いたリランキング2017

    • Author(s)
      大内一揮,小原真人,小嶋和徳,李時旭,伊藤慶明
    • Organizer
      電子情報通信学会総合大会
    • Place of Presentation
      名城大学
    • Year and Date
      2017-03-21
    • Related Report
      2016 Research-status Report
  • [Presentation] SQ-STDにおけるDNN及びCTC導入方式の検討2017

    • Author(s)
      紺野良太,小嶋和徳,李時旭, 田中和世,伊藤慶明
    • Organizer
      日本音響学会春季研究発表会
    • Place of Presentation
      明治大学
    • Year and Date
      2017-03-15
    • Related Report
      2016 Research-status Report
  • [Presentation] 音声中の検索語検出における拗音及び長母音モデルの検討2017

    • Author(s)
      関恒平,小嶋和徳,李時旭,伊藤慶明
    • Organizer
      日本音響学会春季研究発表会
    • Place of Presentation
      明治大学
    • Year and Date
      2017-03-15
    • Related Report
      2016 Research-status Report
  • [Presentation] Constructing Acoustic Distances between Subwords and States Obtained from a Deep Neural Network for Spoken Term Detection2017

    • Author(s)
      Daisuke Kaneko, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh
    • Organizer
      INTERSPEECH
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Acceleration for Query-by-Example Using Posteriorgram of Deep Neural Network2017

    • Author(s)
      Masato Obara, Kazunori Kojima, Shi-wook Lee and Yoshiaki Itoh
    • Organizer
      Asia-Pacific Signal and Information Processing Association APSIPA
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 音声中の検索語検出におけるParagraph Vector を用いたリスコアリング手法2017

    • Author(s)
      清水 嘉乃,李 時旭,小嶋 和徳,伊藤 慶明
    • Organizer
      日本音響学会秋季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] STDにおける複数検索結果のスコア優先統合方式2016

    • Author(s)
      清水嘉乃,岩﨑瑛太郎,李時旭, 田中和世,小嶋和徳,伊藤慶明
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      富山大学
    • Year and Date
      2016-09-14
    • Related Report
      2016 Research-status Report
  • [Presentation] サブワード/状態/フレーム照合スコアの統合によるSQ-STD検索精度向上2016

    • Author(s)
      紺野良太,李時旭, 田中和世,小嶋和徳,伊藤慶明
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      富山大学
    • Year and Date
      2016-09-14
    • Related Report
      2016 Research-status Report
  • [Presentation] Rescoring by Combination of Posteriorgram Score and Subword-Matching Score for Use in Query-by-Example2016

    • Author(s)
      Masato Obara, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee and Yoshiaki Itoh
    • Organizer
      INTERSPEECH
    • Place of Presentation
      San Francisco
    • Year and Date
      2016-09-08
    • Related Report
      2016 Research-status Report
    • Int'l Joint Research
  • [Presentation] Generating Complementary Acoustic Model Spaces in DNN-Based Sequence-to-Frame DTW Scheme for Out-of-Vocabulary Spoken Term Detection2016

    • Author(s)
      Shi-wook Lee, Kazuyo Tanaka and Yoshiaki Itoh
    • Organizer
      INTERSPEECH
    • Place of Presentation
      富山大学
    • Year and Date
      2016-09-08
    • Related Report
      2016 Research-status Report
    • Int'l Joint Research
  • [Presentation] DNN分布間距離より構 築したサブワード/状態間音響距離のSTDへの適用2016

    • Author(s)
      紺野良太,李時旭,田中和世,小嶋和徳,伊藤慶明
    • Organizer
      日本音響学会春季研究発表会
    • Place of Presentation
      横浜桐蔭大学
    • Year and Date
      2016-03-09
    • Related Report
      2015 Research-status Report
  • [Presentation] DNN 出力確率系列 Posteriorgram との併用によるSTD 検索精度の向上2016

    • Author(s)
      小原真人,李時旭,田中和世,小嶋和徳,伊藤慶明
    • Organizer
      日本音響学会春季研究発表会
    • Place of Presentation
      横浜桐蔭大学
    • Year and Date
      2016-03-09
    • Related Report
      2015 Research-status Report
  • [Presentation] 音声検索語検出システムのスコアリングに関する実験的検討2016

    • Author(s)
      李時旭, 田中和世,伊藤慶明
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      明治大学
    • Related Report
      2016 Research-status Report
  • [Presentation] Rescoring by a Deep Neural Network for Spoken Term Detection2015

    • Author(s)
      Ryota Konno, Kazunori Kojima, Lee Shi-Wook, Kazuyo Tanaka, Yoshiaki Itoh
    • Organizer
      Asia-Pacific Signal and Information Processing Association APSIPA
    • Place of Presentation
      Hong Kong
    • Year and Date
      2015-12-16
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] STDにおけるフレーム レベル状態系列間照合による検 索精度向上2015

    • Author(s)
      紺野良太,李時旭,田中和世,小嶋和徳,伊藤慶明
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      会津大学
    • Year and Date
      2015-09-16
    • Related Report
      2015 Research-status Report
  • [Presentation] 確率分布間の距離近似と異種性に基づく 音 声検索語検出システムの統合2015

    • Author(s)
      李時旭,田中和世,伊藤慶明
    • Organizer
      日本音響学会秋季研究発表会
    • Place of Presentation
      会津大学
    • Year and Date
      2015-09-16
    • Related Report
      2015 Research-status Report
  • [Presentation] Evaluation of re-ranking by prioritizing highly ranked documents in spoken term detection2015

    • Author(s)
      Kazuki Oouchi, Ryota Kon'no, Takahiro Akyu, Kazuma Konno, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh
    • Organizer
      INTERSPEECH
    • Place of Presentation
      Dresden, Germany
    • Year and Date
      2015-09-07
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] 音声中の検索語検出におけるフレームレベル状態系列間照合方式2015

    • Author(s)
      紺野良太,小嶋和徳,李時旭,田中和世,伊藤慶明
    • Organizer
      電子情報通信学会技術研究報告
    • Place of Presentation
      かたくら諏訪湖ホテル
    • Year and Date
      2015-07-16
    • Related Report
      2015 Research-status Report

URL: 

Published: 2015-04-16   Modified: 2019-03-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi