Universal-Phonetic-Segment-Based Speech Coding and Its Applications to Speech Processing

Research Project

Project/Area Number	15300026
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	Media informatics/Database
Research Institution	University of Tsukuba
Principal Investigator	TANAKA Kazuyo University of Tsukuba, Graduate School of Library, Information and Media Studies, Professor, 大学院・図書館情報メディア研究科, 教授 (70344207)
Co-Investigator(Kenkyū-buntansha)	ITOH Yoshiaki Iwate Prefectural University, Faculty of Software and Information Science, Associate Professor, ソフトウエア情報学部, 助教授 (90325928) OKAWA Shigeki Chiba Institute of Technology, Dept.of Information and Network Science, Associate Professor, 情報科学部, 助教授 (40306395) KOJIMA Hiroaki National Institute of Advanced Industrial Science and Technology, Research Group Leader, 情報技術研究部門, グループリーダ (80356980)
Project Period (FY)	2003 – 2005
Project Status	Completed (Fiscal Year 2005)
Budget Amount *help	¥16,500,000 (Direct Cost: ¥16,500,000) Fiscal Year 2005: ¥4,700,000 (Direct Cost: ¥4,700,000) Fiscal Year 2004: ¥5,200,000 (Direct Cost: ¥5,200,000) Fiscal Year 2003: ¥6,600,000 (Direct Cost: ¥6,600,000)
Keywords	speech recognition / spoken document retrieval / phonetic code / IPA / Dynamic Programming / phone model / multilingual / open vocabulary / 汎用音声符号 / 音声音響モデル / 音声要約
Research Abstract	In this project, we present a novel speech processing framework, where all of the acoustic speech samples are once encoded into universal phonetic segment (UPS) sequences and spoken document processing (SDP) systems, such as recognition, retrieval, indexing, are constructed on this UPS domain. Adopting this framework, the SDP systems are separated from the original acoustic correlates or environments. This makes it possible to realize such flexibility that recognition-type processing can be handled by just calculating distances between UPS sequences, and also can be constructed on distributed processing schemes. Through this project, we have developed the following component techniques on this framework : 1)an original fine sub-phonetic segment (SPS) set as the UPS set, which brought high performance recognition and easy processing of multilingual speech, 2)effective DP(dynamic programming)-based sequence matching algorithms, called Shift CDP and Relay CDP. Effectiveness of the processing framework, the SPS set, and DP-based algorithms are evaluated by constructing speech recognition and open vocabulary spoken document retrieval (SDR) systems. Experimental results showed that the proposed SDP systems are superior to those based on conventional methods in performance evaluation. We have finally constructed a real time open vocabulary SDR system for demonstration, in which the system can retrieve broadcast video by user's speech.

Report

(4 results)

2005 Annual Research Report Final Research Report Summary
2004 Annual Research Report
2003 Annual Research Report

Research Products
(87 results)

All 2006 2005 2004 2003 Other

All Journal Article (62 results) Book (3 results) Patent(Industrial Property Rights) (2 results) Publications (20 results)

[Journal Article] HMM-based noise-robust feature compensation2006
- Author(s)
  Akira Sasou
- Journal Title
  
  International Journal of Speech Communication Accepted, In publication
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] 分析区間長を可変としたテキスト分割手法2006
- Author(s)
  内海慶, 藤井敦, 田中和世
- Journal Title
  
  言語処理学会12回年次大会(NLP2006)論文集 1
  
  Pages: 4-4
- Related Report
  2005 Annual Research Report
[Journal Article] 語彙フリー音声検索における時間精緻化サブワードモデルの検討2006
- Author(s)
  岩田耕平, 伊藤慶明, 小嶋和徳, 石亀昌明, 田中和世, 李時旭
- Journal Title
  
  日本音響学会2006年春季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] スポッティング区間の再認識に基づく音声検索性能の向上2006
- Author(s)
  大竹, 岩田, 伊藤, 小嶋, 石亀, 田中, 李
- Journal Title
  
  日本音響学会2006年春季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] Instantaneous frequencies of signals obtained by the analytic signal method2006
- Author(s)
  H.Suzuki, F.Ma, H.Izumi, O.Yamazaki, S.Okawa, K.Kido
- Journal Title
  
  Acoustical Science and Technology 27・3
  
  Pages: 8-8
- Related Report
  2005 Annual Research Report
[Journal Article] Multi-mixture based PDT-SSS Algorithm for Extension of an HMnet Structure2006
- Author(s)
  石洙永, 李時旭, 児島宏明
- Journal Title
  
  日本音響学会2006年春季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] 電動車いす搭載用平行マイクアレイ実装方式の検討2006
- Author(s)
  佐宗晃, 児島宏明
- Journal Title
  
  日本音響学会2006年春季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] Voice activity detection using YIN, a fundamental frequency estimator2006
- Author(s)
  石洙永, 李時旭, 児島宏明
- Journal Title
  
  日本音響学会2006年春季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] Combining Multiple subword representations for open-vocabulary spoken document retrieval2005
- Author(s)
  Lee, S.W.
- Journal Title
  
  Proc. of International Conference on Acoustics, Speech, and Signal Processing (IEEE ICASSP2005) 1
  
  Pages: 505-508
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] An algorithm for similar utterance section extraction for managing spoken documents2005
- Author(s)
  Itoh, Y.
- Journal Title
  
  Multimedia Systems,ISSN : 0942-4962 10・5
  
  Pages: 432-443
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] An Approach for Retrieving Inquiries in TV Broadcasts in a Disaster2005
- Author(s)
  K.Iwata
- Journal Title
  
  Proc. of IASTED International Conference on Signal and Image Processing, 1
  
  Pages: 34-39
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals2005
- Author(s)
  T.Taniguchi
- Journal Title
  
  Proceedings of Interspeech2005 1
  
  Pages: 589-592
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Combining Multiple subword representations for open-vocabulary spoken document retrieval,2005
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.,of International Conference on Acoustics, Speech, and Signal Processing (IEEE ICASSP2005) Vol.1
  
  Pages: 505-508
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] An algorithm for similar utterance section extraction for managing spoken documents,2005
- Author(s)
  Itoh, Y., Tanaka, K., Lee, S.W.
- Journal Title
  
  Multimedia Systems ISSN:0942-4962 Vol.10, No.5
  
  Pages: 432-443
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] An Approach for Retrieving Inquiries in TV Broadcasts in Disaster,2005
- Author(s)
  Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
- Journal Title
  
  Proc.of IASTED International Conference on Signal and Image Processing
  
  Pages: 34-39
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals,2005
- Author(s)
  Toru Taniguchi, Akishige Adachi, Shigeki Okawa, Masaaki Honda, Katsuhiko Shirai
- Journal Title
  
  Proc.of Interspeech2005
  
  Pages: 589-592
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] An algorithm for similar utterance section extraction for managing spoken documents2005
- Author(s)
  Itoh, Y., Tanaka, K., Lee, S.W.
- Journal Title
  
  Multimedia Systems, ISSN:0942-4962 10・5
  
  Pages: 10-10
- Related Report
  2005 Annual Research Report
[Journal Article] Evaluation of an HMM-Based Feature-Compensation Method Using the AURORA2J2005
- Author(s)
  A.Sasou, F.Asano, K.Tanaka, S.Nakamura
- Journal Title
  
  Proc. of IEEE & EURASIP International Workshop on Nonlinear Signal and Image Processing 1
  
  Pages: 6-6
- Related Report
  2005 Annual Research Report
[Journal Article] An Approach for Retrieving Inquiries in TV Broadcasts in a Disaster2005
- Author(s)
  K.Iwata, Y.Itoh, K.Kojima, M.Ishigame, K.Tanaka, S.Lee
- Journal Title
  
  IASTED International Conference on Signal and Image Processing 1
  
  Pages: 6-6
- Related Report
  2005 Annual Research Report
[Journal Article] 語彙フリー音声検索におけるサブワードの検討および災害放送検索システムへの応用2005
- Author(s)
  岩田, 伊藤, 小嶋, 石亀, 田中, 李
- Journal Title
  
  電子情報通信学会研究技術報告 SP2005-21
  
  Pages: 6-6
- Related Report
  2005 Annual Research Report
[Journal Article] An AR-HMM based speech analysis method and evaluation of singing-voice recognition2005
- Author(s)
  A.Sasou, M.Goto, S.Hayamizu, K.Tanaka
- Journal Title
  
  IEICE Technical Report SP2005-42
  
  Pages: 6-6
- NAID
  110003298740
- Related Report
  2005 Annual Research Report
[Journal Article] HMMに基く特徴補正を実装した有限状態文法音声認識エンジンの開発および評価2005
- Author(s)
  佐宗晃, 浅野太, 田中和世
- Journal Title
  
  日本音響学会2005年秋季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] 信号音のAR-HMMに基く複合環境音認識の検討2005
- Author(s)
  長谷川智紀, 佐宗晃, 田中和世
- Journal Title
  
  日本音響学会2005年秋季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] 語彙フリー音声検索におけるサブワードと応用システムの検討2005
- Author(s)
  岩田耕平, 伊藤慶明, 小嶋和徳, 石亀昌明, 田中和世, 李時旭
- Journal Title
  
  日本音響学会2005年秋季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] Human-machine Interface Using EMG Signals for Robot Hand Control2005
- Author(s)
  M.Yoshikawa, T.Tsujimura, M.Mikawa K.Tanaka
- Journal Title
  
  Proceedings of the Society of Instrument and Control Engineers (SICE) Annual Conference 1
  
  Pages: 6-6
- Related Report
  2005 Annual Research Report
[Journal Article] Issues of SSML in Japanese2005
- Author(s)
  W.Imatake, M.Akabane, K.Tanaka
- Journal Title
  
  Proc. of the W3C workshop on Internatinalizing the Speech Synthesis Markup Language 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals2005
- Author(s)
  T.Taniguchi, A.Adachi, S.Okawa, M.Honda, K.Shirai
- Journal Title
  
  Proceedings of Interspeech2005 1
  
  Pages: 4-4
- Related Report
  2005 Annual Research Report
[Journal Article] Sinusoidal Segmentの時間的特徴を用いた音声・楽器音・歌声が混在した音響信号中の音カテゴリ検出2005
- Author(s)
  谷口徹, 安達了慈, 大川茂樹, 誉田雅彰, 白井克彦
- Journal Title
  
  日本音響学会2006年季研究発表会論文集 1
  
  Pages: 2-2
- Related Report
  2005 Annual Research Report
[Journal Article] 音素片のカーネル主成分分析を用いたトピックセグメンテーション2005
- Author(s)
  佐土原健, 児島宏明, 李時旭
- Journal Title
  
  人工知能学会 1E2-03
  
  Pages: 2-2
- NAID
  130004653919
- Related Report
  2005 Annual Research Report
[Journal Article] Combining Multiple subword representations for open-vocabulary spoken document retrieval2005
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.of International Conference on Acoustics, Speech, and Signal Processing (IEEE ICASSP2005) 1
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] An auto-regressive, nonstationary excited signal parameter estimation method and an evaluation of a singing-voice recognition2005
- Author(s)
  Sasou, A., Goto, M., Hayamizu, S., Tanaka, K.
- Journal Title
  
  Proc.of International Conference on Acoustics, Speech, and Signal Processing (IEEE ICASSP2005) 1
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] 曲内の類似性を用いた曲境界の検出の性能改善2005
- Author(s)
  岩渕晃, 伊藤慶明, 小嶋和徳, 石亀昌明, 田中和世, Shi-wook Lee
- Journal Title
  
  日本音響学会講演論文集2005年3月 1
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] 音声・楽器音・歌声が重畳した音響信号中のカテゴリ識別2005
- Author(s)
  谷口徹, 安達了慈, 大川茂樹, 誉田雅彰, 白井克彦
- Journal Title
  
  電子情報通信学会技術研究報告 SP2004-153
  
  Pages: 6-6
- NAID
  10014425442
- Related Report
  2004 Annual Research Report
[Journal Article] Knowledge integration in annotation-based collaborative virtual environments2005
- Author(s)
  S.Aubry, S.Okawa, D.Lenne, I.Thouvenin
- Journal Title
  
  インタラクション2005論文集 1
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] 生活環境音を記録し音響的特徴を用いて要約するインタフェースの提案2005
- Author(s)
  大塚昭徳, 伊丹徳重, 坂倉美保, 冨塚清史, 大川茂樹
- Journal Title
  
  情報処理学会全国大会論文集2005 2
  
  Pages: 2-2
- NAID
  170000170314
- Related Report
  2004 Annual Research Report
[Journal Article] 音声・楽器音・歌声が混在した音響信号中の音カテゴリ検出2005
- Author(s)
  谷口徹, 安達了慈, 大川茂樹, 誉田雅彰, 白井克彦
- Journal Title
  
  日本音響学会講演論文集2005年3月 1
  
  Pages: 2-2
- NAID
  10018037648
- Related Report
  2004 Annual Research Report
[Journal Article] Open-vocabulary spoken document retrieval based on multiligual subphonetic segment recognition2004
- Author(s)
  Lee, S.W.
- Journal Title
  
  Proc. of 18th International Congress on Acoustics(ICA2004) 2
  
  Pages: 1723-1726
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Frequent word section extraction in a presentation speech by an effective dynamic programming algorithm2004
- Author(s)
  Itoh, Y.
- Journal Title
  
  Journal of Acoustical Society of America(JASA) 116-2
  
  Pages: 1234-1243
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Robust spoken document retrieval based on multiligual subphonetic segment recognition2004
- Author(s)
  Lee, S.W.
- Journal Title
  
  Proc. of 6th International Conference on Enterprise Information Systems CD-ROM
  
  Pages: 1-7
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Open-vocabulary spoken document retrieval based on multiligual subphonetic segment recognition,2004
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.of 18th International Congress on Acoustics (ICA2004) Vol.II
  
  Pages: 1723-1726
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Frequent word section extraction in a presentation speech by an effective dynamic programming algorithm,2004
- Author(s)
  Itoh, Y, Tanaka, K.
- Journal Title
  
  Journal of Acoustical Society of America (JASA) Vol.116, No.2
  
  Pages: 1234-1243
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Robust spoken document retrieval based on multiligual subphonetic segment recognition,2004
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.,of 6th International Conference on Enterprise Information Systems (CD-ROM)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Open-vocabulary spoken document retrieval based on multiligual subphonetic segment recognition2004
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.of 18th International Congress on Acoustics (ICA2004) 2
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] An algorithm for extracting similar partial utterances toward flexible spoken document retrieval2004
- Author(s)
  Itoh, Y., Tanaka, K., Lee, S.W.
- Journal Title
  
  Proc.of 18th International Congress on Acoustics (ICA2004) 2
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] Robust spoken document retrieval based on multiligual subphonetic segment recognition2004
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.of 6th International Conference on Enterprise Information Systems 1
  
  Pages: 7-7
- Related Report
  2004 Annual Research Report
[Journal Article] Frequent word section extraction in a presentation speech by an effective dynamic programming algorithm2004
- Author(s)
  Itoh, Y., Tanaka, K.
- Journal Title
  
  Journal of Acoustical Society of America (JASA) 116-2
  
  Pages: 10-10
- Related Report
  2004 Annual Research Report
[Journal Article] 音声的距離に基く類似薬品名表示・検索システム2004
- Author(s)
  田中和世, 中村美保子, 肖丹青, 伊藤慶明
- Journal Title
  
  日本音響学会講演論文集2004年9月 1
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] DCTをベースとする音響信号の無歪みデータ圧縮の検討2004
- Author(s)
  佐藤博喜, 田中和世, 佐藤寧
- Journal Title
  
  日本音響学会講演論文集2004年9月 1
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] 逆ハの字型マイクロホン配置による雑音除去の検討2004
- Author(s)
  太田昌宏, 長谷川智紀, 田中和世, 佐藤寧
- Journal Title
  
  日本音響学会講演論文集2004年9月 1
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] Similar section extraction for analyzing stream data structure2004
- Author(s)
  Itoh, Y., Tanaka, K., Lee, S.W.
- Journal Title
  
  Proc.of 5th European Conference on Machine Learning (ECML2004) 1
  
  Pages: 10-10
- Related Report
  2004 Annual Research Report
[Journal Article] Multi-layer subword units for open-vocabulary spoken document retrieval2004
- Author(s)
  Lee, S.W., Tanaka, K., Itoh, Y.
- Journal Title
  
  Proc.of International Conference on Spoken Language Processing (ICSLP2004) 2
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] An Efficient Partial Matching Algorithm toward Speech Retrieval by Speech2004
- Author(s)
  Itoh, Y., Tanaka, K., Lee, S.W.
- Journal Title
  
  Proc.of International Conference on Spoken Language Processing (ICSLP2004) 2
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] HMM-Based Feature Compensation Method : An Evaluation Using the AURORA22004
- Author(s)
  Sasou, A., Asano, F., Tanaka, K., Nakamura, S.
- Journal Title
  
  Proc.of International Conference on Spoken Language Processing (ICSLP2004) 1
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] 曲内の類似性を用いた曲境界の検出2004
- Author(s)
  岩渕晃, 伊藤慶明, 小嶋和徳, 石亀昌明
- Journal Title
  
  日本音響学会講演論文集2004年9月 1
  
  Pages: 2-2
- Related Report
  2004 Annual Research Report
[Journal Article] Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals2004
- Author(s)
  T.Taniguchi, A.Adachi, S.Okawa, M.Honda, K.Shirai
- Journal Title
  
  Proc.of International Conference on Speech and Language Technology 1
  
  Pages: 4-4
- Related Report
  2004 Annual Research Report
[Journal Article] 時系列パターンの任意部分区間の高速マッチング手法Shift CDP法2003
- Author(s)
  伊藤慶明
- Journal Title
  
  電子情報通信学会論文誌D-II J85-D-II No.9
  
  Pages: 1267-1277
- NAID
  110003170966
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Mixed-Lingual Spoken Word Recognition by Using VQ Codebook Sequnces of Variable Length Segments2003
- Author(s)
  Kojima, H.
- Journal Title
  
  Proc. of the European Conference on Speech Communication and Technology 4
  
  Pages: 2485-2488
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Statistical estimation of phoneme's most stable point based on universal constraints2003
- Author(s)
  Shigeki Okawa
- Journal Title
  
  Proc. of 7th European Conference on Speech Communication 2
  
  Pages: 781-784
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] A fast matching algorithm called shift continuous DP between arbitrary parts of two time sequence data sets,2003
- Author(s)
  Yoshiaki Itoh
- Journal Title
  
  IEICE Trans.Information and Systems (Japanese Ed.) Vol.J89-D, No.3
  
  Pages: 1267-1277
- NAID
  110003170966
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Mixed-Lingual Spoken Word Recognition by Using VQ Codebook Sequnces of Variable Length Segments,2003
- Author(s)
  Hiroaki Kojima, Kazuyo Tanaka
- Journal Title
  
  Proc.of the European Conference on Speech Communication and Technology
  
  Pages: 2485-2488
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] Statistical estimation of phoneme's most stable point based on universal constraints,2003
- Author(s)
  Shigeki Okawa, Katsuhiko Shirai
- Journal Title
  
  Proceedings of 7th European Conference on Speech Communication and Technology
  
  Pages: 781-784
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Journal Article] HMM-based noise-robust feature compensation,
- Author(s)
  Akira Sasou, Futoshi Asano, Satoshi Nakamura, Kazuyo Tanaka
- Journal Title
  
  International Journal of Speech Communication (Accepted, in publication)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Book] 音声工学2005
- Author(s)
  板橋秀一
- Total Pages
  244
- Publisher
  森北出版
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Book] Speech Technology, ISBN4-627-828112005
- Author(s)
  S.Itahashi, K.Tanaka, et al.
- Total Pages
  244
- Publisher
  Morikita-Shuppan
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2005 Final Research Report Summary
[Book] 音声工学2005
- Author(s)
  板橋秀一, 田中和世, 他
- Total Pages
  244
- Publisher
  森北出版
- Related Report
  2004 Annual Research Report
[Patent(Industrial Property Rights)] 視覚的かつ聴覚的類似品名提示装置2004
- Inventor(s)
  田中和世
- Industrial Property Rights Holder
  国立大学法人筑波大学
- Industrial Property Number
  2004-271381
- Filing Date
  2004-09-17
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2005 Final Research Report Summary
[Patent(Industrial Property Rights)] 視覚的かつ聴覚的類似品提示装置2004
- Inventor(s)
  田中和世
- Industrial Property Rights Holder
  国立大学法人筑波大学
- Industrial Property Number
  2004-271381
- Filing Date
  2004-09-17
- Related Report
  2004 Annual Research Report
[Publications] 田中和世: "音声認識技術とその応用、現状と課題"計測と制御. 42巻6号. 491-496 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 伊藤慶明: "時系列パターンの任意部分区間の高速マッチング手法Shift CDP法"電子情報通信学会論文誌D-II. J85-D-IINo.9. 1267-1277 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Kojima, H., Tanaka, K.: "Mixed-Lingual Spoken Word Recognition by Using VQ Codebook Sequnces of Variable Length Segments"Proc. of the European Conference on Speech Communication and Technology. 4. 2485-2488 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Sasou, A., Asano, F., Tanaka, K., Nakamura, S.: "Adaptation of Acoustic Model Using the Gain-Adapted HMM Decomposition Method"Proc. of the European Conference on Speech Communication and Technology. 1. 29-32 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Lee, S.W., Tanaka, K., Itoh, Y.: "Spoken document retrieval with multilingual subphoneme sets"Proc.of the Autumn Meeting of the Acoust.Soc.Japan. 1. 165-166 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Yoshiaki Itoh, Kazuyo Tanaka: "An efficient algorithm for extracting repeated key sentence in a presentation speech"Proc. of the 7th World Multiconference on Systemics, Cybernetics and Informatics. 10(CD-ROM). 6 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Lee, S.W., Tanaka, K., Itoh, Y.: "Adaptation of multilingual subphonetic segment for spoken document retrieval"電子情報通信学会技術研究報告. SP2003-144. 187-192 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 佐宗晃, 浅野太, 田中和世, 中村哲: "利得適応型AR-HMM分解法を用いた音響モデルの雑音適応化の検討"電子情報通信学会技術報告. 103・26. 19-24 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Yoshiaki Itoh, Kazuyo Tanaka, Shi-wook Lee: "Repeated utterance extraction by a new algorithm for labeling a presentation speech"Proc. of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval. 1(CD-ROM). 7 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 佐宗晃, 浅野太, 田中和世, 中村哲: "HMM基づいた雑音重畳音声の特徴量補正"日本音響学会2003年秋季研究発表会論文集. 1. 23-24 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 伊藤慶明, 田中和世, 李時旭: "時系列データの任意区間マッチング法の効率化"日本音響学会秋季研究発表会論文集. 1. 163-164 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Stephane Aubry, Shigeki Okawa: "Analysis of rhythm-based method for language identification"Technical Report of Chiba Institute of Technology. 50. 93-99 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Shigeki Okawa, Katsuhiko Shirai: "Statistical estimation of phoneme's most stable point based on universal constraints"Proc. of 7th European Conference on Speech Communication. 2. 781-784 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 谷口徹, 安達了慈, 大川茂樹, 白井克彦: "HMMを用いた音声・音楽識別"電子情報通信学会技術研究報告. SP2003-92. 47-51 (2003)
- Related Report
  2003 Annual Research Report
[Publications] 椿雅也, 李而立, 谷口徹, 大川茂樹, 誉田雅彰, 白井克彦: "心理的距離尺度に基づく実音色から有限音色空間への写像"日本音響学会講演論文集. 2. 673-675 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Shi-wook Lee, Tanaka, K., Itoh, Y.: "Application of multilayer subword units for spoken document retrieval,"Proc. of the Spring Meeting of the Acoust. Soc. Japan. 1. 81-82 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Yoshiaki Itoh, Kazuyo Tanaka, S.W.Lee: "An algorithm for extracting similar partial utterances toward spoken document retrieval"Proc. of International Congress on Acoustics 2004. (採択済)(印刷中). 4 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Lee, S.W., Tanaka, K., Itoh, Y.: "Open-vocabulary spoken document retrieval based on multiligual subphonetic segment recognition"Proc. of International Congress on Acoustics 2004. (採択済)(印刷中). 4 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Yoshiaki Itoh, Kazuyo Tanaka: "Frequent word section extraction in a presentation speech by an effective dynamic programming algorithm"Journal of Acoustical Society of America. (採択済)(印刷中). 10 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Lee, S.W., Tanaka, K., Itoh, Y.: "Robust spoken document retrieval based on multiligual subphonetic segment recognition"Proc. of 6th International Conference on Enterprise Information Systems. (採択済)(印刷中). 7 (2004)
- Related Report
  2003 Annual Research Report

Universal-Phonetic-Segment-Based Speech Coding and Its Applications to Speech Processing

Principal Investigator

TANAKA Kazuyo University of Tsukuba, Graduate School of Library, Information and Media Studies, Professor, 大学院・図書館情報メディア研究科, 教授 (70344207)

¥16,500,000 (Direct Cost: ¥16,500,000)

Report

Research Products

[Journal Article] HMM-based noise-robust feature compensation2006

Author(s)

Journal Title

Description

Related Report

[Journal Article] 分析区間長を可変としたテキスト分割手法2006

Author(s)

Journal Title

Related Report

[Journal Article] 語彙フリー音声検索における時間精緻化サブワードモデルの検討2006

Author(s)

Journal Title

Related Report

[Journal Article] スポッティング区間の再認識に基づく音声検索性能の向上2006

Author(s)

Journal Title

Related Report

[Journal Article] Instantaneous frequencies of signals obtained by the analytic signal method2006

Author(s)

Journal Title

Related Report

[Journal Article] Multi-mixture based PDT-SSS Algorithm for Extension of an HMnet Structure2006

Author(s)

Journal Title

Related Report

[Journal Article] 電動車いす搭載用平行マイクアレイ実装方式の検討2006

Author(s)

Journal Title

Related Report

[Journal Article] Voice activity detection using YIN, a fundamental frequency estimator2006

Author(s)

Journal Title

Related Report

[Journal Article] Combining Multiple subword representations for open-vocabulary spoken document retrieval2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] An algorithm for similar utterance section extraction for managing spoken documents2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] An Approach for Retrieving Inquiries in TV Broadcasts in a Disaster2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] Combining Multiple subword representations for open-vocabulary spoken document retrieval,2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] An algorithm for similar utterance section extraction for managing spoken documents,2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] An Approach for Retrieving Inquiries in TV Broadcasts in Disaster,2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals,2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] An algorithm for similar utterance section extraction for managing spoken documents2005