2010 Fiscal Year Final Research Report

Improvement of Very Large Vocabulary Speech Recognition using an encoding based on probabilistic structure of vocabulary

Research Project

Project/Area Number	20500166
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Perception information processing/Intelligent robotics
Research Institution	Hosei University
Principal Investigator	ITOU Katunobu Hosei University, 情報科学部, 教授 (30356472)
Project Period (FY)	2008 – 2010
Keywords	音声認識 / 話者認識 / 音響ライフログ
Research Abstract	For speech recognition, in a large vocabulary task, any phone sequence didn't induce statistically significant deficient performance without contribution of language models. For speaker recognition/verification, on the other hand, it seems to be difficult other than increasing of training data. Most applications of speaker recognition, it cannot be expected sufficient training data. Moreover, it is difficult to assume a target phone sequence in advance. Therefore, a new method is required for speaker recognition, because many previous methods for improving speech recognition cannot be efficient for speaker recognition.

Research Products
(15 results)

All 2011 2010 2009 2008 Other

All Presentation (14 results) Remarks (1 results)

[Presentation] 話者照合と音声認識を併用したスマートフォン向け認証システムの作成2011
- Author(s)
  平野邦彦,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2011-03-04
[Presentation] デジタル放送の字幕情報を用いた発話者のアノテーション2011
- Author(s)
  山室慶太,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2011-03-04
[Presentation] 音声を用いた農作業日誌システムの構築2011
- Author(s)
  住澤卓也,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2011-03-04
[Presentation] Speaker model updating by the conversational sounds in speaker verification2010
- Author(s)
  Keita Yamamuro, Katunobu ITOU
- Organizer
  IIWAS2010(査読有)
- Year and Date
  2010-11-04
[Presentation] Speaker model updating by the conversational sounds in speaker verification2010
- Author(s)
  Kazufumi Nakamura, Katunobu ITOU
- Organizer
  internoise 2010(概要査読有)
- Year and Date
  2010-06-15
[Presentation] 音響ライフログへのアノテーションのための話者と場所の自動分類2010
- Author(s)
  山野貴一郎,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2010-03-11
[Presentation] 携帯端末への話者照合を用いたセキュリティロック2010
- Author(s)
  山室慶太,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2010-03-11
[Presentation] 高齢者の加齢による聴力低下に対応する音声強調2010
- Author(s)
  田母神恒,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2010-03-11
[Presentation] Flashコンテンツ操作のための音声認識インタフェース2010
- Author(s)
  松浦健太,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2010-03-11
[Presentation] コンテンツ制作における収録音編集のための音声強調2010
- Author(s)
  中村一文,伊藤克亘
- Organizer
  情報処理学会全国大会(査読無)
- Year and Date
  2010-03-10
[Presentation] Browsing Audio Life-log Data Using Acoustic and Location Information2009
- Author(s)
  Kiichiro Yamano, Katunobu ITOU
- Organizer
  UBICOMM 2009(査読有)
- Year and Date
  2009-11-15
[Presentation] バイノーラルマイクを用いたライフログ映像のショット識別2008
- Author(s)
  山野貴一郎,伊藤克亘
- Organizer
  信号処理シンポジウム(査読無)
- Year and Date
  2008-11-13
[Presentation] Detecting Scenes in Lifelog Videos based on Probabilistic Models of Audio data.2008
- Author(s)
  Kiichiro Yamano, Katunobu ITOU
- Organizer
  Acoustics, 08(概要査読有)
- Year and Date
  2008-07-03
[Presentation] 情報処理学会全国大会(査読無)
- Author(s)
  山野貴一郎,伊藤克亘,音響情報を用いたライフログデータのインデキシング
- Organizer
  20090311
[Remarks] ホームページ等
- URL
  http://cis.k.hosei.ac.jp/info/faculty/digital/itou.html

2010 Fiscal Year Final Research Report

Improvement of Very Large Vocabulary Speech Recognition using an encoding based on probabilistic structure of vocabulary

Principal Investigator

ITOU Katunobu Hosei University, 情報科学部, 教授 (30356472)

Research Products

[Presentation] 話者照合と音声認識を併用したスマートフォン向け認証システムの作成2011

Author(s)

Organizer

Year and Date

[Presentation] デジタル放送の字幕情報を用いた発話者のアノテーション2011

Author(s)

Organizer

Year and Date

[Presentation] 音声を用いた農作業日誌システムの構築2011

Author(s)

Organizer

Year and Date

[Presentation] Speaker model updating by the conversational sounds in speaker verification2010

Author(s)

Organizer

Year and Date

[Presentation] Speaker model updating by the conversational sounds in speaker verification2010

Author(s)

Organizer

Year and Date

[Presentation] 音響ライフログへのアノテーションのための話者と場所の自動分類2010

Author(s)

Organizer

Year and Date

[Presentation] 携帯端末への話者照合を用いたセキュリティロック2010

Author(s)

Organizer

Year and Date

[Presentation] 高齢者の加齢による聴力低下に対応する音声強調2010

Author(s)

Organizer

Year and Date

[Presentation] Flashコンテンツ操作のための音声認識インタフェース2010

Author(s)

Organizer

Year and Date

[Presentation] コンテンツ制作における収録音編集のための音声強調2010

Author(s)

Organizer

Year and Date

[Presentation] Browsing Audio Life-log Data Using Acoustic and Location Information2009

Author(s)

Organizer

Year and Date

[Presentation] バイノーラルマイクを用いたライフログ映像のショット識別2008

Author(s)

Organizer

Year and Date

[Presentation] Detecting Scenes in Lifelog Videos based on Probabilistic Models of Audio data.2008

Author(s)

Organizer

Year and Date

[Presentation] 情報処理学会全国大会(査読無)

Author(s)

Organizer

[Remarks] ホームページ等

URL