2007 Fiscal Year Final Research Report Summary
A research for automatic structure extraction and information retrieval from video data using speech and voice
Project/Area Number | 17500073 |
Research Category | Grant-in-Aid for Scientific Research (C) |
Allocation Type | Single-year Grants |
Section | General |
Research Field | Media informatics/Database |
Research Institution | Iwate Prefectural University |
Principal Investigator | ITOH Yoshiaki, Iwate Prefectural University, Faculty of Software and Information Science, Associate professor (90325928) |
Co-Investigator(Kenkyū-buntansha) |
OKAWA Shigeki Chiba Institute University, Faculty of Software and Information Science, Associate professor (40306395)
KOJIMA Kazunori Iwate Prefectural University, Faculty of Software and Information Science, Research associate (60305307)
|
Project Period (FY) | 2005 – 2007 |
Keywords | retrieval / video data / structure / segmentation / music retrieval / music information / acoustic feature |
Research Abstract |
The first theme of this research is automatic video segmentation. For this purpose, we developed methods for extracting local and global features from video information and for classifying video data sets. The second theme is automatic extraction of the structure of segmented video data, using local and global similarity and dissimilarity between the segmented sections. We first analyzed local acoustic and image similarity in video data sets and extracted similar partial sections that are repeated within a music piece. The developed method then discriminated between speech sections and music sections using a local feature and a global feature. We confirmed that the developed methods worked well on real video data sets. We also conducted speech retrieval using text and speech queries on the speech sections of video segmented by the above method. For speech retrieval, we proposed a new technique that allows any word to be used as a query, together with a method for integrating multiple subword models, and the experimental results showed better performance than previous methods. These results were reported at many domestic and international conferences. In the future, we plan to develop these methods for practical use.
|
Research Products
(100 results)
-
[Journal Article] Automatic Music Boundary Detection Using Short Segmental Acoustic Similarity in a Music Piece (2008)
Author(s)
Yoshiaki Itoh, Akira Iwabuchi, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
-
Journal Title
EURASIP Journal on Audio, Speech, and Music Processing (to appear)
Description
From "Final Research Report Summary (English)"
-
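[Journal Article] Verification of the Effectiveness of New Subword Models and Subword Acoustic Distance in a Vocabulary-free Spoken Document Retrieval Method (2007)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
-
Journal Title
IPSJ Journal (情報処理学会論文誌), Vol. 48, No. 5
Pages: 1990-2000
Description
From "Final Research Report Summary (English)"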
-
[Journal Article] Asynchronous Parallel Distributed Genetic Algorithm with Elite Migration (2007)
Author(s)
Kazunori Kojima, Masaaki Ishigame, Goutam Chakraborty, Hiroshi Matsuo, Shozo Makino
-
Journal Title
International Journal of Computational Intelligence Vol. 4, No. 2
Pages: 105-111
Description
From "Final Research Report Summary (English)"
-
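[Presentation] A Study of Subword Models and Retrieval Methods for Vocabulary-independent Spoken Document Retrieval (2007)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-Wook Lee
Organizer
1st Spoken Document Processing Workshop (第1回音声ドキュメント処理ワークショップ)
Year and Date
2007
Description
From "Final Research Report Summary (English)"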
-
[Presentation] An Integration Method of Retrieval Results using Plural Subword Models for Vocabulary-free Spoken Document Retrieval (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
INTERSPEECH
Year and Date
2007
Description
From "Final Research Report Summary (English)"
-
[Presentation] A New Architecture and Approaches to Improve a Subword-Based Open Vocabulary Spoken Document Retrieval System (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Congress on Acoustics
Year and Date
2007
Description
From "Final Research Report Summary (English)"
-
[Presentation] Music Boundary Detection using Similarity in a Music Selection (2007)
Author(s)
Yoshiaki Itoh, Akira Iwabuchi, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Workshop on Multimedia Signal Processing
Year and Date
2007
Description
From "Final Research Report Summary (English)"
-
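[Presentation] Integration of Multiple Subwords in Subword-based Spoken Document Retrieval: Using the Expected Query-term Retrieval Performance of Each Subword (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IEICE Technical Report (電子情報通信学会研究技術報告)
Year and Date
2007
Description
From "Final Research Report Summary (English)"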
-
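[Presentation] A Method for Integrating Retrieval Results from Multiple Subword Models Based on the Expected Performance for Each Query Term (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Acoustical Society of Japan (音響学会論文集)
Year and Date
2007
Description
From "Final Research Report Summary (English)"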
-
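[Presentation] A Study of Temporally Refined Subword Models for Vocabulary-free Speech Retrieval (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Meeting of the Acoustical Society of Japan (日本音響学会講演論文集)
Year and Date
2006
Description
From "Final Research Report Summary (English)"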
-
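[Presentation] Improving Speech Retrieval Performance Based on Re-recognition of Spotted Sections (2006)
Author(s)
Takayuki Otake, Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Meeting of the Acoustical Society of Japan (日本音響学会講演論文集)
Year and Date
2006
Description
From "Final Research Report Summary (English)"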
-
[Presentation] A Study of Sophisticated Subword Models on a Time Axis for Open-vocabulary Spoken Document Retrieval (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Western Pacific Acoustics Conference
Year and Date
2006
Description
From "Final Research Report Summary (English)"
-
[Presentation] Two-stage Vocabulary-free Spoken Document Retrieval -Subword Identification and Re-recognition of the Identified Sections- (2006)
Author(s)
Yoshiaki Itoh, Takayuki Otake, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Conference on Spoken
Year and Date
2006
Description
From "Final Research Report Summary (English)"
-
[Presentation] Open-Vocabulary Spoken Document Retrieval based on new subword models and subword phonetic similarity (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Conference on Spoken Language Processing (Interspeech)
Year and Date
2006
Description
From "Final Research Report Summary (English)"
-
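[Presentation] A Study of a Subword Model Construction Method Considering Time Alignment for Speech Retrieval Systems (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IPSJ SIG Technical Report (情報処理学会研究技術報告)
Year and Date
2006
Description
From "Final Research Report Summary (English)"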
-
[Presentation] An Approach for Retrieving Inquiries in TV Broadcasts in a Disaster (2005)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IASTED International Conference on Signal and Image Processing
Year and Date
2005
Description
From "Final Research Report Summary (English)"
-
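[Presentation] A Study of Subwords for Vocabulary-free Speech Retrieval and Its Application to a Disaster Broadcast Retrieval System (2005)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IEICE Technical Report (電子情報通信学会研究技術報告)
Year and Date
2005
Description
From "Final Research Report Summary (English)"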
-
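[Presentation] Performance Improvement of Music Boundary Detection Using Similarity within a Music Piece (2005)
Author(s)
Akira Iwabuchi, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-Wook Lee
Organizer
Proceedings of the Meeting of the Acoustical Society of Japan (日本音響学会講演論文集)
Year and Date
2005
Description
From "Final Research Report Summary (English)"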
-
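[Presentation] A Study of Subwords and Application Systems for Vocabulary-free Speech Retrieval (2005)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Meeting of the Acoustical Society of Japan (日本音響学会講演論文集)
Year and Date
2005
Description
From "Final Research Report Summary (English)"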
-