A research for automatic structure extraction and information retrieval from video data using speech and voice
Project/Area Number |
17500073
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Media informatics/Database
|
Research Institution | Iwate Prefectural University |
Principal Investigator |
ITOH Yoshiaki Iwate Prefectural University, Faculty of Software and Information Science, Associate professor (90325928)
|
Co-Investigator(Kenkyū-buntansha) |
OKAWA Shigeki Chiba Institute University, Faculty of Software and Information Science, Associate professor (40306395)
KOJIMA Kazunori Iwate Prefectural University, Faculty of Software and Information Science, Research associate (60305307)
|
Project Period (FY) |
2005 – 2007
|
Project Status |
Completed (Fiscal Year 2007)
|
Budget Amount |
¥3,770,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥270,000)
Fiscal Year 2007: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2006: ¥1,100,000 (Direct Cost: ¥1,100,000)
Fiscal Year 2005: ¥1,500,000 (Direct Cost: ¥1,500,000)
|
Keywords | retrieval / video data / structure / segmentation / music retrieval / music information / acoustic feature / speech retrieval / music information processing / speech features / music features |
Research Abstract |
The first theme of this research is to realize automatic video segmentation. For this purpose, we developed methods for extracting local and global features of video information and for classifying video data sets. The second theme is the automatic extraction of structure from segmented video data sets, using local and global similarity and dissimilarity between the segmented sections. The research first analyzed local acoustic and image similarity in video data sets and extracted similar partial sections that are repeated within a music piece. The developed method then discriminated between speech sections and music sections using a local feature and a global feature. We confirmed that the developed methods worked well on real video data sets. The research also conducted speech retrieval by text and speech queries on the speech sections of video segmented by the above method. For speech retrieval, we proposed a new technique that accepts any word as a query and a method for integrating plural subword models, and the experimental results showed better performance than former methods. These results were reported at many domestic and international conferences. In the future, we plan to develop the methods further for practical use.
|
Report
(4 results)
Research Products
(138 results)
[Journal Article] Automatic Music Boundary Detection Using Short Segmental Acoustic Similarity in a Music Piece (2008)
Author(s)
Yoshiaki Itoh, Akira Iwabuchi, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
-
Journal Title
EURASIP Journal on Audio, Speech, and Music Processing (to appear)
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
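[Journal Article] Verification of the Effectiveness of New Subword Models and Subword Acoustic Distance in a Vocabulary-free Spoken Document Retrieval Method (2007)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
-
Journal Title
IPSJ Journal (情報処理学会論文誌), Vol. 48, No. 5
Pages: 1990-2000
NAID
Description
From "Summary of Research Achievements Report (English)"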
Related Report
-
[Journal Article] Asynchronous Parallel Distributed Genetic Algorithm with Elite Migration (2007)
Author(s)
Kazunori Kojima, Masaaki Ishigame, Goutam Chakraborty, Hiroshi Matsuo, Shozo Makino
-
Journal Title
International Journal of Computational Intelligence Vol. 4, No. 2
Pages: 105-111
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
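[Presentation] A Study of Subword Models and Retrieval Methods for Vocabulary-independent Spoken Document Retrieval (2007)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-Wook Lee
Organizer
1st Spoken Document Processing Workshop (第1回音声ドキュメント処理ワークショップ)
Description
From "Summary of Research Achievements Report (English)"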
Related Report
-
[Presentation] An Integration Method of Retrieval Results using Plural Subword Models for Vocabulary-free Spoken Document Retrieval (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
INTERSPEECH
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
[Presentation] A New Architecture and Approaches to Improve a Subword-Based Open Vocabulary Spoken Document Retrieval System (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Congress on Acoustics
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
[Presentation] Music Boundary Detection using Similarity in a Music Selection (2007)
Author(s)
Yoshiaki Itoh, Akira Iwabuchi, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Workshop on Multimedia Signal Processing
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
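[Presentation] Integration of Plural Subwords in Subword-based Spoken Document Retrieval: Using the Expected Retrieval Performance of Query Terms for Each Subword (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IEICE Technical Report (電子情報通信学会研究技術報告)
Description
From "Summary of Research Achievements Report (English)"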
Related Report
-
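[Presentation] A Method for Integrating Retrieval Results of Plural Subword Models Based on the Expected Performance for Each Query Term (2007)
Author(s)
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Acoustical Society of Japan (音響学会論文集)
Description
From "Summary of Research Achievements Report (English)"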
Related Report
-
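[Presentation] A Study of Temporally Refined Subword Models for Vocabulary-free Speech Retrieval (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Acoustical Society of Japan Meeting (日本音響学会講演論文集)
Description
From "Summary of Research Achievements Report (English)"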
Related Report
-
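[Presentation] Improving Speech Retrieval Performance Based on Re-recognition of Spotted Sections (2006)
Author(s)
Takayuki Otake, Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Acoustical Society of Japan Meeting (日本音響学会講演論文集)
Description
From "Summary of Research Achievements Report (English)"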
Related Report
-
[Presentation] A Study of Sophisticated Subword Models on a Time Axis for Open-vocabulary Spoken Document Retrieval (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Western Pacific Acoustics Conference
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
[Presentation] Two-stage Vocabulary-free Spoken Document Retrieval -Subword Identification and Re-recognition of the Identified Sections- (2006)
Author(s)
Yoshiaki Itoh, Takayuki Otake, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Conference on Spoken
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
[Presentation] Open-Vocabulary Spoken Document Retrieval based on new subword models and subword phonetic similarity (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
International Conference on Spoken Language Processing (Interspeech)
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
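[Presentation] A Study of a Subword Model Construction Method Considering Time Alignment for Speech Retrieval Systems (2006)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IPSJ SIG Technical Report (情報処理学会研究技術報告)
Description
From "Summary of Research Achievements Report (English)"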
Related Report
-
[Presentation] An Approach for Retrieving Inquiries in TV Broadcasts in a Disaster (2005)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IASTED International Conference on Signal and Image Processing
Description
From "Summary of Research Achievements Report (English)"
Related Report
-
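[Presentation] A Study of Subwords in Vocabulary-free Speech Retrieval and Its Application to a Disaster Broadcast Retrieval System (2005)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
IEICE Technical Report (電子情報通信学会研究技術報告)
Description
From "Summary of Research Achievements Report (English)"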
Related Report
-
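[Presentation] Improving the Performance of Music Boundary Detection Using Similarity within a Music Piece (2005)
Author(s)
Akira Iwabuchi, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-Wook Lee
Organizer
Proceedings of the Acoustical Society of Japan Meeting (日本音響学会講演論文集)
Description
From "Summary of Research Achievements Report (English)"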
Related Report
-
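[Presentation] A Study of Subwords and Application Systems in Vocabulary-free Speech Retrieval (2005)
Author(s)
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
Organizer
Proceedings of the Acoustical Society of Japan Meeting (日本音響学会講演論文集)
Description
From "Summary of Research Achievements Report (English)"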
Related Report
-