2004 Fiscal Year Final Research Report Summary

Development of a supporting system for creation of educational video contents using robust automatic speech recognition technology

Research Project

Project/Area Number	14580246
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Educational technology
Research Institution	Ishikawa National College of Technology
Principal Investigator	KANEDERA Noboru Ishikawa National College of Technology, Department of Electronics and Information Engineering, Associate Professor, 電子情報工学科, 助教授 (50194931)
Project Period (FY)	2002 – 2004
Keywords	Educational video contents / Video segmentation / Independent component analysis / Automatic speech recognition / Dynamic Programming / Video retrieval / Video edit system
Research Abstract	We developed a supporting system for creation of educational video contents. The system automatically segments a lecture video material into subtopics based on speech signals. To represent subtopics of video scenes, the text recognized by automatic speech recognition (ASR) from a lecture speech was converted into an index using independent component analysis (ICA) instead of conventional TF-IDF. This research attempted a method of segmentation using dynamic programming that minimizes the sum of cosine distances between adjacent indexes that represent subtopics of video scenes. The validity of the proposed method was evaluated using sample lecture videos uttered by five lecturers. Results indicated that scene segmentation using automatic speech recognition performed as well as that using transcription text. Editing a video requires searching for subtopic segmentation positions, and extraction of necessary video segments, or removing unnecessary video segments. In particular, when searching subtopic segmentation positions, a large amount of time and efforts are required to review the video from beginning to end. That is, it is hard work to search subtopic segmentation positions. It is therefore expected to reduce the editing time and efforts by the developed system with automatic subtopic segmentation. In this research, we carried out subjective evaluation by 16 examinees and 5 lecture video materials to confirm the effect of automatic subtopic segmentation. As a result, 75% of examinees answered that the editing method with automatic subtopic segmentation is better than that without segmentation. Moreover, the average editing time was reduced by about 14%.

Research Products
(21 results)

All 2005 2004 2003 2002

All Journal Article (21 results)

[Journal Article] Subtopic segmentation in the lecture speech for creation of lecture video contents.2005
- Author(s)
  N.Kanedera, A.Sumida, T.Ikehata, T.Funada
- Journal Title
  
  The IEICE Trans, on information and Systems Vol.J88-DINo.5(In Printing)
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Lecture video segmentation for creation of educational video contents.2004
- Author(s)
  N.Kanedera, A.Sumida, J.Jikeya
- Journal Title
  
  Japanese Colleges of Technology Education Journal No.27
  
  Pages: 727-732
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Lecture video segmentation derived from speech using ICA.2004
- Author(s)
  N.Kanedera, A.Sumida, J.Jikeya, T.Ikehata, T.Funada
- Journal Title
  
  The 18th International Congress on Acoustics Vol.III
  
  Pages: 2023-2026
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Development of scene selection system for making lecture video.2004
- Author(s)
  T.Ikehata, A.Sumida, N.Kanedera
- Journal Title
  
  Japanese Society for Engineering Education
  
  Pages: 151-152
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Development of a supporting system for creation of educational video contents using automatic subtopic segmentation.2004
- Author(s)
  N.Kanedera, T.Ikehata, A.Sumida
- Journal Title
  
  Meeting of research and education for information processing in the college of technology No.24
  
  Pages: 81-84
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Evaluation of a video segmentation system supporting the creation of lecture video materials.2004
- Author(s)
  N.Kanedera, T.Ikehata, A.Sumida, T.Funada
- Journal Title
  
  Japan Acoustic society Fall Meeting I
  
  Pages: 37-38
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Lecture video segmentation derived from speech by dynamic programming.2004
- Author(s)
  A.Sumida, N.Kanedera, T.Ikehata
- Journal Title
  
  Forum on Information Technology 2004 Vol.2
  
  Pages: 353-356
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Subtopic segmentation in the lecture speech.2004
- Author(s)
  N.Kanedera, A.Sumida, T.Ikehata, T.Funada
- Journal Title
  
  Proceedings of International Conference on Spoken Language Processing Vol.III
  
  Pages: 1821-1824
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] 音声による講義ビデオシーン分割方法の検討2003
- Author(s)
  金寺登
- Journal Title
  
  日本音響学会2003年春季研究発表会講演論文集 I
  
  Pages: 187-188
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Continuous Speech Recognition Based on the Contribution of Modulation Spectrum2003
- Author(s)
  N.Kanedera
- Journal Title
  
  SPEECH DYNAMICS BY EAR, EYE, MOUTH AND MACHINE, An Interdisciplinary Workshop(電子情報通信学会技術研究報告) 103・155
  
  Pages: 67-72
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] 独立成分分析を用いた音声による講義ビデオシーン分割2003
- Author(s)
  隅田飛鳥
- Journal Title
  
  電子情報通信学会技術研究報告 103・220
  
  Pages: 7-12
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] 授業ビデオの自動シーン分割-ビデオ教材の充実を目指して-2003
- Author(s)
  金寺登
- Journal Title
  
  高等専門学校情報処理教育研究員会情報処理教育研究発表会論文集 23
  
  Pages: 98-101
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] 独立成分分析によるトピック表現を用いた講義ビデオシーン分割2003
- Author(s)
  金寺登
- Journal Title
  
  日本音響学会 2003年秋季研究発表会講演論文集 I
  
  Pages: 181-182
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] A lecture video segmentation method using audio data.2003
- Author(s)
  N.Kanedera, A.Sumida, J.Jikeya
- Journal Title
  
  Japan Acoustic society Spring Meeting I
  
  Pages: 187-188
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Continuous speech recognition based on the contribution of modulation spectrum.2003
- Author(s)
  N.Kanedera, T.Arai, K.Okada, K.Asai
- Journal Title
  
  Speech Dynamics by Ear, Eye, Mouth and Machine, An Interdisciplinary Workshop, Technical Report of IEICE Vol.103No.155
  
  Pages: 67-72
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Lecture video segmentation derived from speech using ICA.2003
- Author(s)
  A.Sumida, N.Kanedera, J.Jikeya, T.ikehata, T.Funada
- Journal Title
  
  Technical Report of IEICE Vol.103No.220
  
  Pages: 7-12
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Automatic subtopic segmentation of lecture video contents.2003
- Author(s)
  N.Kanedera, E.Tanaka, A.Sumida
- Journal Title
  
  Meeting of research and education for information processing in the college of technology No.23
  
  Pages: 98-101
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Lecture video segmentation using topic representation by ICA.2003
- Author(s)
  N.Kanedera, A.Sumida, J.Jikeya, T.ikehata, T.Funada
- Journal Title
  
  Japan Acoustic society Fall Meeting I
  
  Pages: 181-182
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Lecture speech recognition and lecture video segmentation.2003
- Author(s)
  N.Kanedera, A.Sumida, J.Jikeya, T.ikehata, T.Funada
- Journal Title
  
  Technical Report of the Japanese Society for Artificial Intelligence SIG-SLUD-A302
  
  Pages: 9-14
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] 変調スペクトルの貢献度に基づく連続音声認識2002
- Author(s)
  金寺登
- Journal Title
  
  電子情報通信学会技術研究報告 102・248
  
  Pages: 41-46
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Continuous speech recognition based on the contribution of modulation frequency components.2002
- Author(s)
  N.Kanedera, T.Arai, K.Okada, Y.Momomura
- Journal Title
  
  Technical Report of IEICE Vol.102No.248
  
  Pages: 41-46
- Description
  「研究成果報告書概要(欧文)」より

2004 Fiscal Year Final Research Report Summary

Development of a supporting system for creation of educational video contents using robust automatic speech recognition technology

Principal Investigator

KANEDERA Noboru Ishikawa National College of Technology, Department of Electronics and Information Engineering, Associate Professor, 電子情報工学科, 助教授 (50194931)

Research Products

[Journal Article] Subtopic segmentation in the lecture speech for creation of lecture video contents.2005

Author(s)

Journal Title

Description

[Journal Article] Lecture video segmentation for creation of educational video contents.2004

Author(s)

Journal Title

Description

[Journal Article] Lecture video segmentation derived from speech using ICA.2004

Author(s)

Journal Title

Description

[Journal Article] Development of scene selection system for making lecture video.2004

Author(s)

Journal Title

Description

[Journal Article] Development of a supporting system for creation of educational video contents using automatic subtopic segmentation.2004

Author(s)

Journal Title

Description

[Journal Article] Evaluation of a video segmentation system supporting the creation of lecture video materials.2004

Author(s)

Journal Title

Description

[Journal Article] Lecture video segmentation derived from speech by dynamic programming.2004

Author(s)

Journal Title

Description

[Journal Article] Subtopic segmentation in the lecture speech.2004

Author(s)

Journal Title

Description

[Journal Article] 音声による講義ビデオシーン分割方法の検討2003

Author(s)

Journal Title

Description

[Journal Article] Continuous Speech Recognition Based on the Contribution of Modulation Spectrum2003

Author(s)

Journal Title

Description

[Journal Article] 独立成分分析を用いた音声による講義ビデオシーン分割2003

Author(s)

Journal Title

Description

[Journal Article] 授業ビデオの自動シーン分割-ビデオ教材の充実を目指して-2003

Author(s)

Journal Title

Description

[Journal Article] 独立成分分析によるトピック表現を用いた講義ビデオシーン分割2003

Author(s)

Journal Title

Description

[Journal Article] A lecture video segmentation method using audio data.2003

Author(s)

Journal Title

Description

[Journal Article] Continuous speech recognition based on the contribution of modulation spectrum.2003

Author(s)

Journal Title

Description

[Journal Article] Lecture video segmentation derived from speech using ICA.2003

Author(s)

Journal Title

Description

[Journal Article] Automatic subtopic segmentation of lecture video contents.2003

Author(s)

Journal Title

Description

[Journal Article] Lecture video segmentation using topic representation by ICA.2003

Author(s)

Journal Title

Description

[Journal Article] Lecture speech recognition and lecture video segmentation.2003

Author(s)

Journal Title