
2004 Fiscal Year Final Research Report Summary

Development of a supporting system for creation of educational video contents using robust automatic speech recognition technology

Research Project

Project/Area Number 14580246
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Educational technology
Research Institution Ishikawa National College of Technology

Principal Investigator

KANEDERA Noboru, Ishikawa National College of Technology, Department of Electronics and Information Engineering, Associate Professor (50194931)

Project Period (FY) 2002 – 2004
Keywords Educational video contents / Video segmentation / Independent component analysis / Automatic speech recognition / Dynamic programming / Video retrieval / Video edit system
Research Abstract

We developed a system that supports the creation of educational video content. The system automatically segments lecture video material into subtopics based on the speech signal. To represent the subtopics of video scenes, the text recognized from the lecture speech by automatic speech recognition (ASR) was converted into an index using independent component analysis (ICA) instead of conventional TF-IDF. We investigated a segmentation method based on dynamic programming that minimizes the sum of cosine distances between adjacent indexes representing the subtopics of video scenes. The validity of the proposed method was evaluated using sample lecture videos given by five lecturers. The results indicated that scene segmentation using automatic speech recognition performed as well as segmentation using transcribed text.
Editing a video requires locating subtopic boundaries and then extracting the necessary segments or removing the unnecessary ones. Locating subtopic boundaries is particularly laborious, because the video must be reviewed from beginning to end. The developed system, with its automatic subtopic segmentation, is therefore expected to reduce editing time and effort. To confirm the effect of automatic subtopic segmentation, we carried out a subjective evaluation with 16 participants and 5 lecture videos. As a result, 75% of the participants answered that editing with automatic subtopic segmentation was better than editing without it. Moreover, the average editing time was reduced by about 14%.
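
The dynamic-programming segmentation mentioned in the abstract can be illustrated with a minimal sketch. This is not the project's implementation: the function names `cosine` and `segment`, the use of summed sentence vectors as segment indexes, the fixed number of segments `k`, and the toy vectors are all assumptions made for illustration. In particular, the sketch scores two adjacent segments by the cosine of the angle between their vectors and minimizes the sum of those scores, which is only one possible reading of "cosine distance" in the abstract. In the described system the vectors would come from ICA applied to recognized text.

```python
import math


def cosine(u, v):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def segment(vectors, k, score=cosine):
    """Split `vectors` into k contiguous segments (hypothetical helper).

    Returns the k-1 boundary indices that minimize the summed `score`
    between each pair of adjacent segment vectors.
    """
    n = len(vectors)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and len(vectors)")
    dim = len(vectors[0])

    # prefix[i] is the element-wise sum of vectors[:i].
    prefix = [[0.0] * dim]
    for v in vectors:
        prefix.append([p + x for p, x in zip(prefix[-1], v)])

    def seg(i, j):
        """Summed vector of vectors[i:j], used as that segment's index."""
        return [b - a for a, b in zip(prefix[i], prefix[j])]

    # best[(s, i, j)]: minimal cost of covering vectors[:j] with s segments,
    # the last of which is vectors[i:j]. back stores the previous start.
    best = {(1, 0, j): 0.0 for j in range(1, n + 1)}
    back = {}
    for s in range(2, k + 1):
        for i in range(s - 1, n):
            for j in range(i + 1, n + 1):
                cur = seg(i, j)
                for h in range(s - 2, i):
                    prev = best.get((s - 1, h, i))
                    if prev is None:
                        continue
                    cost = prev + score(seg(h, i), cur)
                    if cost < best.get((s, i, j), math.inf):
                        best[(s, i, j)] = cost
                        back[(s, i, j)] = h

    if k == 1:
        return []

    # Pick the cheapest final segment, then walk the back-pointers.
    i = min(range(k - 1, n), key=lambda start: best[(k, start, n)])
    boundaries = []
    s, j = k, n
    while s > 1:
        boundaries.append(i)
        i, j, s = back[(s, i, j)], i, s - 1
    return boundaries[::-1]


# Toy data: six "sentences" over three made-up topic dimensions.
toy = [[1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 1, 0], [0, 0, 1], [0, 0, 1]]
print(segment(toy, 3))  # [2, 4]
```

The sketch runs in O(k n^3) time, which is adequate for a toy example; passing a different `score` function (for example one based on 1 minus the cosine) would change what "adjacent indexes" are optimized for without altering the dynamic program itself.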

  • Research Products

    (21 results)


  • [Journal Article] Subtopic segmentation in the lecture speech for creation of lecture video contents. 2005

    • Author(s)
      N.Kanedera, A.Sumida, T.Ikehata, T.Funada
    • Journal Title

      The IEICE Trans. on Information and Systems Vol.J88-DI No.5 (in press)

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Lecture video segmentation for creation of educational video contents. 2004

    • Author(s)
      N.Kanedera, A.Sumida, J.Jikeya
    • Journal Title

      Japanese Colleges of Technology Education Journal No.27

      Pages: 727-732

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Lecture video segmentation derived from speech using ICA. 2004

    • Author(s)
      N.Kanedera, A.Sumida, J.Jikeya, T.Ikehata, T.Funada
    • Journal Title

      The 18th International Congress on Acoustics Vol.III

      Pages: 2023-2026

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Development of scene selection system for making lecture video. 2004

    • Author(s)
      T.Ikehata, A.Sumida, N.Kanedera
    • Journal Title

      Japanese Society for Engineering Education

      Pages: 151-152

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Development of a supporting system for creation of educational video contents using automatic subtopic segmentation. 2004

    • Author(s)
      N.Kanedera, T.Ikehata, A.Sumida
    • Journal Title

      Meeting of research and education for information processing in the college of technology No.24

      Pages: 81-84

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Evaluation of a video segmentation system supporting the creation of lecture video materials. 2004

    • Author(s)
      N.Kanedera, T.Ikehata, A.Sumida, T.Funada
    • Journal Title

      Japan Acoustic society Fall Meeting I

      Pages: 37-38

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Lecture video segmentation derived from speech by dynamic programming. 2004

    • Author(s)
      A.Sumida, N.Kanedera, T.Ikehata
    • Journal Title

      Forum on Information Technology 2004 Vol.2

      Pages: 353-356

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Subtopic segmentation in the lecture speech. 2004

    • Author(s)
      N.Kanedera, A.Sumida, T.Ikehata, T.Funada
    • Journal Title

      Proceedings of International Conference on Spoken Language Processing Vol.III

      Pages: 1821-1824

    • Description
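      From the Research Report Summary (English version)
  • [Journal Article] A study of speech-based scene segmentation methods for lecture videos (音声による講義ビデオシーン分割方法の検討). 2003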

    • Author(s)
      金寺 登
    • Journal Title

      Proceedings of the Acoustical Society of Japan 2003 Spring Meeting I (日本音響学会2003年春季研究発表会講演論文集 I)

      Pages: 187-188

    • Description
      From the Research Report Summary (Japanese version)
  • [Journal Article] Continuous Speech Recognition Based on the Contribution of Modulation Spectrum. 2003

    • Author(s)
      N.Kanedera
    • Journal Title

      Speech Dynamics by Ear, Eye, Mouth and Machine, An Interdisciplinary Workshop (IEICE Technical Report) Vol.103 No.155

      Pages: 67-72

    • Description
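      From the Research Report Summary (Japanese version)
  • [Journal Article] Speech-based lecture video scene segmentation using independent component analysis (独立成分分析を用いた音声による講義ビデオシーン分割). 2003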

    • Author(s)
      隅田 飛鳥
    • Journal Title

      IEICE Technical Report (電子情報通信学会技術研究報告) Vol.103 No.220

      Pages: 7-12

    • Description
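      From the Research Report Summary (Japanese version)
  • [Journal Article] Automatic scene segmentation of class videos: toward richer video teaching materials (授業ビデオの自動シーン分割-ビデオ教材の充実を目指して-). 2003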

    • Author(s)
      金寺 登
    • Journal Title

      Meeting of research and education for information processing in the college of technology (高等専門学校 情報処理教育研究員会 情報処理教育研究発表会論文集) No.23

      Pages: 98-101

    • Description
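      From the Research Report Summary (Japanese version)
  • [Journal Article] Lecture video scene segmentation using topic representation by independent component analysis (独立成分分析によるトピック表現を用いた講義ビデオシーン分割). 2003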

    • Author(s)
      金寺 登
    • Journal Title

      Proceedings of the Acoustical Society of Japan 2003 Fall Meeting I (日本音響学会 2003年秋季研究発表会 講演論文集 I)

      Pages: 181-182

    • Description
      From the Research Report Summary (Japanese version)
  • [Journal Article] A lecture video segmentation method using audio data. 2003

    • Author(s)
      N.Kanedera, A.Sumida, J.Jikeya
    • Journal Title

      Japan Acoustic society Spring Meeting I

      Pages: 187-188

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Continuous speech recognition based on the contribution of modulation spectrum. 2003

    • Author(s)
      N.Kanedera, T.Arai, K.Okada, K.Asai
    • Journal Title

      Speech Dynamics by Ear, Eye, Mouth and Machine, An Interdisciplinary Workshop, Technical Report of IEICE Vol.103 No.155

      Pages: 67-72

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Lecture video segmentation derived from speech using ICA. 2003

    • Author(s)
      A.Sumida, N.Kanedera, J.Jikeya, T.Ikehata, T.Funada
    • Journal Title

      Technical Report of IEICE Vol.103 No.220

      Pages: 7-12

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Automatic subtopic segmentation of lecture video contents. 2003

    • Author(s)
      N.Kanedera, E.Tanaka, A.Sumida
    • Journal Title

      Meeting of research and education for information processing in the college of technology No.23

      Pages: 98-101

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Lecture video segmentation using topic representation by ICA. 2003

    • Author(s)
      N.Kanedera, A.Sumida, J.Jikeya, T.Ikehata, T.Funada
    • Journal Title

      Japan Acoustic society Fall Meeting I

      Pages: 181-182

    • Description
      From the Research Report Summary (English version)
  • [Journal Article] Lecture speech recognition and lecture video segmentation. 2003

    • Author(s)
      N.Kanedera, A.Sumida, J.Jikeya, T.Ikehata, T.Funada
    • Journal Title

      Technical Report of the Japanese Society for Artificial Intelligence SIG-SLUD-A302

      Pages: 9-14

    • Description
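      From the Research Report Summary (English version)
  • [Journal Article] Continuous speech recognition based on the contribution of the modulation spectrum (変調スペクトルの貢献度に基づく連続音声認識). 2002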

    • Author(s)
      金寺 登
    • Journal Title

      IEICE Technical Report (電子情報通信学会技術研究報告) Vol.102 No.248

      Pages: 41-46

    • Description
      From the Research Report Summary (Japanese version)
  • [Journal Article] Continuous speech recognition based on the contribution of modulation frequency components. 2002

    • Author(s)
      N.Kanedera, T.Arai, K.Okada, Y.Momomura
    • Journal Title

      Technical Report of IEICE Vol.102 No.248

      Pages: 41-46

    • Description
      From the Research Report Summary (English version)

Published: 2006-07-11  
