• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Kansai Factors in Talking Image Effect of Digital Compressed Images

Research Project

Project/Area Number 09838020
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field 感性工学
Research InstitutionKyoto Institute of Technology

Principal Investigator

TAMURA Hiroshi  Kyoto Institute of Technology, Graduate School of Science and Technology, Professor, 大学院・工芸科学研究科, 教授 (70029411)

Co-Investigator(Kenkyū-buntansha) SHIBUYA Yu  Kyoto Institute of Technology, Faculty of Engineering, Associate Professor, 工芸学部, 助教授 (70226190)
Project Period (FY) 1997 – 1998
Project Status Completed (Fiscal Year 1998)
Budget Amount *help
¥3,000,000 (Direct Cost: ¥3,000,000)
Fiscal Year 1998: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 1997: ¥2,100,000 (Direct Cost: ¥2,100,000)
KeywordsTalking Image Effect / inferior speech recognition method / MPEG / analysis on incorrect answer / miss perception / slow effect / fast effect / Kansei / ISDN / 感性因子 / 劣性音声 / 左右音声 / 混合音声 / デジタル圧縮・再生
Research Abstract

Talking Image Effect (TIE) of digital compressed video images was evaluated in this research. MPEG-1 format was used for video image compression and inferior speech recognition method was introduced for the evaluation.
Talker's front face image and profile one were recorded and compressed with MPEG-1 encoder. Video bit rate is a parameter that is set at encoding. Encoding with higher video bit rate causes to make bigger file on the storage but it is expected to make higher quality video image. For experiment of this research, 400khs and 4000kbs were used as low video bit rate and high one respectively.
Following things were clarified with this experiment.
1. From the point of view of TIE, there was no significant difference between high video bit rate compression and low one. Furthermore, analysis on incorrect answer also indicated that there was no significant difference with video bit rate.
2. TIE of profile face image is higher than that of front face image for labials.
Because the talker's video image was recorded with a chromakey blue backdrop, each image had high spatial and time redundancy. Due to the feature of the algorithm on MPEG-1 encoding, it is easy to make a highly compressed MPEG-1 video image from above high redundant video images. Therefore there might be no difference of image effect with difference of video bit rate.
In order to use more complex video image, that is a image with less redundant of time and space, a talking video image overlapped on the CG images was introduced. With using this video image, it was found that the slow effect based on motion of talker's mouth at low video bit rates was much loss than that of at high video bit rate.

Report

(3 results)
  • 1998 Annual Research Report   Final Research Report Summary
  • 1997 Annual Research Report
  • Research Products

    (14 results)

All Other

All Publications (14 results)

  • [Publications] 呉 俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 都築 達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 古江 伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 都築 達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Jun WU: "On the Roles of Talking Face in Speech Perception" Trans.IEICE. vol.J80-D-II. 2066-2073 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Tatsurou TSUZUKI: "Still Image Effect in Speech Word Recognition" Human Interface. vol.13. 457-462 (1997)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Nobuki FURUE: "Evaluation of MPEG-1 Video by Means of Inferior Speech Recognition" Human Interface. vol.14. 565-570 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Tatsuro TSUZUKI: "A Comparison of Talking Image Effect of front and profile face image compressed by MPEG-1 format" Proc.Conference of Kansai Chapter, Japan Ergonomics Society. 93-98 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 呉 俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)

    • Related Report
      1998 Annual Research Report
  • [Publications] 都築 達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)

    • Related Report
      1998 Annual Research Report
  • [Publications] 古江 伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 都築 達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 呉 俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. VOL.J80. 2066-2073 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 都築 達郎: "単語音声識別における静止映像の提示効果" ヒューマン・インタフェース・シンポジウム論文集. 第13回. 457-462 (1997)

    • Related Report
      1997 Annual Research Report

URL: 

Published: 1997-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi