Kansai Factors in Talking Image Effect of Digital Compressed Images

Research Project

Project/Area Number	09838020
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	感性工学
Research Institution	Kyoto Institute of Technology
Principal Investigator	TAMURA Hiroshi Kyoto Institute of Technology, Graduate School of Science and Technology, Professor, 大学院・工芸科学研究科, 教授 (70029411)
Co-Investigator(Kenkyū-buntansha)	SHIBUYA Yu Kyoto Institute of Technology, Faculty of Engineering, Associate Professor, 工芸学部, 助教授 (70226190)
Project Period (FY)	1997 – 1998
Project Status	Completed (Fiscal Year 1998)
Budget Amount *help	¥3,000,000 (Direct Cost: ¥3,000,000) Fiscal Year 1998: ¥900,000 (Direct Cost: ¥900,000) Fiscal Year 1997: ¥2,100,000 (Direct Cost: ¥2,100,000)
Keywords	Talking Image Effect / inferior speech recognition method / MPEG / analysis on incorrect answer / miss perception / slow effect / fast effect / Kansei / ISDN / 感性因子 / 劣性音声 / 左右音声 / 混合音声 / デジタル圧縮・再生
Research Abstract	Talking Image Effect (TIE) of digital compressed video images was evaluated in this research. MPEG-1 format was used for video image compression and inferior speech recognition method was introduced for the evaluation. Talker's front face image and profile one were recorded and compressed with MPEG-1 encoder. Video bit rate is a parameter that is set at encoding. Encoding with higher video bit rate causes to make bigger file on the storage but it is expected to make higher quality video image. For experiment of this research, 400khs and 4000kbs were used as low video bit rate and high one respectively. Following things were clarified with this experiment. 1. From the point of view of TIE, there was no significant difference between high video bit rate compression and low one. Furthermore, analysis on incorrect answer also indicated that there was no significant difference with video bit rate. 2. TIE of profile face image is higher than that of front face image for labials. Because the talker's video image was recorded with a chromakey blue backdrop, each image had high spatial and time redundancy. Due to the feature of the algorithm on MPEG-1 encoding, it is easy to make a highly compressed MPEG-1 video image from above high redundant video images. Therefore there might be no difference of image effect with difference of video bit rate. In order to use more complex video image, that is a image with less redundant of time and space, a talking video image overlapped on the CG images was introduced. With using this video image, it was found that the slow effect based on motion of talker's mouth at low video bit rates was much loss than that of at high video bit rate.

Report

(3 results)

1998 Annual Research Report Final Research Report Summary
1997 Annual Research Report

Research Products
(14 results)

All Other

All Publications (14 results)

[Publications] 呉俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 都築達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 古江伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 都築達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] Jun WU: "On the Roles of Talking Face in Speech Perception" Trans.IEICE. vol.J80-D-II. 2066-2073 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] Tatsurou TSUZUKI: "Still Image Effect in Speech Word Recognition" Human Interface. vol.13. 457-462 (1997)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] Nobuki FURUE: "Evaluation of MPEG-1 Video by Means of Inferior Speech Recognition" Human Interface. vol.14. 565-570 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] Tatsuro TSUZUKI: "A Comparison of Talking Image Effect of front and profile face image compressed by MPEG-1 format" Proc.Conference of Kansai Chapter, Japan Ergonomics Society. 93-98 (1998)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1998 Final Research Report Summary
[Publications] 呉俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)
- Related Report
  1998 Annual Research Report
[Publications] 都築達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)
- Related Report
  1998 Annual Research Report
[Publications] 古江伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 都築達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)
- Related Report
  1998 Annual Research Report
[Publications] 呉俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. VOL.J80. 2066-2073 (1997)
- Related Report
  1997 Annual Research Report
[Publications] 都築達郎: "単語音声識別における静止映像の提示効果" ヒューマン・インタフェース・シンポジウム論文集. 第13回. 457-462 (1997)
- Related Report
  1997 Annual Research Report

Kansai Factors in Talking Image Effect of Digital Compressed Images

Principal Investigator

TAMURA Hiroshi Kyoto Institute of Technology, Graduate School of Science and Technology, Professor, 大学院・工芸科学研究科, 教授 (70029411)

¥3,000,000 (Direct Cost: ¥3,000,000)

Report

Research Products

[Publications] 呉 俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)

Description

Related Report

[Publications] 都築 達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)

Description

Related Report

[Publications] 古江 伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)

Description

Related Report

[Publications] 都築 達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)

Description

Related Report

[Publications] Jun WU: "On the Roles of Talking Face in Speech Perception" Trans.IEICE. vol.J80-D-II. 2066-2073 (1997)

Description

Related Report

[Publications] Tatsurou TSUZUKI: "Still Image Effect in Speech Word Recognition" Human Interface. vol.13. 457-462 (1997)

Description

Related Report

[Publications] Nobuki FURUE: "Evaluation of MPEG-1 Video by Means of Inferior Speech Recognition" Human Interface. vol.14. 565-570 (1998)

Description

Related Report

[Publications] Tatsuro TSUZUKI: "A Comparison of Talking Image Effect of front and profile face image compressed by MPEG-1 format" Proc.Conference of Kansai Chapter, Japan Ergonomics Society. 93-98 (1998)

Description

Related Report

[Publications] 呉 俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)

Related Report

[Publications] 都築 達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)

Related Report

[Publications] 古江 伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)

Related Report

[Publications] 都築 達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)

Related Report

[Publications] 呉 俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. VOL.J80. 2066-2073 (1997)

Related Report

[Publications] 都築 達郎: "単語音声識別における静止映像の提示効果" ヒューマン・インタフェース・シンポジウム論文集. 第13回. 457-462 (1997)

Related Report

[Publications] 呉俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)

[Publications] 都築達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)

[Publications] 古江伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)

[Publications] 都築達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)

[Publications] 呉俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. J80-D-II・8. 2066-2073 (1997)

[Publications] 都築達郎: "単語音声識別における静止映像の提示効果" Human Interface. 13. 457-462 (1997)

[Publications] 古江伸樹: "劣性音声識別実験法を用いたMPEG-1映像の品質評価" Human Interface. 14. 565-570 (1998)

[Publications] 都築達郎: "MPEG-1圧縮した正面映像と側面映像の話者映像効果の比較" 日本人間工学会関西支部大会講演論文集. 93-98 (1998)

[Publications] 呉俊: "音声識別における顔映像の役割" 電子情報通信学会論文誌. VOL.J80. 2066-2073 (1997)

[Publications] 都築達郎: "単語音声識別における静止映像の提示効果" ヒューマン・インタフェース・シンポジウム論文集. 第13回. 457-462 (1997)