Media Conversion for User-friendly Human-Machine Interface
Project/Area Number |
04650295
|
Research Category |
Grant-in-Aid for General Scientific Research (C)
|
Allocation Type | Single-year Grants |
Research Field |
電子通信系統工学
|
Research Institution | SEIKEI UNIVERSITY |
Principal Investigator |
|
Project Period (FY) |
1992 – 1993
|
Project Status |
Completed (Fiscal Year 1993)
|
Budget Amount *help |
¥2,100,000 (Direct Cost: ¥2,100,000)
Fiscal Year 1993: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 1992: ¥1,200,000 (Direct Cost: ¥1,200,000)
|
Keywords | MEDIA CONVERSION / HUMAN INTERFACE / EXPRESSION ANIMATION / 3-D MODEL / VOICE ANALYSIS / SCENARIO DESCRIPTION / SPEECH RECOGNITION / TEXTURE MAPPING / 知的インタフェース / 表情合成 / コンピュータグラフィックス / 音声と画像の同期 |
Research Abstract |
In 1992, I proposed media conversion system from both text and voice to facial image. Lip motion is controlled by the analysis results of text and voice. At first, segment boundaries of vowel positions are extracted using spectrum and power transition. Strict vowel and consonant positions are decided by DP matching of segmentation result and text information. By this algorithm, accurate phoneme segmentation using text information is realized. This system can be applied to speech synthesis, too. The keyframes of mouth animation are located on the segment boundary decided by segmentation result. Standard mouth shapes and durations are decided by the text analysis. The parameters between keyframes are decided by 3-D Spline interpolation, so the motion becomes smooth and natural. In 1993, I constructed Scenario Making System to describe facial animation easily. Expression is realized by the modification of 3D wire frame model. Operator only has to decide the keyframes position by locating the Iconized facial expression on the time axis. Using this system, I realized proto-type e-mail interface system with facial expression. User of transmitter side only gives text, voice and expression positions. Only few parameters are transmitted and after a few delay, the facial motion image synchronized with the voice appears at the terminal of receiver. Ultra low bit rate image transmittion can be realized by our system. The facial expression recognition is our future subject.
|
Report
(3 results)
Research Products
(22 results)