Topic segmentation of speech data based on keyword detection
Project/Area Number |
10680415
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
情報システム学(含情報図書館学)
|
Research Institution | Ritsumeikan University |
Principal Investigator |
YAMASHITA Yoichi Ritsumeikan Univ. Fac. Science and Engineering, Asoc. Professor, 理工学部, 助教授 (80174689)
|
Project Period (FY) |
1998 – 1999
|
Project Status |
Completed (Fiscal Year 1999)
|
Budget Amount *help |
¥3,300,000 (Direct Cost: ¥3,300,000)
Fiscal Year 1999: ¥1,000,000 (Direct Cost: ¥1,000,000)
Fiscal Year 1998: ¥2,300,000 (Direct Cost: ¥2,300,000)
|
Keywords | keyword spotting / topic identification / speech data / false alarm / ワードスポッティング / ニュース音声 / シミュレーション / 音声認識 / DPマッチング |
Research Abstract |
A method of topic identification is proposed for TV news speech based on the keyword spotting technique. Three thousands of nouns are selected as keywords which contribute to topic identification, based on criterion of mutual information and a length of the word. The topic of news is identified by calculating possibilities of the topics in terms of an acoustic score of the spotted word and a topic probability of the word. Topic identification rate is 66.5 percent assuming that identification is correct if the correct topic is included in the first three places of the result of topic identification. A new method of keyword spotting using prosodic information as well as phonemic information is discussed in order to reduce false alarms keeping high detection rate. Prosodic dissimilarity between a keyword and input speech is measured by DP matching of F0 contours. A total score based on these two measures is used for detecting keywords. The F0 template are stored for each keyword after smoothing based on the F0 model. The introduction of F0 information reduced false alarms by 30% to 50% for the same detection rate for TV news speech. A method of estimating accuracy of word spotting is proposed and it is compared with the estimation based on the length of the word. False alarm counts are estimated by a new measure calculated by simulation of speech recognition for phoneme sequences that the language model generates. The simulation-based measure shows better performance for the estimation of false alarms.
|
Report
(3 results)
Research Products
(11 results)