Study on the labeling of discourse structure and its electronic representation
Project/Area Number |
08837004
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
談話(ディスコース)
|
Research Institution | Chiba University |
Principal Investigator |
TUTIYA Syun Chiba University, Faculty of Letters Professor, 文学部, 教授 (50155404)
|
Project Period (FY) |
1996 – 1997
|
Project Status |
Completed (Fiscal Year 1997)
|
Budget Amount *help |
¥2,400,000 (Direct Cost: ¥2,400,000)
Fiscal Year 1997: ¥800,000 (Direct Cost: ¥800,000)
Fiscal Year 1996: ¥1,600,000 (Direct Cost: ¥1,600,000)
|
Keywords | discourse / dialogue / markup / discourse structure / segmentation / SGML / TEI / turn taking |
Research Abstract |
The present research aims to examine the methods with which to create, archive and exchange the recorded dialogue data, focusing on the markup or "tags." This year, the requirements for the spoken dialogue corpus was considered and a prototypical sample corpus was created based on the consideration on the basic tag set. The spoken dialogue corpus consists of 3 or 4 components. The first component is a set of sound files, which contains the separate voices uttered by the two different speakers. Each file represents one dialogue, enabling researchers to timestamp utterances. The second component is a movie file, but at the current state of the technology, it can be optional. The third component is a set of annotations to the phonetic events. Annotations includes the orthographical transcription, linguistic (syntactic, semantic) annotations and discourse annotations. The current research has been successful in showing that the transcription is essentially a kind of annotation, not that which is annotated, from the perspective of the exchange of spoken dialogue corpora. A tag set was proposed and verified in term of TEI,using the 128 maptask dialogues in Japanese. A guideline is prepared for the creation and exchange.
|
Report
(3 results)
Research Products
(4 results)