2010 Fiscal Year Final Research Report
Building a Balanced Database of the Finnish Morpho-syntax Using Large Corpora of Finnish
Project/Area Number |
19720100
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Single-year Grants |
Research Field |
Linguistics
|
Research Institution | Reitaku University |
Principal Investigator |
CHIBA Shoju Reitaku University, Faculty of Foreign Studies, Associate Professor (70337723)
|
Project Period (FY) |
2007 – 2010
|
Keywords | Syntax / Morphology / Corpus Linguistics |
Research Abstract |
Using sampling techniques, a textual database of about 10 million in size was extracted from large corpora of modern written Finnish and then linguistically annotated. The annotation ranges from lexical and grammatical to discourse-functional information. Alongside practical applications, we also developed a quantitative profiling method that compares the syntactic, morpho-syntactic, and lexical profiles of Finnish grammatical constructions with those of the sampled database as a whole.
|