Building a Balanced Database of the Finnish Morpho-syntax Using Large Corpora of Finnish
Project/Area Number |
19720100
|
Research Category |
Grant-in-Aid for Young Scientists (B)
|
Allocation Type | Single-year Grants |
Research Field |
Linguistics
|
Research Institution | Reitaku University |
Principal Investigator |
CHIBA Shoju (CHIBA Shioju) Reitaku University, 外国語学部, 准教授 (70337723)
|
Project Period (FY) |
2007 – 2010
|
Project Status |
Completed (Fiscal Year 2010)
|
Budget Amount *help |
¥3,910,000 (Direct Cost: ¥3,400,000、Indirect Cost: ¥510,000)
Fiscal Year 2010: ¥910,000 (Direct Cost: ¥700,000、Indirect Cost: ¥210,000)
Fiscal Year 2009: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2008: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2007: ¥1,700,000 (Direct Cost: ¥1,700,000)
|
Keywords | 統語論 / 形態論 / コーパス言語学 / 外国語(中・英・仏・独除く) / 言語学 / フィンランド語 / 国際情報交換 / フィンランド |
Research Abstract |
Using sampling techniques, about 10 million-sized textual database was extracted from the large corpora of written modern Finnish, and then linguistically annotated. Annotation ranges from lexical, grammatical to discourse-functional information. We also developed a quantitative profiling method alongside practical applications which compares the syntactic/morpho-syntactic/lexical profiles of Finnish grammatical constructions with the overall settings of the sampled database.
|
Report
(6 results)
Research Products
(42 results)