Calculating Autocorrelation Function for Word Occurrences in Texts and Its Modeling with Stochastic Processes
Project/Area Number |
25580093
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Linguistics
|
Research Institution | Showa University |
Principal Investigator |
|
Project Period (FY) |
2013-04-01 – 2015-03-31
|
Project Status |
Completed (Fiscal Year 2014)
|
Budget Amount *help |
¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000)
Fiscal Year 2014: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2013: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
|
Keywords | 自己相関関数 / 拡張指数型関数 / 確率過程 / ポアソン過程 / 非定常ポアソン課程 / Cox過程 / 言語統計 / テキストマイニング / 非定常ポアソン過程 / 機能語 / 概念語 / 非済次ポアソン過程 / 語のバースト性 |
Outline of Final Research Achievements |
In this study,we attempt to offer a new analyzing point of view for texts in which occurrences of words are considered as dynamical time series. Based on this interpretation of texts, we propose a method for calculating autocorrelation function (ACF) which represents the correlation between occurrences of a considered word. In our method, the basic time unit of the stochastic process of word occurrence is taken to be one sentence and this allows us a suitable definition of ACF. The examples of ACF obtained through our method for 'conceptual words'and those for 'nonconceptual words' are given and their characteristic behaviors are discussed. Here, the term 'conceptual word' means the word which is deeply related with the central concepts or themes of text, and the 'nonconceptual word' represents the word which is not related with themes of text. It was found that the ACFs for 'conceptual words' and those for 'nonconceptual words' show entirely different characteristic behaviors.
|
Report
(3 results)
Research Products
(9 results)