• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2014 Fiscal Year Final Research Report

Calculating Autocorrelation Function for Word Occurrences in Texts and Its Modeling with Stochastic Processes

Research Project

  • PDF
Project/Area Number 25580093
Research Category

Grant-in-Aid for Challenging Exploratory Research

Allocation TypeMulti-year Fund
Research Field Linguistics
Research InstitutionShowa University

Principal Investigator

OGURA HIROSHI  昭和大学, 教養部, 准教授 (40214100)

Project Period (FY) 2013-04-01 – 2015-03-31
Keywords自己相関関数 / 拡張指数型関数 / 確率過程 / ポアソン過程 / 非定常ポアソン課程 / Cox過程 / 言語統計 / テキストマイニング
Outline of Final Research Achievements

In this study,we attempt to offer a new analyzing point of view for texts in which occurrences of words are considered as dynamical time series. Based on this interpretation of texts, we propose a method for calculating autocorrelation function (ACF) which represents the correlation between occurrences of a considered word. In our method, the basic time unit of the stochastic process of word occurrence is taken to be one sentence and this allows us a suitable definition of ACF. The examples of ACF obtained through our method for 'conceptual words'and those for 'nonconceptual words' are given and their characteristic behaviors are discussed. Here, the term 'conceptual word' means the word which is deeply related with the central concepts or themes of text, and the 'nonconceptual word' represents the word which is not related with themes of text. It was found that the ACFs for 'conceptual words' and those for 'nonconceptual words' show entirely different characteristic behaviors.

Free Research Field

統計的機械学習,計量言語学

URL: 

Published: 2016-06-03  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi