• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Stylometric investigation of 19th-century English through text-mining

Research Project

Project/Area Number 23500298
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Research Field Library and information science/Humanistic social informatics
Research InstitutionOsaka University

Principal Investigator

TABATA Tomoji  大阪大学, 言語文化研究科(研究院), 准教授 (10249873)

Project Period (FY) 2011 – 2013
Project Status Completed (Fiscal Year 2013)
Budget Amount *help
¥5,200,000 (Direct Cost: ¥4,000,000、Indirect Cost: ¥1,200,000)
Fiscal Year 2013: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2012: ¥1,690,000 (Direct Cost: ¥1,300,000、Indirect Cost: ¥390,000)
Fiscal Year 2011: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
Keywords19世紀英語 / 文体 / マイニング / stylometry / テクストマイニング / 計量文体分析 / 通時文体論 / 近代英語 / 19世紀 / 統計学的マイニング / コーパスマイニング / 著者識別指標 / 文体指標 / テクスト分類 / 著者推定 / 計量研究
Research Abstract

This grant-in-aid study has tried applying the state-of-art data-minig tools in stylistic investigation of nineteenth-century English prose. The linguistic features analysed in this research range from very common function words through mid- to lower-frequency items. While nineteenth-centurry English shares, in principle, similar linguistic features with eighteenth-century English as well as English in the early twentieth-century, it has proved to be possible to clearly differentiate nineteenth-century texts from texts written in the eighteenth as well as in the early twentieth centuries using machine learning techniques. Major distinguishing variables extracted though machine-learning mining algorithms include body-part languages, which function as descriptors of emotion or as device for characterisation. Of further interest is that Charles Dickens triggered the significant development of such stylistic devices in the nineteenth century.

Report

(4 results)
  • 2013 Annual Research Report   Final Research Report ( PDF )
  • 2012 Research-status Report
  • 2011 Research-status Report
  • Research Products

    (41 results)

All 2014 2013 2012 2011 Other

All Journal Article (8 results) (of which Peer Reviewed: 1 results) Presentation (29 results) (of which Invited: 2 results) Book (4 results)

  • [Journal Article] Stylometry of Collaboration: Pinpointing style changes in the text of mixed authorship2014

    • Author(s)
      Tomoji Tabata
    • Journal Title

      人文学データのマイニング

      Volume: 322

    • Related Report
      2013 Annual Research Report
  • [Journal Article] 'Approaching Dickens's Style through Random Forests', Digital Humanities 2012 Conference Proceedings2012

    • Author(s)
      Tomoji Tabata
    • Journal Title

      University of Hamburg, Germany, and The Alliance of Digital Humanities Organizations

      Pages: 388-91

    • Related Report
      2013 Final Research Report
  • [Journal Article] Approaching Dickens's Style through Random Forests2012

    • Author(s)
      Tabata. Tomoji
    • Journal Title

      Digital Humanities 2012 Conference Abstracts

      Volume: 2012

    • Related Report
      2012 Research-status Report
    • Peer Reviewed
  • [Journal Article] 「テクストマイニングからテクスト分析へ:Collinsとの共著作品におけるDickensの文体」2012

    • Author(s)
      田畑 智司
    • Journal Title

      『電子化言語資料分析研究2011--2012』

      Volume: 2012

    • Related Report
      2012 Research-status Report
  • [Journal Article] 'Key' Words and Stylistic 'Signatures': Textometry with Random Forests2012

    • Author(s)
      Tomoji Tabata
    • Journal Title

      統計数理研究所共同研究リポート 『マイニング技術を応用したテクスト分析研究』

      Volume: 278

    • Related Report
      2011 Research-status Report
  • [Journal Article] テクストマイニングからテクスト分析へ : Collins との共著作品におけるDickens の文体『電子化言語資料分析研究2011-2012』2011

    • Author(s)
      田畑 智司
    • Journal Title

      大阪大学大学院言語文化研究科

      Pages: 3-17

    • Related Report
      2013 Final Research Report
  • [Journal Article] 'Key' Words and Stylistic 'Signatures': Textometry with Random Forests

    • Author(s)
      田畑 智司
    • Journal Title

      統計数理研究所共同研究リポート

      Volume: 264 Pages: 45-64

    • Related Report
      2013 Final Research Report
  • [Journal Article] Stylometry of Collaboration: Pinpointing style changes in the text of mixed authorship

    • Author(s)
      Tomoji Tabata
    • Journal Title

      統計解析言語R を活用したデジタルヒューマニティーズ研究

      Pages: 37-46

    • Related Report
      2013 Final Research Report
  • [Presentation] テクストマイニングからテクスト分析へ2013

    • Author(s)
      田畑 智司
    • Organizer
      英語コーパス学会シンポジウム『私のコーパス利用』
    • Place of Presentation
      大阪大学
    • Year and Date
      2013-04-27
    • Related Report
      2013 Final Research Report
  • [Presentation] Too many suspects, too much burstiness: A meta-analysis of key-word detection statistics for stylometry2013

    • Author(s)
      田畑 智司
    • Organizer
      言語研究と統計2013
    • Place of Presentation
      統計数理研究所
    • Related Report
      2013 Final Research Report
  • [Presentation] Stylometry of Collaborations: Dickens, Collins and their collaborations2013

    • Author(s)
      Tomoji Tabata
    • Organizer
      PALA 2013: International Conference of the Poetics and Linguistics Association
    • Place of Presentation
      University of Heidelberg, Germany
    • Related Report
      2013 Final Research Report
  • [Presentation] Burrows's Delta and the stylometry of collaborations2013

    • Author(s)
      田畑 智司
    • Organizer
      統計数理研究所言語系共同利用研究班合同研究会
    • Place of Presentation
      西南学院大学
    • Related Report
      2013 Final Research Report
  • [Presentation] Opening up a New Perspective for Text Analysis in the Digital Age2013

    • Author(s)
      Tomoji Tabata
    • Organizer
      Humanities Studies in the Digital Age and the Role of Buddhist Studies
    • Place of Presentation
      University of Tokyo
    • Related Report
      2013 Final Research Report
  • [Presentation] マイニングとテクスト研究2013

    • Author(s)
      田畑 智司
    • Organizer
      英語コーパス学会春季シンポジウム『私のコーパス利用』
    • Place of Presentation
      大阪大学豊中キャンパス
    • Related Report
      2013 Annual Research Report
  • [Presentation] Stylometry of Collaborations: Dickens, Collins and their collaborations2013

    • Author(s)
      Tomoji Tabata
    • Organizer
      PALA (Poetics and Linguistics Association) 2013
    • Place of Presentation
      University of Heidelberg
    • Related Report
      2013 Annual Research Report
  • [Presentation] Opening up a New Perspective for Text Analysis in the Digital Age2013

    • Author(s)
      Tomoji Tabata
    • Organizer
      Humanities Studies in the Digital Age and the Role of Buddhist Studies
    • Place of Presentation
      東京大学本郷キャンパス
    • Related Report
      2013 Annual Research Report
    • Invited
  • [Presentation] Too many suspects, too much burstiness: A meta-analysis of key-word-detection statistics for stylometry2013

    • Author(s)
      田畑 智司
    • Organizer
      言語研究と統計2013
    • Place of Presentation
      統計数理研究所
    • Related Report
      2012 Research-status Report
  • [Presentation] Key words and textometry: Are key words really "key" words?2012

    • Author(s)
      田畑 智司
    • Organizer
      計量的言語研究の諸相
    • Place of Presentation
      北海道大学大学院メディア・コミュニケーション研究院
    • Year and Date
      2012-09-19
    • Related Report
      2013 Final Research Report
  • [Presentation] テクストマイニングからテクスト分析へ:Wilkie Collins との共著作品におけるCharles Dickens の文体を計る2012

    • Author(s)
      田畑 智司
    • Organizer
      言語研究と統計2012
    • Place of Presentation
      統計数理研究所
    • Year and Date
      2012-03-07
    • Related Report
      2013 Final Research Report
  • [Presentation] Detecting Stylistic Differences in Collaborative Writings: Random Forests + Burrows2012

    • Author(s)
      Tomoji Tabata
    • Organizer
      Delta on Dickens, Collins and their co-authored texts', Australasian Digital Humanities Conference 2012
    • Place of Presentation
      Australian National University, ACT, Australia. (Long paper)
    • Related Report
      2013 Final Research Report
  • [Presentation] Approaching Dickens's Style through Random Forests2012

    • Author(s)
      Tomoji Tabata
    • Organizer
      Digital Humanities 2012: International Conference of the Alliance of Digital Humanities Organizations
    • Place of Presentation
      University of Hamburg, Germany
    • Related Report
      2013 Final Research Report
  • [Presentation] Digital Enhancements to the Dickens Lexicon2012

    • Author(s)
      Tomoji Tabata
    • Organizer
      The Bicentennial International Dickens Fellowship Conference
    • Place of Presentation
      University of Portsmouth, UK
    • Related Report
      2013 Final Research Report
  • [Presentation] Text-mining Linguistic Variations from a Diachronic Perspective: An experiment in textometry2012

    • Author(s)
      Tomoji Tabata
    • Organizer
      JADH 2012 Conference "Inheriting Humanities"
    • Place of Presentation
      University of Tokyo, Japan
    • Related Report
      2013 Final Research Report
  • [Presentation] The State of Digital Humanities in Japan: Its history, development, and future perspective2012

    • Author(s)
      Tomoji Tabata
    • Organizer
      International Conference of Digital Archives and Digital Humanities
    • Place of Presentation
      National Taiwan University, Taipei
    • Related Report
      2013 Final Research Report
    • Invited
  • [Presentation] Approaching Dickens's Style through Random Forests2012

    • Author(s)
      Tabata, Tomoji
    • Organizer
      Digital Humanities 2012
    • Place of Presentation
      University of Hamburg, Germany
    • Related Report
      2012 Research-status Report
  • [Presentation] Digital Enhancements to the Dickens Lexicon2012

    • Author(s)
      Tabata, Tomoji
    • Organizer
      Dickens Fellowship Bicentennial International Conference 2012
    • Place of Presentation
      University of Portsmouth, UK
    • Related Report
      2012 Research-status Report
  • [Presentation] キーワード分析とテクスト統計2012

    • Author(s)
      田畑 智司
    • Organizer
      統計数理研究所共同利用研究班合同報告会
    • Place of Presentation
      北海道大学
    • Related Report
      2012 Research-status Report
  • [Presentation] Dickens と Collins の共著作品への文体統計学的アプローチ2012

    • Author(s)
      田畑 智司
    • Organizer
      第93回 情報処理学会・人文科学とコンピュータ研究会発表会
    • Place of Presentation
      奄美市立奄美博物館
    • Related Report
      2011 Research-status Report
  • [Presentation] テクストマイニングからテクスト分析へ:Wilkie Collinsとの共著作品におけるCharles Dickensの文体を計る2012

    • Author(s)
      田畑 智司
    • Organizer
      言語研究と統計2012
    • Place of Presentation
      統計数理研究所
    • Related Report
      2011 Research-status Report
  • [Presentation] Detecting Stylistic Differences in Collaborative Writings: Random Forests + Burrows’ Delta on Dickens, Collins and Their Co-Authored Texts2012

    • Author(s)
      Tabata, Tomoji
    • Organizer
      Digital Humanities Australasia 2012
    • Place of Presentation
      Australian National University
    • Related Report
      2011 Research-status Report
  • [Presentation] Using random forests to identify Dickensian style2011

    • Author(s)
      Tomoji Tabata
    • Organizer
      Language Individuation: A symposium in honour of John Burrows
    • Place of Presentation
      University of Newcastle, NSW, Australia
    • Related Report
      2013 Final Research Report
  • [Presentation] Investigating Dickensian Style through Random Forests2011

    • Author(s)
      Tomoji Tabata
    • Organizer
      Middle and Modern English Corpus Linguistics Conference(MMECL 2011)
    • Place of Presentation
      大阪大学中之島センター
    • Related Report
      2013 Final Research Report
  • [Presentation] Statistical text-mining on English Woman's Journal2011

    • Author(s)
      Tomoji Tabata, Harold Short, Gerhard Brey, Maki Miyake, Yuichiro Kobayashi, José Miguel Monteiro Vieira, Matteo Romanello
    • Organizer
      Osaka Symposium on Digital Humanities 2011
    • Place of Presentation
      Graduate School of Language and Culture, Osaka University
    • Related Report
      2013 Final Research Report
  • [Presentation] Text-Mining in Corpus Stylistics: Spotlighting Linguistic Variations in the Inaugural Addresses of U.S. Presidents2011

    • Author(s)
      Tabata, Tomoji
    • Organizer
      Modern Linguistics Association in Korea
    • Place of Presentation
      韓国・大田大学
    • Related Report
      2011 Research-status Report
  • [Presentation] Using Random Forests to identify Dickensian style2011

    • Author(s)
      Tabata, Tomoji
    • Organizer
      Language Individuation: A Symposium in Honour of John Burrows
    • Place of Presentation
      University of Newcastle, NSW, Australia
    • Related Report
      2011 Research-status Report
  • [Presentation] Investigating Dickensian style through Random Forests2011

    • Author(s)
      Tabata, Tomoji
    • Organizer
      Middle and Modern English Corpus Linguistics 2011
    • Place of Presentation
      大阪大学中之島センター
    • Related Report
      2011 Research-status Report
  • [Presentation] Statistical Text-Mining on English Woman’s Journal2011

    • Author(s)
      Tabata, Tomoji, Harold Short, Gerhard Brey, Maki Miyake, Jose Miguel Vieira, Yu’ichiro Kobayashi, Matteo Romanello
    • Organizer
      Osaka Symposium on Digital Humanities 2011
    • Place of Presentation
      大阪大学
    • Related Report
      2011 Research-status Report
  • [Book] 言語研究のためのテキストマイニング2014

    • Author(s)
      田畑 智司・岸江 信介
    • Publisher
      ひつじ書房
    • Related Report
      2013 Final Research Report
  • [Book] Advancing Digital Humanities2014

    • Author(s)
      Bode, K. and Arthur, P. L
    • Publisher
      Palgrave(in press)
    • Related Report
      2013 Final Research Report
  • [Book] 言語研究のためのテキストマイニング2014

    • Author(s)
      田畑 智司・岸江 信介(編)
    • Total Pages
      200
    • Publisher
      ひつじ書房
    • Related Report
      2013 Annual Research Report
  • [Book] これからのコロケーション研究(共著)「第4章文体とコロケーション(田畑 智司)」2012

    • Author(s)
      堀 正広
    • Publisher
      ひつじ書房
    • Related Report
      2013 Final Research Report

URL: 

Published: 2011-08-05   Modified: 2019-07-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi