
Machine-learning Approaches to Corpus Stylistics: Towards the Creation of International Collaborative Network

Research Project

Project/Area Number 18H00675
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Single-year Grants
Section General
Review Section Basic Section 02080: English linguistics-related
Research Institution Osaka University

Principal Investigator

Tabata Tomoji  Osaka University, Graduate School of Humanities (Division of Language and Culture), Professor (10249873)

Project Period (FY) 2018-04-01 – 2022-03-31
Project Status Completed (Fiscal Year 2023)
Budget Amount
¥15,730,000 (Direct Cost: ¥12,100,000, Indirect Cost: ¥3,630,000)
Fiscal Year 2021: ¥4,030,000 (Direct Cost: ¥3,100,000, Indirect Cost: ¥930,000)
Fiscal Year 2020: ¥4,550,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥1,050,000)
Fiscal Year 2019: ¥2,730,000 (Direct Cost: ¥2,100,000, Indirect Cost: ¥630,000)
Fiscal Year 2018: ¥4,420,000 (Direct Cost: ¥3,400,000, Indirect Cost: ¥1,020,000)
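Keywords Corpus stylistics / International collaboration / Research network formation / Natural language processing / Digital humanities / Topic modeling / Word embedding / Linguistic stylistics / Machine learning / Visualization / Text analysis / Text mining / Creation of an international collaboration platform / Topic model / International collaboration hub / Text analysis methodology / Style / Language of fiction / Forum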
Outline of Final Research Achievements

In this study, we applied machine learning-based natural language processing techniques, notably topic modeling and word embedding, to a corpus of Late Modern English novels, and on that basis proposed a new model for corpus stylistics research. The approach brought to light topics and groups of low-frequency words that traditional corpus linguistics methods might overlook, offering a quantitative foothold for exploring 'issues of meaning' that were difficult to approach with conventional methods. Building on the methodologies and insights from this research, we have begun forming a collaborative foundation for constructing a large-scale international research network with academic partners in Japan and abroad.

Academic Significance and Societal Importance of the Research Achievements

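This research project has distinctive academic significance in that it used a research model combining machine learning-based natural language processing with linguistic stylistic analysis. In particular, by widening the scope for quantitative approaches to the 'domain of meaning', it made it possible to analyze the style of Late Modern English prose from a perspective different from earlier work, and can be said to have built a bridge between the humanities and digital methods. On the societal side, we believe the project deserves high regard for creating a foundation that enables findings and research methods to be shared and exchanged through research collaboration networks, international as well as domestic. This may contribute to the early formation of cross-disciplinary joint research and to exchanges among researchers, and the fusion of scholarly insight with technical methods may open up new research areas.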

Report

(5 results)
  • 2023 Final Research Report (PDF)
  • 2021 Annual Research Report
  • 2020 Annual Research Report
  • 2019 Annual Research Report
  • 2018 Annual Research Report

Research Products

(33 results)

Int'l Joint Research (3 results), Journal Article (4 results, of which Peer Reviewed: 1), Presentation (21 results, of which Int'l Joint Research: 13, Invited: 12), Book (4 results), Funded Workshop (1 result)

  • [Int'l Joint Research] University of Würzburg (Germany)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] Netherlands Institute for Advanced Study in the Humanities and Social Sciences (NIAS) (Netherlands)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] University of Würzburg / University of Trier (Germany)

    • Related Report
      2018 Annual Research Report
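  • [Journal Article] "Distant Reading" of British Classic Fiction through Probabilistic Topic Modeling (確率論的トピックモデリングによるBritish classic fictionの「遠読」) (2023)

    • Author(s)
      Tomoji Tabata
    • Journal Title

      Studies in English Literature: Regional Branches Combined Issue (英文學研究 支部統合号)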

      Volume: 16 Pages: 36-48

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Latent Topics in British Classic Fiction: Using LDA to classify texts into meaningful groups (2020)

    • Author(s)
      Tomoji Tabata
    • Journal Title

      Language and Culture Joint Research Project (言語文化共同研究プロジェクト)

      Volume: 2019 Pages: 47-58

    • DOI

      10.18910/76991

    • NAID

      120006883716

    • Year and Date
      2020-07-31
    • Related Report
      2019 Annual Research Report
  • [Journal Article] Mapping Dickens’s Style in the Network of Words, Topics, and Texts (2018)

    • Author(s)
      Tomoji Tabata
    • Journal Title

      Text Mining and Digital Humanities 2017 (テクストマイニングとデジタルヒューマニティーズ 2017)

      Volume: 2018

    • Related Report
      2018 Annual Research Report
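  • [Journal Article] Digital Humanities: Extending Language and Culture Studies with the Digital (Digital Humanities: デジタルで拡張する言語文化学研究) (2018)

    • Author(s)
      Tomoji Tabata
    • Journal Title

      Text Mining and Digital Humanities 2017 (テクストマイニングとデジタルヒューマニティーズ 2017)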

      Volume: 2018

    • Related Report
      2018 Annual Research Report
  • [Presentation] Using topic models to explore body language in Dickens’s literature and journalism (2023)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Poetics and Linguistics Association Annual International Conference (PALA2023) "Green stylistics" University of Bologna, Italy
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Digital Humanities and Literary Linguistics: Using topic modelling to facilitate an empirical interchange of insights (2023)

    • Author(s)
      Tomoji Tabata
    • Organizer
      English Language and Literature Association of Korea International Conference (ELLAK2023), Hanyang University, Seoul
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Exploring body language in Dickens’s fiction through topic modelling (2023)

    • Author(s)
      Tomoji Tabata
    • Organizer
      49th Conference of the Japan Association for English Corpus Studies (30th Anniversary Conference)
    • Related Report
      2021 Annual Research Report
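  • [Presentation] "Distant Reading" of British Classic Fiction through Probabilistic Topic Modeling (確率論的トピックモデリングによるBritish classic fictionの「遠読」), in the symposium "Research on English Language and British and American Literature, and English Education, in the Digital Age" (2022)

    • Author(s)
      Tomoji Tabata
    • Organizer
      74th Conference of the Chugoku-Shikoku Branch of the English Literary Society of Japan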
    • Related Report
      2020 Annual Research Report
  • [Presentation] Digital Humanities as/and Computational Science (2022)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Building Digital Humanities (Western Sydney University)
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Different paths to the same peak: Digital humanities and Spitzerian stylistics (2021)

    • Author(s)
      Tomoji Tabata
    • Organizer
      The Poetics and Linguistics Association International Conference PALA 2021 Nottingham
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Language Action Types and the Semantics of Texts: Using Rhetorical Annotation to Classify Texts into Meaningful Groups (2020)

    • Author(s)
      Tomoji Tabata
    • Organizer
      2020 Korea-Japan Symposium on Digital Humanities
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
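  • [Presentation] "Zooming In and Zooming Out: Digital Humanities and the 'Reading' of Texts" (「ズームイン・ズームアウト―デジタルヒューマニティーズとテクストの「読み」―」) (2020)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Gale Symposium 2020: "2nd Invitation to Digital Humanities"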
    • Related Report
      2019 Annual Research Report
    • Invited
  • [Presentation] Digital Humanities as Non-Linear Reading: Style in classic British fiction (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      DADH 2019: The Tenth International Conference of Digital Archives and Digital Humanities
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] “Zooming in and zooming out”: Digital humanities and the (macro-/micro-) reading of texts (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Digital Humanities Lecture at National Chengchi University, Taipei, Taiwan
    • Related Report
      2019 Annual Research Report
    • Invited
  • [Presentation] Dickens, Collins and their Collaborations: Pinpointing style change in collaborative texts (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Trans Media World Literature Institute International Colloquium: Transhumanism, Trans Media, World Literature, and Digital Humanities, Dongguk University, Seoul, South Korea
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Experimental Stylometry (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Stylometry Workshop Amsterdam at the Netherlands Institute for Advanced Study in the Humanities and Social Sciences (NIAS, Amsterdam, the Netherlands)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Tracing Thematic Transition in Dickens’s Literature and Journalism (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      The Poetics and Linguistics Association International Conference PALA 2019 Liverpool
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Reading texts non-linearly: Classic British fiction and Dickens (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Gale Digital Humanities Day at the British Library
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Corpus approach to semantic style: Body language, n-grams, and topics (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Osaka Symposium on Corpus Stylistics
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Word Vectors and Semantic Style in Classic Fiction (2019)

    • Author(s)
      Tomoji Tabata
    • Organizer
      "Language Research and Statistics 2019" (「言語研究と統計2019」)
    • Related Report
      2018 Annual Research Report
  • [Presentation] Dickens in Vector Space: Word Embeddings and Semantic Profiling of Style (2018)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Poetics And Linguistics Association (PALA) 2018
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Collaborative Texts under a Stylometric Microscope: Investigating Texts of Mixed Authorship (2018)

    • Author(s)
      Tomoji Tabata
    • Organizer
      44th Conference of the Japan Association for English Corpus Studies
    • Related Report
      2018 Annual Research Report
  • [Presentation] Lexical Diversity in Classic British Fiction (2018)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Osaka-Wurzburg Collaborative Workshop: Cross-Linguistics Perspectives on Complexity in Literary Texts
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Stylometry and Classic British Fiction (2018)

    • Author(s)
      Tomoji Tabata
    • Organizer
      114th Conference of the Japan Society of Stylistics
    • Related Report
      2018 Annual Research Report
    • Invited
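  • [Presentation] How the Digital Changes "Reading": Text, Data, and Distant Reading (デジタルが変える「読み」― テクスト、データ、ディスタントリーディング ―) (2018)

    • Author(s)
      Tomoji Tabata
    • Organizer
      Gale Symposium 2018: "An Invitation to Digital Humanities"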
    • Related Report
      2018 Annual Research Report
    • Invited
  • [Book] Text Mining and Digital Humanities 2022 (テクストマイニングとデジタルヒューマニティーズ2022) (2023)

    • Author(s)
      Tomoji Tabata (ed.)
    • Total Pages
      78
    • Publisher
      Graduate School of Humanities, Osaka University
    • Related Report
      2021 Annual Research Report
  • [Book] Text Mining and Digital Humanities 2021 (テクストマイニングとデジタルヒューマニティーズ 2021) (2021)

    • Author(s)
      Tomoji Tabata (ed.)
    • Total Pages
      68
    • Publisher
      Graduate School of Language and Culture, Osaka University
    • Related Report
      2019 Annual Research Report
  • [Book] Text Mining and Digital Humanities 2020 (テクストマイニングとデジタルヒューマニティーズ 2020) (2020)

    • Author(s)
      Tomoji Tabata (ed.)
    • Total Pages
      96
    • Publisher
      Graduate School of Language and Culture, Osaka University
    • Related Report
      2019 Annual Research Report
  • [Book] Text Mining and Digital Humanities 2017 (テクストマイニングとデジタルヒューマニティーズ2017) (2018)

    • Author(s)
      Tomoji Tabata, 杉山 真央, 土村 成美
    • Total Pages
      90
    • Publisher
      Graduate School of Language and Culture, Osaka University
    • Related Report
      2018 Annual Research Report
  • [Funded Workshop] Digital Humanities Workshop Osaka 2020 (2020)

    • Related Report
      2019 Annual Research Report

Published: 2018-04-23   Modified: 2025-01-30  
