
Development of an English corpus with visualized interrelationships among sentences in the framework of dependency grammar

Research Project

Project/Area Number 20K00583
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Multi-year Fund
Section General
Review Section Basic Section 02060: Linguistics-related
Research Institution Meiji University

Principal Investigator

Oya Masanori  Meiji University, School of Global Japanese Studies, Professor (60318748)

Project Period (FY) 2020-04-01 – 2024-03-31
Project Status Completed (Fiscal Year 2023)
Budget Amount
¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2022: ¥130,000 (Direct Cost: ¥100,000, Indirect Cost: ¥30,000)
Fiscal Year 2021: ¥390,000 (Direct Cost: ¥300,000, Indirect Cost: ¥90,000)
Fiscal Year 2020: ¥520,000 (Direct Cost: ¥400,000, Indirect Cost: ¥120,000)
Keywords Dependency Grammar / Japanese-English Parallel Corpus / Dependency Distance / Parallel Corpus / Idea Density / Network Analysis / Centrality Measures / English Corpus / Natural Language Processing / Discourse Information / Natural Language Inference
Outline of Research at the Start

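As one attempt at linguistically motivated natural language processing research, this study aims to build an English corpus in which the interrelationships among the sentences of a text are visualized within the framework of Dependency Grammar, and to build a natural language inference system that uses this corpus. Expressing the relations between the sentences that make up a text as relations between the words in each sentence is expected to make the discourse structure of a text explicit in a form that is conceptually identical to dependency grammar. In addition, because cases in which sentences contradict one another are also included in the scope of the study, the project is expected to yield a discourse-structure representation that is both more general and finer-grained than conventional RST (Rhetorical Structure Theory).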
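The idea of expressing a relation between two sentences as a relation between words in those sentences can be pictured with a small data structure. The following is only an illustrative sketch: the class names `TokenRef` and `CrossSentenceLink`, the relation label `"contradiction"`, and the example sentences are assumptions made here, not the representation used in the project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRef:
    """Points at one word: (sentence index, token index), both 0-based."""
    sentence: int
    token: int


@dataclass(frozen=True)
class CrossSentenceLink:
    """A labeled, directed link between words of two different sentences.

    The label set is hypothetical; the source only says that contradiction
    between sentences is among the relations to be covered.
    """
    head: TokenRef
    dependent: TokenRef
    relation: str


# Two toy sentences, already tokenized.
sentences = [
    ["The", "museum", "is", "open", "today"],
    ["The", "museum", "is", "closed", "today"],
]

# One link: "open" (sentence 0) and "closed" (sentence 1) contradict each other.
links = [
    CrossSentenceLink(TokenRef(0, 3), TokenRef(1, 3), "contradiction"),
]


def describe(link, sentences):
    """Render a link as 'head -[relation]-> dependent' using the word forms."""
    head = sentences[link.head.sentence][link.head.token]
    dependent = sentences[link.dependent.sentence][link.dependent.token]
    return f"{head} -[{link.relation}]-> {dependent}"


descriptions = [describe(link, sentences) for link in links]
print(descriptions)
```

Because each link has the same head, dependent, and label shape as a within-sentence dependency, sentence-internal and cross-sentence relations could in principle be stored and drawn in one graph.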

Outline of Final Research Achievements

In this study, a Japanese-English parallel corpus was analyzed syntactically within a dependency grammar framework, with a focus on the structural characteristics of each sentence and on the general structural differences between Japanese and English. In the course of this work, the research theme shifted to dependency distances, which led to presentations at academic conferences and to papers in several scholarly journals. Notably, the frequency distribution of dependency distances was found to follow a specific distribution cross-linguistically, and the average dependency distance of English sentences produced by learners was found to vary with their proficiency level. A particularly significant result of the final year was the discovery of dependency relations whose distances are much longer than average; these differed significantly between Japanese and English and open the way to new research themes.
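To make the notion of dependency distance concrete, here is a minimal sketch under a commonly used definition: the distance of a dependency is the absolute difference between the linear positions of a word and its head, and the mean is taken over all non-root words. The function names, the head-index encoding, and the example parse are assumptions for illustration; the project's own publications distinguish several types of average and may define them differently.

```python
from collections import Counter


def dependency_distances(heads):
    """Return the dependency distance of every non-root token.

    heads[i] is the 1-based position of the head of token i+1,
    with 0 marking the root (the convention used in CoNLL-style files).
    """
    return [
        abs(head - position)
        for position, head in enumerate(heads, start=1)
        if head != 0  # the root has no head, so it has no distance
    ]


def mean_dependency_distance(heads):
    """Average the distances over all non-root tokens (0.0 if there are none)."""
    distances = dependency_distances(heads)
    return sum(distances) / len(distances) if distances else 0.0


# Hypothetical parse of "The dog chased the cat":
# The -> dog, dog -> chased, chased = root, the -> cat, cat -> chased
heads = [2, 3, 0, 5, 3]

distances = dependency_distances(heads)
mdd = mean_dependency_distance(heads)

# Frequency distribution of distances; over a whole corpus, this is the
# kind of table whose shape and long tail can be examined.
frequency = Counter(distances)

print(distances, mdd, dict(frequency))
```

Running the same functions over every sentence of a corpus and pooling the `Counter` objects gives a corpus-level distribution; dependencies far above the mean then appear as low-frequency entries at the high end.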

Academic Significance and Societal Importance of the Research Achievements

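By making the syntactic structures of Japanese and English explicit within a dependency-structure framework, and by expressing their structural properties numerically as dependency distances and comparing them, the structural differences between the two languages can be grasped more objectively than before. In addition, findings such as the observation that the mean dependency distance of English sentences produced by Japanese learners of English changes with the learners' proficiency have potential applications in English education.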

Report

(5 results)
  • 2023 Annual Research Report / Final Research Report (PDF)
  • 2022 Research-status Report
  • 2021 Research-status Report
  • 2020 Research-status Report
  • Research Products

    (14 results)


Journal Article (9 results; of which Peer Reviewed: 9, Open Access: 5) / Presentation (4 results; of which Int'l Joint Research: 4, Invited: 1) / Book (1 result)

  • [Journal Article] Low-Frequency Long-Distance Dependencies as “Long Tails” (2023)

    • Author(s)
      Oya, Masanori
    • Journal Title

      Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation (PACLIC 37)

      Volume: 37 Pages: 89-95

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Propositional Idea Density of a Japanese Text and its English Translation in a Parallel Corpus (2023)

    • Author(s)
      Masanori Oya
    • Journal Title

      Global Japanese Studies Review

      Volume: 15 Pages: 97-105

    • Related Report
      2022 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Differences of Mean Dependency Distances of English Essays Written by Learners of Different Proficiency Levels (2023)

    • Author(s)
      Masanori Oya
    • Journal Title

      Glottometrics

      Volume: 53 Pages: 24-41

    • DOI

      10.53482/2022_53_400

    • Related Report
      2022 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] The Relevance of Dependency Distances in the Study of L2 Production (2022)

    • Author(s)
      Masanori Oya
    • Journal Title

      The 41st Thailand TESOL Conference Proceedings

      Volume: -

    • Related Report
      2022 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Developing a Japanese-English Parallel Corpus of "Japan, the Beautiful and Myself" by Yasunari Kawabata (2022)

    • Author(s)
      Masanori Oya
    • Journal Title

      Global Japanese Studies Review

      Volume: 14 (1) Pages: 1-26

    • Related Report
      2021 Research-status Report
    • Peer Reviewed
  • [Journal Article] Three Types of Average Dependency Distances of Sentences in a Multilingual Parallel Corpus (2021)

    • Author(s)
      Masanori Oya
    • Journal Title

      Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC 35)

      Volume: 35 Pages: 652-661

    • Related Report
      2021 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Syntactic Similarity of the Sentences in a Multi-Lingual Parallel Corpus Based on the Euclidean Distance of Their Dependency Trees (2021)

    • Author(s)
      Masanori Oya
    • Journal Title

      Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation (PACLIC 34)

      Volume: 34 Pages: 225-233

    • Related Report
      2021 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Analysis and Quantification of the Network of Lexical Connectedness Within a Text Based on Metrics Used in Network Analysis (2021)

    • Author(s)
      Masanori Oya
    • Journal Title

      Global Japanese Studies Review

      Volume: 13 Pages: 15-38

    • Related Report
      2020 Research-status Report
    • Peer Reviewed
  • [Journal Article] Structural divergence between root elements in English-Japanese translation pairs (2020)

    • Author(s)
      Masanori Oya
    • Journal Title

      Global Japanese Studies Review

      Volume: 12 Pages: 107-126

    • NAID

      120006872554

    • Related Report
      2020 Research-status Report
    • Peer Reviewed
  • [Presentation] Low-Frequency Long-Distance Dependencies as “Long Tails” (2023)

    • Author(s)
      Oya, Masanori
    • Organizer
      The 37th Annual Meeting of the Pacific Asia Conference on Language, Information and Computation (PACLIC 37)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] The Relevance of Dependency Distances in the Study of L2 Production (2022)

    • Author(s)
      Masanori Oya
    • Organizer
      The 41st Thailand TESOL International (Virtual) Conference
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Three Types of Average Dependency Distances of Sentences in a Multilingual Parallel Corpus (2021)

    • Author(s)
      Masanori Oya
    • Organizer
      Pacific Asia Conference on Language, Information and Computation (PACLIC) 35
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Syntactic similarity of the sentences in a multi-lingual parallel corpus based on the Euclidean distance of their dependency trees (2020)

    • Author(s)
      Masanori Oya
    • Organizer
      Pacific Asia Conference on Language, Information and Computation (PACLIC) 34
    • Related Report
      2020 Research-status Report
    • Int'l Joint Research
  • [Book] 依存文法概説 (An Outline of Dependency Grammar) (2022)

    • Author(s)
      Masanori Oya (大矢政徳)
    • Total Pages
      240
    • Publisher
      Kaitakusha (開拓社)
    • Related Report
      2022 Research-status Report


Published: 2020-04-28   Modified: 2025-01-30  
