• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Research on annotation for the development of a parsed corpus of Japanese with a special focus on complex sentences

Research Project

Project/Area Number 15H03210
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Linguistics
Research InstitutionNational Institute for Japanese Language and Linguistics

Principal Investigator

Pardeshi Prashant  大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, 教授 (00374984)

Co-Investigator(Kenkyū-buntansha) 岸本 秀樹  神戸大学, 人文学研究科, 教授 (10234220)
野田 尚史  大学共同利用機関法人人間文化研究機構国立国語研究所, 日本語教育研究領域, 教授 (20144545)
吉本 啓  東北大学, 高度教養教育・学生支援機構, 教授 (50282017)
窪田 悠介  大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, 准教授 (60745149)
長崎 郁  大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, プロジェクト非常勤研究員 (70401445)
バトラー アラステア  弘前大学, 人文社会科学部, 准教授 (90588873)
HORN S.W.  大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, プロジェクト非常勤研究員 (70801538)
影山 太郎  大学共同利用機関法人人間文化研究機構国立国語研究所, 理論・対照研究領域, その他 (80068288)
Project Period (FY) 2015-04-01 – 2020-03-31
Project Status Completed (Fiscal Year 2019)
Budget Amount *help
¥16,640,000 (Direct Cost: ¥12,800,000、Indirect Cost: ¥3,840,000)
Fiscal Year 2019: ¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)
Fiscal Year 2018: ¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000)
Fiscal Year 2017: ¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)
Fiscal Year 2016: ¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)
Fiscal Year 2015: ¥4,810,000 (Direct Cost: ¥3,700,000、Indirect Cost: ¥1,110,000)
Keywords関係節 / 複文 / アノテーション / 統語・意味解析付きコーパス / コーパス研究 / 従属節 / 統語・意味解析 / アノテーション研究 / 言語学 / コーパス言語学 / タグ付け作業 / 統語・意味解析情報タグ付きコーパス
Outline of Final Research Achievements

The goal of this research project was to build a large-scale parsed corpus (treebank) of modern Japanese that would enable the users to search and retrieve complex sentences involving relative clauses and subordinate clauses, which are peculiar characteristic of Japanese, through the development of annotation method and corpus search tools. This task was jointly carried out with the collaborative research project “Development of and Linguistic Research with a Parsed Corpus of Japanese” at NINJAL. A corpus of 40,831 sentences (560,098 words) was build and is made available for free access at the following site: http://npcmj.ninjal.ac.jp/. The corpus can be searched with the following search tools: http://npcmj.ninjal.ac.jp/explorer/
http://npcmj.ninjal.ac.jp/interfaces/index_en.html

Academic Significance and Societal Importance of the Research Achievements

本研究の成果として約4万文(56万語)規模の統語・意味解析付きコーパスが開発され、公開された。このコーパスでは現代日本語のテクストに対し文の統語・意味解析情報を付与し、複数な検索ツールが用意されており、多様な日本語の機能語や句構造、節の諸類型および複雑な構文を大量の言語データから検索・抽出して研究に活用することが可能である。また、初心用の検索ツール完備により、大学などで日本語の統語論の教育にも利用できる。さらに、このコーパスでは日英両言語表記で公開されており、日本語表記に精通してない海外の研究者も利用できるので、日本語研究の国際化に貢献できる。

Report

(6 results)
  • 2019 Annual Research Report   Final Research Report ( PDF )
  • 2018 Annual Research Report
  • 2017 Annual Research Report
  • 2016 Annual Research Report
  • 2015 Annual Research Report
  • Research Products

    (61 results)

All 2019 2018 2017 2016 2015 Other

All Journal Article (20 results) (of which Int'l Joint Research: 5 results,  Peer Reviewed: 7 results,  Open Access: 10 results,  Acknowledgement Compliant: 1 results) Presentation (37 results) (of which Int'l Joint Research: 5 results) Remarks (2 results) Funded Workshop (2 results)

  • [Journal Article] 統語・意味情報付きコーパスの開発に関する研究:中国語名詞句の解析について2019

    • Author(s)
      周振,吉本啓
    • Journal Title

      国立国語研究所論集

      Volume: 17 Pages: 35-65

    • NAID

      120006707635

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Probing the nature of an island constraint with a parsed corpus: A case study on the Coordinate Structure Constraint in Japanese2019

    • Author(s)
      Yusuke Kubota and Ai Kubota
    • Journal Title

      Linguistic Issues in Language Technology

      Volume: 18-3 Pages: 1-24

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Parsed corpus as a source for testing generalizations in Japanese2019

    • Author(s)
      Hideki Kishimoto and Prashant Pardeshi
    • Journal Title

      Linguistic Issues in Language Technology

      Volume: 18-2 Pages: 1-24

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] From discourse to logic with Stanford CoreNLP and Treebank Semantics2019

    • Author(s)
      Alastair Butler
    • Journal Title

      the Sixteenth International Workshop of Logic and Engineering of Natural Language Semantics (LENLS 16)

      Volume: 16 Pages: 1-4

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] NPCMJに対する述語項構造シソーラスの意味役割と概念フレームの付与2019

    • Author(s)
      竹内孔一,Alastair Butler,長崎郁,Prashant Pardeshi
    • Journal Title

      研究報告自然言語処理(NL)

      Volume: 2019-NL-241 Pages: 1-4

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] 統語・意味解析情報を伴う日本語コーパスの開発とその日本語教育・学習への応用2018

    • Author(s)
      吉本啓
    • Journal Title

      日本言語文藝研究

      Volume: 18 Pages: 1-11

    • Related Report
      2018 Annual Research Report
  • [Journal Article] Parsed Annotation with Semantic Calculation2018

    • Author(s)
      Stephen Wright Horn and Alastair Butler
    • Journal Title

      Proceedings of the 17th International Workshop on Treebanks and Linguistic Theories (TLT)

      Volume: 17 Pages: 39-52

    • Related Report
      2018 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Derived mappings for FrameNet construction from a parsed corpus of Japanese.2018

    • Author(s)
      Stephen Wright Horn, Alastair Butler, Iku Nagasaki and Kei Yoshimoto
    • Journal Title

      LREC 2018 Proceedings, International FrameNet Workshop, 11th edition of the Language Resources and Evaluation Conference

      Volume: 11 Pages: 28-32

    • Related Report
      2018 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] 前提投射の実例のツリーバンクによる検索2018

    • Author(s)
      窪田悠介, 峯島宏次
    • Journal Title

      日本言語学会第157回大会予稿集

      Volume: 157 Pages: 282-287

    • Related Report
      2018 Annual Research Report
  • [Journal Article] From meaning representations to syntactic trees2016

    • Author(s)
      Alastair Butler
    • Journal Title

      Proceedings of the Thirteenth International Workshop of Logic and Engineering of Natural Language Semantics13(LENLS13)

      Volume: 13 Pages: 147-160

    • Related Report
      2016 Annual Research Report
  • [Journal Article] DynamicPower at SemEval-2016 Task 8: Processing syntactic parse trees with a Dynamic Semantics core2016

    • Author(s)
      Alastair Butler
    • Journal Title

      Proceedings of SemEval-2016

      Volume: 10 Pages: 1148-1153

    • Related Report
      2016 Annual Research Report
  • [Journal Article] Deterministic natural language generation from meaning representations for machine translation2016

    • Author(s)
      Alastair Butler
    • Journal Title

      Proceedings of the 2nd Workshop on Semantics-Driven Machine Translation

      Volume: 2 Pages: 1-9

    • Related Report
      2016 Annual Research Report
  • [Journal Article] 統語・意味解析情報付き日本語コーパスのアノテー ション2016

    • Author(s)
      アラステア・バトラー・吉本啓・岸本秀樹・プラシャント・パルデシ
    • Journal Title

      言語処理学会第22回年次大会発表論文集

      Volume: 22 Pages: 589-592

    • Related Report
      2015 Annual Research Report
    • Open Access
  • [Journal Article] 中国語連体修飾節構文の解析2016

    • Author(s)
      周振・Alastair Butler・吉本啓
    • Journal Title

      言語処理学会第22回年次大会発表論文集

      Volume: 22 Pages: 809-812

    • Related Report
      2015 Annual Research Report
  • [Journal Article] 中国人日本語学習者のVN型二字漢語動詞の習 得に関する研究: VN型二字漢語動詞の一体性の視点から2015

    • Author(s)
      周振・吉本啓
    • Journal Title

      国際文化研究

      Volume: 21 Pages: 99-112

    • NAID

      120005626290

    • Related Report
      2015 Annual Research Report
  • [Journal Article] 統語・意味解析情報付き日本語 コーパスの開発2015

    • Author(s)
      プラシャント・パルデシ・Alastair Butler・吉本啓 ・岸本秀樹
    • Journal Title

      言語処理学会第21回年次大会発表論文集

      Volume: 21 Pages: 20-23

    • Related Report
      2015 Annual Research Report
    • Open Access
  • [Journal Article] Large scale semantic represent ation with flame graphs2015

    • Author(s)
      Alastair Butler and Kei Yoshimoto
    • Journal Title

      言語処理学会第21回年次大会発表論文集

      Volume: 21 Pages: 301-304

    • Related Report
      2015 Annual Research Report
    • Open Access / Acknowledgement Compliant
  • [Journal Article] Coindexed null elements for a Japanese parsed corpus2015

    • Author(s)
      Alastair Butler, Shota Hiayama and Kei Yoshimoto
    • Journal Title

      言語処理学会第21回年次大会発表論文集

      Volume: 21 Pages: 708-711

    • Related Report
      2015 Annual Research Report
    • Open Access
  • [Journal Article] 中国語意味解析コーパス構築のための句レベルのスコープアノテーション -文の構成要素の間のコントロール関係の同定および否定の作用域の制御を中心に-2015

    • Author(s)
      周振・Alastair Butler・吉本啓
    • Journal Title

      言語処理学会第21回年次大会発表論文集

      Volume: 21 Pages: 856-859

    • Related Report
      2015 Annual Research Report
    • Open Access
  • [Journal Article] 中国語結果構文の解析2015

    • Author(s)
      周振・Alastair Butler・吉本啓
    • Journal Title

      言語科学会第17回年次国際大会, ハンドブック

      Volume: 17 Pages: 56-59

    • Related Report
      2015 Annual Research Report
  • [Presentation] 高度文法情報付きコーパスとその日本語研究への応用2019

    • Author(s)
      プラシャント・パルデシ,吉本啓,窪田悠介,峯島宏次,三好伸芳,井戸美里,大久保弥
    • Organizer
      関西言語学会第44回大会シンポジウム「高度文法情報付きコーパスとその日本語研究への応用」
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 前提投射の統語コーパスでの検索2019

    • Author(s)
      窪田悠介,峯島宏次
    • Organizer
      関西言語学会第44回大会シンポジウム「高度文法情報付きコーパスとその日本語研究への応用」
    • Related Report
      2019 Annual Research Report
  • [Presentation] Development of a parsed corpus and its applications to linguistic research and education2019

    • Author(s)
      Prashant Pardeshi, Kei Yoshimoto, Susanne Miyata, Koichi Takeuchi, and Hideki Kishimoto
    • Organizer
      言語科学会第21回国際年次大会 (JSLS 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Developing a Japanese syntax textbook as part of NPCMJ Project2019

    • Author(s)
      Hideki Kishimoto
    • Organizer
      言語科学会第21回国際年次大会 (JSLS 2019)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Treebanks and grammatical research: A case study on the Coordinate Structure Constraint in Japanese2019

    • Author(s)
      Yusuke Kubota
    • Organizer
      2019 Joint Conference of Linguistic Societies in Korea and the 26th Joint Workshop on Linguistics and Language Processing (JWLLP-26)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Parsed annotation with semantic calculation.2018

    • Author(s)
      Alastair Butler, Stephen Wright Horn.
    • Organizer
      17th International Workshop on Treebanks and Linguistic Theory, University of Oslo, Norway.
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A unified interface for exploring English and Japanese.2018

    • Author(s)
      Alastair Butler.
    • Organizer
      日本英語学会第36回大会シンポジウム「ツリーバンク開発と言語理論」, 横浜国立大学
    • Related Report
      2018 Annual Research Report
  • [Presentation] English/Japanese contrastive study based on normalisation, a step in the semantic processing.2018

    • Author(s)
      Stephen Wright Horn, Alastair Butler.
    • Organizer
      日本英語学会第36回大会シンポジウム「ツリーバンク開発と言語理論」, 横浜国立大学.
    • Related Report
      2018 Annual Research Report
  • [Presentation] 言語研究と統語・意味解析情報付きコーパス2018

    • Author(s)
      吉本啓
    • Organizer
      日本英語学会第36回大会シンポジウム「ツリーバンク開発と言語理論」, 横浜国立大学.
    • Related Report
      2018 Annual Research Report
  • [Presentation] 構文検索ツールNPCMJ Explorer」2018

    • Author(s)
      鈴木彩香, 窪田悠介, プラシャント・パルデシ.
    • Organizer
      日本英語学会第36回大会シンポジウム『ツリーバンク開発と言語理論』, 横浜国立大学
    • Related Report
      2018 Annual Research Report
  • [Presentation] reebank meets descriptive grammar: The NPCMJ Explorer.2018

    • Author(s)
      Ayaka Suzuki, Yusuke Kubota, Prashant Pardeshi
    • Organizer
      Grammar and Corpora 2018, University of Paris-Diderot
    • Related Report
      2018 Annual Research Report
  • [Presentation] 中国語存現文の解析2018

    • Author(s)
      周振, アラステア・バトラー, 吉本啓
    • Organizer
      言語科学会第20回国際年次大会, 文京学院大学
    • Related Report
      2018 Annual Research Report
  • [Presentation] 統語コーパスと言語研究 ー構文検索ツールNPCMJ Explorerからの視点ー2018

    • Author(s)
      鈴木彩香, 窪田悠介, プラシャント・パルデシ
    • Organizer
      Morphology and Lexicon Forum 2018, 筑波大学
    • Related Report
      2018 Annual Research Report
  • [Presentation] The lexical semantics of control: A view from Japanese.2018

    • Author(s)
      Yusuke Kubota
    • Organizer
      Workshop on the clause structure of Japanese and Korean (The 25th International Conference on Head-Driven Phrase Structure Grammar), University of Tokyo.
    • Related Report
      2018 Annual Research Report
  • [Presentation] Using treebanks for linguistic research.2018

    • Author(s)
      Yusuke Kubota
    • Organizer
      he English Linguistic Society of Japan 11th International Spring Forum, Hokkaido University.
    • Related Report
      2018 Annual Research Report
  • [Presentation] Reconsidering the Coordinate Structure Constraint once again: Corpus-based evidence2018

    • Author(s)
      Yusuke Kubota
    • Organizer
      Conceptual and Methodological Alternatives in Theoretical Linguistics
    • Related Report
      2017 Annual Research Report
  • [Presentation] 統語解析情報付きコーパス検索用インタフェースの開発2018

    • Author(s)
      アラステア・バトラー, 長崎郁, スティーブン・ライト・ホーン, プラシャント・パルデシ, 吉本啓
    • Organizer
      言語処理学会第24回年次大会
    • Related Report
      2017 Annual Research Report
  • [Presentation] Seeding lexical semantics: resources using parsed corpora2017

    • Author(s)
      Alastair Butler, Stephen Wright Horn, Iku Nagasaki
    • Organizer
      NINJAL International Symposium "Exploiting Parsed Corpora: Apllication in Research, Pedagogy and Processing"
    • Related Report
      2017 Annual Research Report
  • [Presentation] Parsed corpus as a source for testing generalizations in Japanese syntax.2017

    • Author(s)
      Hideki Kishimoto, Prashant Pardeshi
    • Organizer
      NINJAL International Symposium "Exploiting Parsed Corpora: Apllication in Research, Pedagogy and Processing"
    • Related Report
      2017 Annual Research Report
  • [Presentation] A case study on the Coordinate Structure Constraint in Japanese2017

    • Author(s)
      Yusuke Kubota, Ai Kubota
    • Organizer
      NINJAL International Symposium "Exploiting Parsed Corpora: Apllication in Research, Pedagogy and Processing"
    • Related Report
      2017 Annual Research Report
  • [Presentation] Exploiting coreferential information in NPCMJ for L2 reading of Japanese texts2017

    • Author(s)
      Kei Yoshimoto, Akiko Takahashi
    • Organizer
      NINJAL International Symposium "Exploiting Parsed Corpora: Apllication in Research, Pedagogy and Processing"
    • Related Report
      2017 Annual Research Report
  • [Presentation] Treebank Semantics parsed corpus series2017

    • Author(s)
      Alastair Butler, Stephen Wright Horn
    • Organizer
      NINJAL International Symposium "Exploiting Parsed Corpora: Apllication in Research, Pedagogy and Processing"
    • Related Report
      2017 Annual Research Report
  • [Presentation] 「ツリーバンク検索への「UNIX 的」アプローチ」2017

    • Author(s)
      窪田悠介
    • Organizer
      国語研究所言語資源活用ワークショップ
    • Related Report
      2017 Annual Research Report
  • [Presentation] 統語・意味解析情報付き日本語コーパスのアノテーション2016

    • Author(s)
      アラステア・バトラー、 吉本 啓、 岸本 秀樹、 プラシャント・パルデシ
    • Organizer
      言語処理学会 第22回年次大会
    • Place of Presentation
      東北大学
    • Year and Date
      2016-03-07
    • Related Report
      2015 Annual Research Report
  • [Presentation] Treebank annotation of FraCaS and JSeM2016

    • Author(s)
      Alastair Butler, Ai Kubota, Shota Hiyama and Kei Yoshimoto
    • Organizer
      Logic and Engineering of Natural Language Semantics (LENLS 13)
    • Related Report
      2016 Annual Research Report
  • [Presentation] From meaning representations to syntactic trees2016

    • Author(s)
      Alastair Butler
    • Organizer
      Logic and Engineering of Natural Language Semantics (LENLS 13)
    • Related Report
      2016 Annual Research Report
  • [Presentation] Deterministic natural language generation from meaning representations for machine translation2016

    • Author(s)
      Alastair Butler
    • Organizer
      2nd Workshop on Semantics-Driven Machine Translation
    • Related Report
      2016 Annual Research Report
  • [Presentation] ワークショップ「イントロダクション」統語・意味解析情報付き日本語コーパスの構築に向けて2016

    • Author(s)
      プラシャント・バルデシ
    • Organizer
      日本言語学会第153回大会
    • Related Report
      2016 Annual Research Report
  • [Presentation] ワークショップ「まとめと将来の展望」統語・意味解析情報付き日本語コーパスの構築に向けて2016

    • Author(s)
      プラシャント・バルデシ
    • Organizer
      日本言語学会第153回大会
    • Related Report
      2016 Annual Research Report
  • [Presentation] ワークショップ「アノテーション方式とコーパスの特色」統語・意味解析情報付き日本語コーパスの構築に向けて2016

    • Author(s)
      吉本啓
    • Organizer
      日本言語学会第153回大会
    • Related Report
      2016 Annual Research Report
  • [Presentation] ワークショップ「デモンストレーション」統語・意味解析情報付き日本語コーパスの構築に向けて2016

    • Author(s)
      アラステア・バトラー、窪田愛、窪田悠介
    • Organizer
      日本言語学会第153回大会
    • Related Report
      2016 Annual Research Report
  • [Presentation] Parsed Corpus Semantics2016

    • Author(s)
      Alastair Butler
    • Organizer
      New Landscapes in Theoretical Computational Linguistics
    • Related Report
      2016 Annual Research Report
  • [Presentation] A parsed corpus of Japanese enriched to reach levels of semantic analysis2016

    • Author(s)
      Alastair Butler, Shiro Akasegawa, Prashant Pardeshi and Kei Yoshimoto
    • Organizer
      Brandeis University, Boston, USA Colloquium
    • Related Report
      2016 Annual Research Report
  • [Presentation] 形式意味論と計算言語学の最近の動向2016

    • Author(s)
      窪田悠介
    • Organizer
      第13回東海意味論研究会
    • Related Report
      2016 Annual Research Report
  • [Presentation] 統語・意味解析情報を伴う日本語コーパスの開発とその日本語教育・学習への応用2016

    • Author(s)
      吉本啓
    • Organizer
      台湾日本語言文藝研究学会第15回定例学会
    • Related Report
      2016 Annual Research Report
  • [Presentation] 文の統語・意味解析情報をタグ付けした日本語構造体コーパスの開発2015

    • Author(s)
      吉本啓・プラシャント・パルデシ
    • Organizer
      関西言語学会ワークショップ
    • Place of Presentation
      神戸大学
    • Year and Date
      2015-06-13
    • Related Report
      2015 Annual Research Report
  • [Presentation] Development of Japanese Corpus Tagged with Syntactic and Semantic In formation2015

    • Author(s)
      Kei Yoshimoto and Alastair Butler
    • Organizer
      The 18th Joint Workshop on Linguistics and Language Processing. Korean Society for Language and Information. Kyung Hee University, Seoul
    • Place of Presentation
      韓国
    • Year and Date
      2015-05-22
    • Related Report
      2015 Annual Research Report
  • [Remarks] NPCMJ(NINJAL Parsed Corpus of Modern Japanese)

    • URL

      http://npcmj.ninjal.ac.jp/

    • Related Report
      2019 Annual Research Report 2018 Annual Research Report
  • [Remarks] NPCMJ Explorer

    • URL

      http://npcmj.ninjal.ac.jp/explorer/

    • Related Report
      2019 Annual Research Report 2018 Annual Research Report
  • [Funded Workshop] “Development of a parsed corpus and its applications to linguistic research and education”2019

    • Related Report
      2019 Annual Research Report
  • [Funded Workshop] Unshared Task on Theory and System analysis with FraCaS, MultiFraCaS and JSeM Test Suites2016

    • Related Report
      2016 Annual Research Report

URL: 

Published: 2015-04-16   Modified: 2022-11-04  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi