• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A Sequence-to-sequence Model based Dissimilarity Measurement for Clustering Structural Data

Research Project

Project/Area Number 18K18068
Research Category

Grant-in-Aid for Early-Career Scientists

Allocation TypeMulti-year Fund
Review Section Basic Section 61010:Perceptual information processing-related
Research InstitutionTokyo University of Agriculture and Technology

Principal Investigator

NGUYENTUAN CUONG  東京農工大学, 工学(系)研究科(研究院), 特任助教 (10814246)

Project Period (FY) 2018-04-01 – 2021-03-31
Project Status Completed (Fiscal Year 2020)
Budget Amount *help
¥4,030,000 (Direct Cost: ¥3,100,000、Indirect Cost: ¥930,000)
Fiscal Year 2020: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2019: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2018: ¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000)
Keywordsclustering / online handwriting / offline handwriting / generative sequence / sequence to sequence / handwritten answers / mathematical expressions / handwriting recognition / handwriting / mathematical expression / weakly supervised / hierarchical features / CNN / dissimilarity / semi-supervised learning / sequential data / structural data
Outline of Final Research Achievements

We have finished applying the proposed generative sequence dissimilarity for clustering of handwritten mathematical answers. The method outperforms other global feature based clustering methods such as Deep Embedded Clustering and Siamese Networks. The method also superior to the hierarchical feature representations by Convolutional Neural Networks with Weakly Supervised learning. We have applied the method for clustering online handwritten mathematical expressions and show that the proposed metric is better than edit distance metric. We continue to apply the method for a large-scale database of offline handwritten mathematical answers collected from the preliminary examination.

Academic Significance and Societal Importance of the Research Achievements

大規模な手書き数式回答をクラスタリングできると,同じ回答がグループ化され,採点する手間を削減し,採点の効率と信頼性を向上する.本研究は,クラスタリングするため,構造認識とそれらの関係を学習することの重要性を強調している.

Report

(4 results)
  • 2020 Annual Research Report   Final Research Report ( PDF )
  • 2019 Research-status Report
  • 2018 Research-status Report
  • Research Products

    (24 results)

All 2021 2020 2019 2018

All Journal Article (7 results) (of which Int'l Joint Research: 4 results,  Peer Reviewed: 7 results,  Open Access: 3 results) Presentation (17 results) (of which Int'l Joint Research: 13 results)

  • [Journal Article] Clustering of Handwritten Mathematical Expressions for Computer-Assisted Marking2021

    • Author(s)
      Vu tran minh KHUONG,Khanh Minh PHAN, Huy Quang UNG, Cuong Tuan NGUYEN, Masaki NAKAGAWA
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E104.D Issue: 2 Pages: 275-284

    • DOI

      10.1587/transinf.2020EDP7087

    • NAID

      130007979511

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2021-02-01
    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Clustering online handwritten mathematical expressions2021

    • Author(s)
      Ung Huy Quang、Nguyen Cuong Tuan、Phan Khanh Minh、Khuong Vu Tran Minh、Nakagawa Masaki
    • Journal Title

      Pattern Recognition Letters

      Volume: 146 Pages: 267-275

    • DOI

      10.1016/j.patrec.2021.03.027

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A CNN based Localization and Classification Features for Clustering Offline Handwritten Mathematical Expression2020

    • Author(s)
      Cuong Tuan Nguyen, Vu Tran Minh Khuong, Hung Tuan Nguyen, Masaki Nakagawa
    • Journal Title

      Pattern Recognition Letters

      Volume: Vol. 131 Pages: 113-120

    • DOI

      10.1016/j.patrec.2019.12.015

    • Related Report
      2020 Annual Research Report 2019 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] An attention-based row-column encoder-decoder model for text recognition in Japanese historical documents2020

    • Author(s)
      Nam Tuan Ly, Cuong Tuan Nguyen, Masaki Nakagawa
    • Journal Title

      Pattern Recognition Letters

      Volume: Vol. 136 Pages: 134-141

    • DOI

      10.1016/j.patrec.2020.05.026

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] A unified method for augmented incremental recognition of online handwritten Japanese and English text2019

    • Author(s)
      Nguyen Cuong Tuan、Indurkhya Bipin、Nakagawa Masaki
    • Journal Title

      International Journal on Document Analysis and Recognition (IJDAR)

      Volume: 23 Issue: 1 Pages: 53-72

    • DOI

      10.1007/s10032-019-00343-y

    • Related Report
      2019 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Robust and real-time stroke order evaluation using incremental stroke context for learners to write Kanji characters correctly2019

    • Author(s)
      Nguyen Cuong Tuan、Nguyen Hung Tuan、Mita Kazuhiro、Nakagawa Masaki
    • Journal Title

      Pattern Recognition Letters

      Volume: 121 Pages: 140-149

    • DOI

      10.1016/j.patrec.2018.07.025

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Text-independent writer identification using convolutional neural network2019

    • Author(s)
      Nguyen Hung Tuan、Nguyen Cuong Tuan、Ino Takeya、Indurkhya Bipin、Nakagawa Masaki
    • Journal Title

      Pattern Recognition Letters

      Volume: 121 Pages: 104-112

    • DOI

      10.1016/j.patrec.2018.07.022

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Presentation] GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers2021

    • Author(s)
      Huy Quang Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen and Masaki Nakagawa
    • Organizer
      Proceedings of the International Conference on Document Analysis and Recognition, ICDAR2021
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Global Context for improving recognition of Online Handwritten Mathematical Expressions2021

    • Author(s)
      Cuong Tuan Nguyen, Thanh-Nghia Truong, Hung Tuan Nguyen and Masaki Nakagawa
    • Organizer
      Proceedings of the International Conference on Document Analysis and Recognition, ICDAR2021
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Online trajectory recovery from offline handwritten Japanese kanji characters of multiple strokes2020

    • Author(s)
      Hung Tuan Nguyen, Tsubasa Nakamura, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      Proceedings of International Conference on Pattern Recognition, ICPR2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Online Handwritten Mathematical Symbol Segmentation and Recognition by Bidirectional Context2020

    • Author(s)
      Cuong Tuan Nguyen, Thanh Nghia Truong, Huy Quang Ung, Masaki Nakagawa
    • Organizer
      Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Attention Augmented Convolutional Recurrent Network for Handwritten Japanese Text Recognition2020

    • Author(s)
      Nam Tuan Ly, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Improvement of end-to-end offline handwritten mathematical expression recognition by weakly supervised learning2020

    • Author(s)
      Thanh Nghia Truong, Cuong Tuan Nguyen, Khanh Minh Phan, Masaki Nakagawa
    • Organizer
      Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Siamese Network based approach for matching various sizes of excavated wooden fragments2020

    • Author(s)
      Trung Tan Ngo, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Semantic Segmentation-based Method for Handwritten Japanese Text Recognition2020

    • Author(s)
      Kha Cong Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] CNN and 2D BLSTM for Local Feature Extraction in Handwritten Mathematical Expression Recognition2020

    • Author(s)
      Kei Morizumi, Cuong Tuan Nguyen, Ikuko Shimizu, Masaki Nakagawa
    • Organizer
      IEICE Technical Report, PRMU2020-56
    • Related Report
      2020 Annual Research Report
  • [Presentation] Improvement of a Computer Automated Marking System for Online Handwritten Math Answers employing Machine Recognition2019

    • Author(s)
      Xiuyu Liang, Shinsuke Sasaki, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      IEICE Technical Report, PRMU2018-135
    • Related Report
      2018 Research-status Report
  • [Presentation] 日本古典籍くずし字文書の文字列認識2019

    • Author(s)
      佐藤旭,小林心,Nam Tuan Ly, Cuong Tuan Nguyen, 北本朝展,中川正樹
    • Organizer
      情報処理学会技術報告, Vol. 2019-CH-119, No. 4, pp. 1-4
    • Related Report
      2018 Research-status Report
  • [Presentation] Text Segmentation for Japanese Historical Documents using Fully Convolutional Neural Network2019

    • Author(s)
      Hung Tuan Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa, Asanobu Kitamoto
    • Organizer
      情報処理学会技術報告, Vol. 2019-CH-119, No. 4, pp. 5-9
    • Related Report
      2018 Research-status Report
  • [Presentation] Bag-of-features for clustering online handwritten mathematical expressions2018

    • Author(s)
      Huy Quang Ung, Vu Tran Minh Khuong, Anh Duc Le, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      International Conference on Pattern Recognition and Artificial Intelligent
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Clustering Offline Handwritten Mathematical Answers for Computer-Assisted Marking2018

    • Author(s)
      Vu Tran Minh Khuong, Huy Quang Ung, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      International Conference on Pattern Recognition and Artificial Intelligent
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Online Japanese Handwriting recognizers using Recurrent Neural Networks2018

    • Author(s)
      Hung Tuan Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      International Conference on Frontiers of Handwritting Recognition
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Training and End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns2018

    • Author(s)
      Nam Tuan Ly, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      International Conference on Frontiers of Handwritting Recognition
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] ICFHR2018-Competition on Vietnamese Online Handwritten Text Recognition using HANDS-VNOnDB2018

    • Author(s)
      Hung Tuan Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa
    • Organizer
      International Conference on Frontiers of Handwritting Recognition
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research

URL: 

Published: 2018-04-23   Modified: 2022-01-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi