2017 Fiscal Year Annual Research Report
A Study on Social Context Summarization
Project/Area Number | 15K16048 |
Research Institution | Japan Advanced Institute of Science and Technology |
Principal Investigator | NGUYEN MinhLe, Japan Advanced Institute of Science and Technology, Graduate School of Advanced Science and Technology, Associate Professor (30509401) |
Project Period (FY) | 2015-04-01 – 2018-03-31 |
Keywords | Sentence extraction / Social context / Deep Learning / Sentence compression / LSTM / Co-factorization |
Outline of Annual Research Achievements |
We showed that social context (user-generated content such as comments or tweets, together with third-party sources) can support the extraction of high-quality summaries. The models, evaluated on three data sets, achieved promising results in terms of ROUGE scores. We also proposed an Integer Linear Programming (ILP) method that incorporates constraints formulated from social context information; the results showed that it improves ROUGE scores over state-of-the-art models for social context summarization. In addition, we developed an unsupervised method based on matrix co-factorization, which captures the mutual information between sentences and comments by assuming that they share hidden topics, and which achieves promising performance (illustrative sketches of both formulations are given below). For sentence compression, we combined an enhanced Bidirectional Long Short-Term Memory (Bi-LSTM) model with well-known classifiers such as CRF and SVM; these models were trained and evaluated on public English and Vietnamese data sets and achieved state-of-the-art performance. Furthermore, we proposed deep learning models that operate on tree and graph structures. These models are effective for source code analysis and can also be applied to natural language processing problems, including social context summarization.
|
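As an illustration of the ILP approach described above, the following is a minimal sketch using the PuLP solver. The objective, the word budget, the comment-support constraint, and the names `salience`, `lengths`, and `comment_support` are illustrative assumptions, not the exact formulation used in the study.

```python
from pulp import LpProblem, LpMaximize, LpVariable, lpSum

def ilp_summarize(salience, lengths, comment_support, max_words=100, min_support=1):
    """Select sentences that maximize total salience under a word budget.
    A sentence may be selected only if enough comments relate to it
    (a simple stand-in for constraints derived from social context)."""
    n = len(salience)
    prob = LpProblem("social_context_summarization", LpMaximize)
    x = [LpVariable(f"x_{i}", cat="Binary") for i in range(n)]  # 1 = sentence i selected
    prob += lpSum(salience[i] * x[i] for i in range(n))              # objective
    prob += lpSum(lengths[i] * x[i] for i in range(n)) <= max_words  # length budget
    for i in range(n):
        # if x_i = 1, sentence i must be supported by at least min_support comments
        prob += min_support * x[i] <= comment_support[i]
    prob.solve()
    return [i for i in range(n) if x[i].value() == 1]

# Example: four candidate sentences with salience scores, word counts,
# and the number of comments associated with each.
summary = ilp_summarize(salience=[0.9, 0.4, 0.7, 0.2],
                        lengths=[20, 15, 30, 10],
                        comment_support=[3, 0, 2, 1],
                        max_words=50)
```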
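The matrix co-factorization model can likewise be sketched as a joint non-negative factorization of a sentence-term matrix and a comment-term matrix that share a topic-term factor, which is one way to realize the shared-hidden-topic assumption. This is a minimal sketch under that assumption, not the exact objective of the study; it uses standard multiplicative updates for the squared-error objective ||X - UH||^2 + ||Y - VH||^2.

```python
import numpy as np

def co_factorize(X, Y, n_topics=10, n_iter=200, eps=1e-9, seed=0):
    """Jointly factorize a sentence-term matrix X and a comment-term matrix Y
    with a shared topic-term matrix H: X ~ U H and Y ~ V H (all non-negative)."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_topics))
    V = rng.random((Y.shape[0], n_topics))
    H = rng.random((n_topics, X.shape[1]))
    for _ in range(n_iter):
        U *= (X @ H.T) / (U @ H @ H.T + eps)
        V *= (Y @ H.T) / (V @ H @ H.T + eps)
        H *= (U.T @ X + V.T @ Y) / (U.T @ U @ H + V.T @ V @ H + eps)
    return U, V, H

def rank_sentences(U):
    """Rank sentences by the strength of their topic memberships."""
    return np.argsort(-U.sum(axis=1))
```

Sentences would then be ranked by their topic activations in U, and the top-ranked ones selected into the summary subject to a length limit.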
Remarks |
The system for sentence compression using deep learning.
|
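The sentence compression system mentioned in the Remarks treats compression as token-level keep/delete tagging. The sketch below shows only a plain Bi-LSTM tagger in PyTorch; the reported model additionally combines the Bi-LSTM encoder with a CRF or SVM classifier, and the layer sizes and the keep/delete label convention here are assumptions for illustration.

```python
import torch
import torch.nn as nn

class BiLSTMCompressor(nn.Module):
    """Token-level keep/delete tagger for sentence compression (sketch)."""
    def __init__(self, vocab_size, emb_dim=128, hidden_dim=256, n_labels=2):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True,
                            bidirectional=True)
        self.out = nn.Linear(2 * hidden_dim, n_labels)  # keep vs. delete

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) word indices; index 0 is padding
        hidden, _ = self.lstm(self.emb(token_ids))
        return self.out(hidden)  # (batch, seq_len, n_labels) logits

# Illustrative usage: predict one label per token and keep tokens tagged 1.
model = BiLSTMCompressor(vocab_size=20000)
token_ids = torch.randint(1, 20000, (4, 30))     # a batch of 4 sentences
keep_mask = model(token_ids).argmax(dim=-1)      # 1 = keep, 0 = delete (assumed)
```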