
Research for unsupervised acoustic pattern discovery with zero resources

Research Project

Project/Area Number 17K00237
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Multi-year Fund
Section General
Research Field Perceptual information processing
Research Institution Nara Institute of Science and Technology

Principal Investigator

Sakti Sakriani  Nara Institute of Science and Technology, Graduate School of Science and Technology, Specially Appointed Associate Professor (00395005)

Co-Investigator (Kenkyū-buntansha) Nakamura Satoshi  Nara Institute of Science and Technology, Data Science Center, Professor (30263429)
Project Period (FY) 2017-04-01 – 2020-03-31
Project Status Completed (Fiscal Year 2019)
Budget Amount
¥4,550,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥1,050,000)
Fiscal Year 2019: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2018: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
Fiscal Year 2017: ¥1,950,000 (Direct Cost: ¥1,500,000, Indirect Cost: ¥450,000)
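Keywords speech recognition / zero-resource speech technology / electroencephalography (EEG) / speech translation / speech information processing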
Outline of Final Research Achievements

With the Tokyo Olympics and Paralympics approaching, language barriers for tourists are becoming a critical problem to overcome. Speech recognition and speech translation are readily available, but only for the few languages that have large resources. Here, we addressed the zero-resource speech problem, in which neither language-specific knowledge nor transcribed data is available. To understand an unknown language, we analyzed and investigated how the human brain processes language. In addition, we developed a closed-loop speech chain model based on deep learning so that the machine can learn to listen while speaking. This is the first deep learning model that integrates human speech recognition and production behavior.

Academic Significance and Societal Importance of the Research Achievements

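We succeeded in building zero-resource models for an African language (Xitsonga) and for Indonesian languages. We also participated in the 2017 and 2019 Zero Resource Speech Challenges and achieved top-ranking results with the proposed methods. In addition, we developed a closed-loop speech chain model based on deep learning so that the machine can learn to listen while speaking. In 2019, we also cooperated with UNESCO on a world languages consortium. The results of this research were published at top conferences (ASRU, Interspeech, ICASSP) and in a top journal (IEEE/ACM TASLP). A patent on the speech chain model was also obtained.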

Report

(4 results)
  • 2019 Annual Research Report   Final Research Report ( PDF )
  • 2018 Research-status Report
  • 2017 Research-status Report
  • Research Products

    (77 results)


All Int'l Joint Research (2 results) Journal Article (22 results) (of which Int'l Joint Research: 20 results,  Peer Reviewed: 22 results,  Open Access: 12 results) Presentation (51 results) (of which Int'l Joint Research: 38 results) Patent(Industrial Property Rights) (2 results)

  • [Int'l Joint Research] University of Indonesia/Institute Technology Bandung (Indonesia)

    • Related Report
      2018 Research-status Report
  • [Int'l Joint Research] University of Indonesia/Institute Technology Bandung(Indonesia)

    • Related Report
      2017 Research-status Report
  • [Journal Article] Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation (2020)

    • Author(s)
      Johanes Effendi, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E103.D Issue: 3 Pages: 674-683

    • DOI

      10.1587/transinf.2019EDP7065

    • NAID

      130007804146

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2020-03-01
    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Recurrent Neural Network Compression Based on Low-Rank Tensor Representation (2020)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E103.D Issue: 2 Pages: 435-449

    • DOI

      10.1587/transinf.2019EDP7040

    • NAID

      130007793590

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2020-02-01
    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Machine Speech Chain (2020)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: - Pages: 976-989

    • DOI

      10.1109/taslp.2020.2977776

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception (2019)

    • Author(s)
      Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E102.D Issue: 2 Pages: 383-391

    • DOI

      10.1587/transinf.2018EDP7293

    • NAID

      130007586194

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2019-02-01
    • Related Report
      2019 Annual Research Report 2018 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] End-to-End Speech Recognition Sequence Training with Reinforcement Learning (2019)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEEE Access

      Volume: 7 Pages: 79758-79769

    • DOI

      10.1109/access.2019.2922617

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Positive Emotion Elicitation in Chat-Based Dialogue Systems (2019)

    • Author(s)
      Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 27 Issue: 4 Pages: 866-877

    • DOI

      10.1109/taslp.2019.2900910

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Synchronization between overt speech envelope and EEG oscillations during imagined speech (2019)

    • Author(s)
      Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Neuroscience Research

      Volume: 153 Pages: 48-55

    • DOI

      10.1016/j.neures.2019.04.004

    • NAID

      120006847155

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Electroencephalogram-Based Single Trial Detection of Language Expectation Violations in Listening to Speech (2019)

    • Author(s)
      Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Frontiers in Computational Neuroscience

      Volume: 13 Pages: 1-11

    • DOI

      10.3389/fncom.2019.00015

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition (2018)

    • Author(s)
      Nurul Lubis, Dessi Lestari, Sakriani Sakti, Ayu Purwarianti, and Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E101.D Issue: 8 Pages: 2092-2100

    • DOI

      10.1587/transinf.2017EDP7362

    • NAID

      130007429560

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2018-08-01
    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Quality Prediction of Synthesized Speech Based on Tensor Structured EEG Signals (2018)

    • Author(s)
      Hayato Maki, Sakriani Sakti, Hiroki Tanaka, Satoshi Nakamura
    • Journal Title

      PLOS ONE

      Volume: 13 Issue: 6 Pages: 1-13

    • DOI

      10.1371/journal.pone.0193521

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Sequence-to-Sequence Models for Emphasis Speech Translation (2018)

    • Author(s)
      Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 26 Issue: 10 Pages: 1873-1883

    • DOI

      10.1109/taslp.2018.2846402

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling (2018)

    • Author(s)
      Michael Heck, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 26 Issue: 11 Pages: 2027-2042

    • DOI

      10.1109/taslp.2018.2852500

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Learning Supervised Feature Transformations on Zero Resources for Improved Acoustic Unit Discovery (2018)

    • Author(s)
      Michael Heck, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E101.D Issue: 1 Pages: 205-214

    • DOI

      10.1587/transinf.2017EDP7175

    • NAID

      130006301188

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Graph Regularized Tensor Factorization for Single-trial EEG Analysis (2018)

    • Author(s)
      Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

      Volume: Vol. 1

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Quality Prediction of Synthesized Speech Based on Tensor Structured EEG Signals (2018)

    • Author(s)
      Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      PLOS ONE

      Volume: Vol. 1

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Subject-independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response during Speech Perception (2017)

    • Author(s)
      Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of INTERSPEECH 2017

      Volume: Vol.1 Pages: 2431-2435

    • DOI

      10.21437/interspeech.2017-854

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Speech Recognition Features Based On Deep Latent Gaussian Models (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017)

      Volume: Vol.1 Pages: 1-6

    • DOI

      10.1109/mlsp.2017.8168174

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of the 8th International Joint Conference on Natural Language Processing

      Volume: Vol. 1 Pages: 431-440

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] End-to-End Speech Recognition with Local Monotonic Attention (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of NIPS Workshop on Machine Learning for Audio Signal Processing (ML4Audio)

      Volume: None

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Listening while Speaking: Speech Chain by Deep Learning (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of IEEE Automatic Speech Recognition and Understanding (ASRU)

      Volume: Vol. 1 Pages: 301-308

    • DOI

      10.1109/asru.2017.8268950

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Attention-based Wav2Text with Feature Transfer Learning (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of IEEE Automatic Speech Recognition and Understanding (ASRU)

      Volume: Vol. 1 Pages: 309-315

    • DOI

      10.1109/asru.2017.8268951

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Feature Optimized DPGMM Clustering for Unsupervised Subword Modeling: A Contribution to ZeroSpeech 2017 (2017)

    • Author(s)
      Michael Heck, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceedings of IEEE Automatic Speech Recognition and Understanding (ASRU)

      Volume: Vol. 1 Pages: 740-746

    • DOI

      10.1109/asru.2017.8269011

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Presentation] Neural Incremental Speech Recognition Through Attention Transfer (2020)

    • Author(s)
      Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      ANLP
    • Related Report
      2019 Annual Research Report
  • [Presentation] From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning (2020)

    • Author(s)
      Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      ANLP
    • Related Report
      2019 Annual Research Report
  • [Presentation] Speech-to-Speech Translation without Text (2020)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      ANLP
    • Related Report
      2019 Annual Research Report
  • [Presentation] Neural Machine Translation with Acoustic Embedding (2019)

    • Author(s)
      Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Zero-shot Code-switching ASR and TTS with Multilingual Machine Speech Chain (2019)

    • Author(s)
      Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Listening while Speaking: Improving ASR through Multimodal Chain (2019)

    • Author(s)
      Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Speech-to-speech Translation between Untranscribed Unknown Languages (2019)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Dialogue Model and Response Generation for Emotion Improvement Elicitation (2019)

    • Author(s)
      Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
    • Organizer
      the 3rd Conversational AI workshop - NeurIPS 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Recognition and Translation of Code-switching Speech Utterances (2019)

    • Author(s)
      Sahoko Nakayama, Takatomo Kano, Andros Tjandra, Sakriani Sakti, and Satoshi Nakamura
    • Organizer
      Oriental COCOSDA 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Phoneme Level Speaking Rate Variation on Waveform Generation using GAN-TTS (2019)

    • Author(s)
      Mayuko Okamoto, Sakriani Sakti, and Satoshi Nakamura
    • Organizer
      Oriental COCOSDA 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Sequence-to-sequence Learning via Attention Transfer for Incremental Speech Recognition (2019)

    • Author(s)
      Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Interspeech 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019 (2019)

    • Author(s)
      Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura
    • Organizer
      Interspeech 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework (2019)

    • Author(s)
      Tomoya Yanagita, Sakriani Sakti and Satoshi Nakamura
    • Organizer
      SSW
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Speech Quality Evaluation of Synthesized Japanese Speech Using EEG (2019)

    • Author(s)
      Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura
    • Organizer
      Interspeech 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] EEG Analysis towards Evaluating Synthesized Speech Quality (2019)

    • Author(s)
      Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura
    • Organizer
      IEEE Engineering in Medicine and Biology Society
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Cross-lingual speech-based ToBI label generation using bidirectional LSTM (2019)

    • Author(s)
      Marco Vetter, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] End-to-end feedback loss in speech chain framework via straight-through estimator (2019)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition (2019)

    • Author(s)
      Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura
    • Organizer
      IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
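  • [Presentation] Proposal of a Training Strategy for Speech Translation Using Curriculum Learning (2019)

    • Author(s)
      Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      The 25th Annual Meeting of the Association for Natural Language Processing (NLP2019)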
    • Related Report
      2018 Research-status Report
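  • [Presentation] Recognition of Japanese-English Code-Switching Speech Using Semi-Supervised Learning Based on Machine Speech Chain (2019)

    • Author(s)
      Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      The 25th Annual Meeting of the Association for Natural Language Processing (NLP2019)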
    • Related Report
      2018 Research-status Report
  • [Presentation] Affect-sensitive Dialogue Response Generation for Positive Emotion Elicitation (2019)

    • Author(s)
      Nurul Lubis, Sakriani Sakti, Koichiro Yoshino and Satoshi Nakamura
    • Organizer
      The 25th Annual Meeting of the Association for Natural Language Processing (NLP2019)
    • Related Report
      2018 Research-status Report
  • [Presentation] Enhancing Neural Machine Translation with Image-based Paraphrase Augmentation (2019)

    • Author(s)
      Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh and Satoshi Nakamura
    • Organizer
      The 25th Annual Meeting of the Association for Natural Language Processing (NLP2019)
    • Related Report
      2018 Research-status Report
  • [Presentation] Speaker and Emotion Recognition of TV-Series Data Using Multimodal and Multitask Deep Learning (2019)

    • Author(s)
      Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Lestari and Satoshi Nakamura
    • Organizer
      The 25th Annual Meeting of the Association for Natural Language Processing (NLP2019)
    • Related Report
      2018 Research-status Report
  • [Presentation] Unifying Speech Recognition and Generation with Machine Speech Chain (2019)

    • Author(s)
      Andros Tjandra, Sakriani Sakti and Satoshi Nakamura
    • Organizer
      The 25th Annual Meeting of the Association for Natural Language Processing (NLP2019)
    • Related Report
      2018 Research-status Report
  • [Presentation] Sequence-to-Sequence ASR Optimization via Reinforcement Learning (2018)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Graph regularized tensor factorization for single-trial EEG analysis (2018)

    • Author(s)
      Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Construction of English-French Multimodal Affective Conversational Corpus from Drama TV Series (2018)

    • Author(s)
      Sashi Novitasari, Quoc-Truong Do, Sakriani Sakti, Dessi Lestari, Satoshi Nakamura
    • Organizer
      LREC 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Multi-modal Multi-task Deep Learning for Speaker and Emotion Recognition of TV-series Data (2018)

    • Author(s)
      Sashi Novitasari, Quoc-Truong Do, Sakriani Sakti, Dessi Lestari, Satoshi Nakamura
    • Organizer
      Oriental COCOSDA 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Japanese-English Code-Switching Speech Data Construction (2018)

    • Author(s)
      Sahoko Nakayama, Takatomo Kano, Quoc-Truong Do, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Oriental COCOSDA 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Single-trial Detection of Semantic Anomalies from EEG during Listening to Spoken Sentences (2018)

    • Author(s)
      Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2018)
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Compressing End-to-End ASR Networks by Tensor-Train Decomposition (2018)

    • Author(s)
      Takuma Mori, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Interspeech 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load (2018)

    • Author(s)
      Bin Wu, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      SLTU 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Incremental TTS for Japanese Language (2018)

    • Author(s)
      Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Interspeech 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Machine Speech Chain with One-shot Speaker Adaptation (2018)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Interspeech 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS (2018)

    • Author(s)
      Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE SLT
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model (2018)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE SLT
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Toward Multi-features Emphases Speech Translation: Assessment of Human Emphases Production and Perception with Speech and Text Clues (2018)

    • Author(s)
      Quoc-Truong Do, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE SLT
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Using Spoken Word Posterior Features in Neural Machine Translation (2018)

    • Author(s)
      Kaho Osamura, Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IWSLT 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Multi-paraphrase Augmentation to Leverage Neural Caption Translation (2018)

    • Author(s)
      Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura
    • Organizer
      IWSLT 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Machine Speech Chain with Deep Learning (2018)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Acoustical Society of Japan 2018 Autumn Meeting
    • Related Report
      2018 Research-status Report
  • [Presentation] Multimodal Database of Negative Emotion Recovery in Dyadic Interactions: Construction and Analysis (2018)

    • Author(s)
      Nurul Lubis, Michael Heck, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
    • Organizer
      Acoustical Society of Japan 2018 Autumn Meeting
    • Related Report
      2018 Research-status Report
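  • [Presentation] Construction of Japanese-English Code-Switching Speech Data (2018)

    • Author(s)
      Sahoko Nakayama, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Acoustical Society of Japan 2018 Autumn Meeting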
    • Related Report
      2018 Research-status Report
  • [Presentation] Visual Description Paraphrase Corpus Creation with Various Elementary Operations (2018)

    • Author(s)
      Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Acoustical Society of Japan 2018 Autumn Meeting
    • Related Report
      2018 Research-status Report
  • [Presentation] Graph Regularized Tensor Factorization for Single-trial EEG Analysis (2018)

    • Author(s)
      Hayato Maki
    • Organizer
      International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Subject-independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response during Speech Perception (2017)

    • Author(s)
      Hiroki Watanabe
    • Organizer
      INTERSPEECH
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Speech Recognition Features Based On Deep Latent Gaussian Models (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti
    • Organizer
      IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2017)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing (2017)

    • Author(s)
      Andros Tjandra
    • Organizer
      the International Joint Conference on Natural Language Processing (IJCNLP 2017)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] End-to-End Speech Recognition with Local Monotonic Attention (2017)

    • Author(s)
      Andros Tjandra
    • Organizer
      NIPS Workshop on Machine Learning for Audio Signal Processing (ML4Audio)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Listening while Speaking: Speech Chain by Deep Learning (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Attention-based Wav2Text with Feature Transfer Learning (2017)

    • Author(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Feature Optimized DPGMM Clustering for Unsupervised Subword Modeling: A Contribution to ZeroSpeech 2017 (2017)

    • Author(s)
      Michael Heck, Sakriani Sakti
    • Organizer
      IEEE Automatic Speech Recognition and Understanding (ASRU)
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Patent(Industrial Property Rights)] Speech chain apparatus, computer program, and DNN speech recognition/synthesis mutual learning method (2018)

    • Inventor(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Industrial Property Rights Holder
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Industrial Property Rights Type
      Patent
    • Patent Publication Number
      2019-120841
    • Filing Date
      2018
    • Acquisition Date
      2019
    • Related Report
      2019 Annual Research Report
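  • [Patent(Industrial Property Rights)] National University Corporation Nara Institute of Science and Technology (2017)

    • Inventor(s)
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Industrial Property Rights Holder
      Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Industrial Property Rights Type
      Patent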
    • Industrial Property Number
      2018-001538
    • Filing Date
      2017
    • Acquisition Date
      2018
    • Related Report
      2017 Research-status Report


Published: 2017-04-28   Modified: 2021-02-19  
