2018 Fiscal Year Annual Research Report

Next generation speech translation research

Research Project

Project/Area Number	17H06101
Research Institution	Nara Institute of Science and Technology
Principal Investigator	中村哲奈良先端科学技術大学院大学, データ駆動型サイエンス創造センター, 教授 (30263429)
Co-Investigator(Kenkyū-buntansha)	河原達也京都大学, 情報学研究科, 教授 (00234104) 猿渡洋東京大学, 大学院情報理工学系研究科, 教授 (30324974) 戸田智基名古屋大学, 情報基盤センター, 教授 (90403328) 森島繁生早稲田大学, 理工学術院, 教授 (10200411) 高道慎之介東京大学, 大学院情報理工学系研究科, 助教 (90784330) 松本裕治奈良先端科学技術大学院大学, 先端科学技術研究科, 教授 (10211575) 須藤克仁奈良先端科学技術大学院大学, 先端科学技術研究科, 准教授 (00396152) サクリアニサクティ奈良先端科学技術大学院大学, 先端科学技術研究科, 特任准教授 (00395005) 吉野幸一郎奈良先端科学技術大学院大学, 先端科学技術研究科, 助教 (70760148) 田中宏季奈良先端科学技術大学院大学, 先端科学技術研究科, 助教 (10757834)
Project Period (FY)	2017-05-31 – 2022-03-31
Keywords	音声翻訳
Outline of Annual Research Achievements	①A)音源教師有り手法である独立深層学習行列分析を新たに提案しその有効性を確認。生成モデルにスパース性を付与することで有意に雑音抑圧性能を改善可能と提示。B)常時音声認識を様々なドメインに適応すべく単語単位のEnd-to-Endモデルの適応法を研究。注意機構モデルを前向きだけでなく後向きに適用し組合せる方式を提案。C)一定時間待機型の逐次ニューラル機械翻訳の分析、英日翻訳では語順の違いが大きく影響することを確認。リアルタイム機械通訳システム構築に向けHMMベースのインクリメンタル音声合成手法を提案。スピーチチェーンに音声認識を統合。コードスイッチング検討。D)多言語機械翻訳のデータ欠落を補う手法を提案し翻訳精度を向上。訳語選択の柔軟性を向上させるニューラル機械翻訳方式を提案し小語彙条件下で性能向上。E)対話翻訳用にDialog State Tracking Challenge 2を用いた日英データセットとニューラルネットワークを用いたベースライントラッカーを構築。 ②A)音声とテキストの強調による人の知覚を主観評価で分析。音声、テキスト、動画等マルチモーダルコーパスの収集。深層学習によるマルチモーダル・マルチタスク認識の検討。B)深層学習に基づく音声特徴量変換処理と音声波形生成処理を組合せ声質変換性能の大幅な改善に成功。 ③撮影環境、顔向きを限定されないポートレート1枚から被写体の人物の顔の3次元形状、アルベドテクスチャ、ディスプレースメントマップ、スペキュラーBRDFを深層学習で精度よく推定する技術を開発。 ④違和感検出モデルの改良。意味違反のみならず統語違反の自動検出。多層パーセプロトロンにより6割程度での性能を達成。同時通訳時の通訳者の脳活動測定を実施。 ⑤同時通訳コーパス約124時間分（TED等）＋約5時間分（日本記者クラブ）を構築。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason いずれの研究項目においても、計画に遅れを取らないよう研究を進めている。特に2018年度は、同時通訳音声コーパスの構築に注力した。
Strategy for Future Research Activity	①スパース性付与型独立低ランク行列分析と独立深層学習行列分析に基づき、音声認識システムとの統合を進め、雑音下常時音声認識システムの構築を行う。ダイレクト音声翻訳、リアルタイム機械通訳システム、マルチモーダルチェーンの構築を継続する。英日間の機械翻訳に適した同時機械翻訳方式の検討を行い、構築中の同時通訳コーパスを用いた性能検証を実施する。同時通訳データのデータ拡張手法について検討する。損失関数の工夫によるニューラル機械翻訳の頑健性向上に取り組む。対話翻訳のベースラインシステムを構築し、既存の枠組みで解決可能な現象と難しい現象の切り分けを行う。対話翻訳データセットの他機関研究者向けリリースを行う。 ②マルチ特性翻訳および感情翻訳の研究を継続して行う。深層波形生成モデルを用いた声質変換システムの高精度化に取り組むとともに、音声翻訳システムへの導入を目指し、同一言語発話対を必要としない非パラレル声質変換処理への拡張に取り組む。 ③頭部のみならず、キャラクタの印象を強く左右する全身像の3次元復元にも今後取り組んでいく。 ④違和感検出の時系列を考慮したモデルの構築と、実刺激での検出を取り組む。また同時通訳の実験データを解析し、同時通訳中の認知負荷を定量化することを目指す。 ⑤ 同時通訳コーパス構築を継続する。他分野化に着目する。同時機械翻訳のプロトタイプシステムを作成する。

Research Products
(77 results)

All 2019 2018 Other

All Journal Article (10 results) (of which Int'l Joint Research: 3 results, Peer Reviewed: 9 results, Open Access: 10 results) Presentation (66 results) (of which Int'l Joint Research: 36 results, Invited: 7 results) Remarks (1 results)

[Journal Article] Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception2019
- Author(s)
  Hiroki WATANABE, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: Volume E102.D, issue 2 Pages: 383-391
- DOI
  10.1587/transinf.2018EDP7293
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Electroencephalogram-Based Single Trial Detection of Language Expectation Violations in Listening to Speech2019
- Author(s)
  Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakti Sakriani, Satoshi Nakamura
- Journal Title
  
  Frontiers in Computational Neuroscience
  
  Volume: Vol. 13 Pages: －
- DOI
  10.3389/fncom.2019.00015
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation2018
- Author(s)
  Daichi Kitamura, Shinichi Mogami, Yoshiki Mitsui, Norihiro Takamune, Hiroshi Saruwatari, Nobutaka Ono, Yu Takahashi and Kazunobu Kondo
- Journal Title
  
  EURASIP Journal on Advances in Signal Processing
  
  Volume: 2018:28 Pages: -
- DOI
  10.1186/s13634-018-0549-5
- Peer Reviewed / Open Access
[Journal Article] Quality Prediction of Synthesized Speech Based on Tensor Structured EEG Signals2018
- Author(s)
  Hayato Maki, Sakriani Sakti, Hiroki Tanaka, Satoshi Nakamura
- Journal Title
  
  PloS One
  
  Volume: 13 Pages: 1-13
- DOI
  10.1371/journal.pone.0193521
- Peer Reviewed / Open Access
[Journal Article] Bayesian multichannel audio source separation based on integrated source and spatial models.2018
- Author(s)
  K.Itakura, Y.Bando, E.Nakamura, K.Itoyama, K.Yoshii, and T.Kawahara
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 26 Pages: 831--846
- DOI
  10.1109/TASLP.2017.2789320
- Peer Reviewed / Open Access
[Journal Article] Voice Animator: Automatic Lip-Synching in Limited Animation by Audio2018
- Author(s)
  Shoichi Furukawa, Tsukasa Fukusato, Shugo Yamaguchi, Shigeo Morishima
- Journal Title
  
  ?Lecture Notes in Computer Science?book series (LNCS, volume 10714)
  
  Volume: 10714 Pages: 153-171
- DOI
  10.1007/978-3-319-76270-8_12
- Peer Reviewed / Open Access
[Journal Article] Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition2018
- Author(s)
  Nurul Lubis, Dessi Lestari, Sakriani Sakti, Ayu Purwarianti, and Satoshi Nakamura
- Journal Title
  
  Transactions on Information and Systems, Institute of Electronics, Information and Communication Engineers (IEICE)
  
  Volume: E101-D Pages: 2092-2100
- DOI
  10.1587/transinf.2017EDP7362
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Sequence-to-Sequence Models for Emphasis Speech Translation2018
- Author(s)
  Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 26 Pages: 1873 - 1883
- DOI
  10.1109/TASLP.2018.2846402
- Peer Reviewed / Open Access
[Journal Article] Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling2018
- Author(s)
  Michael Heck, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 26 Pages: 2027 - 2042
- DOI
  10.1109/TASLP.2018.2852500
- Peer Reviewed / Open Access
[Journal Article] 音声翻訳システムにおける音声変換の利用2018
- Author(s)
  高道慎之介, 戸田智基
- Journal Title
  
  日本音響学会誌
  
  Volume: 74 Pages: 535--538
- DOI
  10.20697/jasj.74.9_535
- Open Access
[Presentation] カリキュラムラーニングを用いた音声翻訳の学習戦略の提案2019
- Author(s)
  叶高朋, Sakriani Sakti, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] Machine Speech Chainに基づく半教師あり学習を用いた日英コードスイッチング音声の認識2019
- Author(s)
  中山佐保子, Andros Tjandra, Sakriani Sakti, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] Affect-sensitive Dialogue Response Generation for Positive Emotion Elicitation2019
- Author(s)
  Nurul Lubis, Sakriani Sakti, Koichiro Yoshino and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] Enhancing Neural Machine Translation with Image-based Paraphrase Augmentation2019
- Author(s)
  Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] Speaker and Emotion Recognition of TV-Series Data Using Multimodal and Multitask Deep Learning2019
- Author(s)
  Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Lestari and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] Unifying Speech Recognition and Generation with Machine Speech Chain2019
- Author(s)
  Andros Tjandra, Sakriani Sakti and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] 英日同時通訳におけるニューラル機械翻訳の検討2019
- Author(s)
  帖佐克己, 須藤克仁, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] 単語分散表現を使った誤差によるニューラル機械翻訳の学習2019
- Author(s)
  帖佐克己, 須藤克仁, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] マルチソースニューラル機械翻訳における翻訳時の原言語欠落補完2019
- Author(s)
  西村優汰, 須藤克仁, Graham Neubig, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
[Presentation] End-to-end Learning of Segmented Robot Behaviors and Descriptions2019
- Author(s)
  Kohei Wakimoto, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  SIG-SLUD
[Presentation] 音声認識を用いた字幕作成システムの改良.2019
- Author(s)
  秋田祐哉, 上乃聖, 三村正人, 河原達也.
- Organizer
  情報処理学会研究会SIG-AAC
[Presentation] 時変複素一般化ガウス分布に基づく独立深層学習行列分析2019
- Author(s)
  牧島直輝，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸，中嶋広明
- Organizer
  日本音響学会2019年春季研究発表会
[Presentation] 教師あり及び半教師あり条件下における独立深層学習行列分析の実験的評価2019
- Author(s)
  牧島直輝, 最上伸一, 高宗典玄, 高道慎之介, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, and 中嶋広明
- Organizer
  日本音響学会2019年春季研究発表会
[Presentation] 乗算型更新式に基づくランク制約付き空間共分散モデルの推定2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2019年春季研究発表会
[Presentation] ブラインド音源分離における多変量複素Student’s t 分布に基づくランク制約付き空間共分散モデルの推定2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  2018年3月度応用音響研究会
[Presentation] Reducing mismatch of WaveNet vocoder for variational autoencoder based voice conversion2019
- Author(s)
  W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang
- Organizer
  日本音響学会2019年春季研究発表会
[Presentation] Voice conversion with cyclic recurrent neural network for WaveNet fine-tuning2019
- Author(s)
  P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  日本音響学会2019年春季研究発表会
[Presentation] Independent deeply learned matrix analysis for multichannel audio source separation2018
- Author(s)
  Shinichi Mogami, Hayato Sumino, Daichi Kitamura, Norihiro Takamune, Shinnosuke Takamichi and Hiroshi Saruwatari
- Organizer
  European Signal Processing Conference (EUSIPCO)
- Int'l Joint Research / Invited
[Presentation] Vectorwise Coordinate Descent Algorithm for Spatially Regularized Independent Low-Rank Matrix Analysis2018
- Author(s)
  Yoshiki Mitsui, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Int'l Joint Research
[Presentation] Sequence-to-Sequence ASR Optimization via Reinforcement Learning2018
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Int'l Joint Research
[Presentation] Graph regularized tensor factorization for single-trial EEG analysis2018
- Author(s)
  Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Int'l Joint Research
[Presentation] 半教師あり独立深層学習行列分析におけるデータ拡張に基づく音源モデル適応2018
- Author(s)
  牧島直輝，高宗典玄，高道慎之介，北村大地，猿渡洋，高橋祐，近藤多伸，中嶋広明
- Organizer
  日本音響学会2018年秋季研究発表会
[Presentation] ヘビーテイル生成モデルに基づく独立深層学習行列分析による多チャネル音源分離2018
- Author(s)
  牧島直輝, 最上伸一, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  信号処理シンポジウム
[Presentation] Construction of English-French Multimodal Affective Conversational Corpus from Drama TV Series2018
- Author(s)
  S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari, S. Nakamura
- Organizer
  LREC 2018
- Int'l Joint Research
[Presentation] Multi-modal Muti-task Deep Learning for Speaker and Emotion Recognition of TV-series Data2018
- Author(s)
  S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari, S. Nakamura
- Organizer
  Oriental COCOSDA 2018
- Int'l Joint Research
[Presentation] Japanese-English Code-Switching Speech Data Construction2018
- Author(s)
  S. Nakayama, T. Kano, Q.-T Do, S. Sakti, S. Nakamura
- Organizer
  Oriental COCOSDA 2018
- Int'l Joint Research
[Presentation] Single-trial Detection of Semantic Anomalies from EEG during Listening to Spoken Sentences2018
- Author(s)
  Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura
- Organizer
  International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2018)
- Int'l Joint Research
[Presentation] Compressing End-to-End ASR Networks by Tensor-Train Decomposition2018
- Author(s)
  T. Mori, A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  Interspeech 2018
- Int'l Joint Research
[Presentation] Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load2018
- Author(s)
  B. Wu, S. Sakti, S. Nakamura
- Organizer
  SLTU 2018
- Int'l Joint Research
[Presentation] Incremental TTS for Japanese Language2018
- Author(s)
  T. Yanagita, S. Sakti, S. Nakamura
- Organizer
  Interspeech 2018
- Int'l Joint Research
[Presentation] Machine Speech Chain with One-shot Speaker Adaptation2018
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  Interspeech 2018
- Int'l Joint Research
[Presentation] Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS2018
- Author(s)
  S. Nakayama, A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  IEEE SLT
- Int'l Joint Research
[Presentation] Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model2018
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  IEEE SLT
- Int'l Joint Research
[Presentation] Toward Multi-features Emphases Speech Translation: Assessment of Human Emphases Production and Perception with Speech and Text Clues2018
- Author(s)
  Q.-T. Do, S. Sakti, S. Nakamura
- Organizer
  IEEE SLT
- Int'l Joint Research
[Presentation] Using Spoken Word Posterior Features in Neural Machine Translation2018
- Author(s)
  K. Osamura, T. Kano, S. Sakti, S. Nakamura
- Organizer
  IWSLT 2018
- Int'l Joint Research
[Presentation] Multi-paraphrase Augmentation to Leverage Neural Caption Translation2018
- Author(s)
  J. Effendi, S. Sakti, K. Sudoh, S. Nakamura
- Organizer
  IWSLT 2018
- Int'l Joint Research
[Presentation] Toward Machine Speech Chain with Semi-supervised Learning by ASR-TTS coupling and Next Generation Speech-to-speech Translation2018
- Author(s)
  Satoshi Nakamura
- Organizer
  LISTEN Workshop/ Summer School
- Int'l Joint Research / Invited
[Presentation] Machine Speech Chain with Deep Learning2018
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  日本音響学会2018年秋季研究発表会
[Presentation] Multimodal Database of Negative Emotion Recovery in Dyadic Interactions: Construction and Analysis2018
- Author(s)
  Nurul Lubis, Michael Heck, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  日本音響学会2018年秋季研究発表会
[Presentation] 日英コードスイッチング音声データの構築2018
- Author(s)
  中山佐保子, ドクオックチュオン, サクティサクリアニ, 中村哲
- Organizer
  日本音響学会2018年秋季研究発表会
[Presentation] Visual Description Paraphrase Corpus Creation with Various Elementary Operations2018
- Author(s)
  Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
- Organizer
  日本音響学会2018年秋季研究発表会
[Presentation] Impact of deception information on negotiation dialog management: A case study on doctor-patient conversations2018
- Author(s)
  Nguyen The Tung, Koichiro Yoshino, Sakti Sakriani, Satoshi Nakamura
- Organizer
  International Workshop on Spoken Dialogue System Technology (IWSDS 2018)
- Int'l Joint Research
[Presentation] Utilizing deception information for dialog management of doctor-patient conversations2018
- Author(s)
  Tung The Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura
- Organizer
  第32回人工知能学会全国大会
[Presentation] 人物設定付き対話収集ツールの構築2018
- Author(s)
  杉山享志朗, 吉野幸一郎, 中村哲
- Organizer
  SIG-SLUD
[Presentation] Listening Skills Assessment through Computer Agents2018
- Author(s)
  Hiroki Tanaka, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura
- Organizer
  ACM International Conference on Multimodal Interaction (ICMI)
- Int'l Joint Research
[Presentation] Leveraging sequence-to-sequence speech synthesis for enhancing acoustic-to-word speech recognition.2018
- Author(s)
  M.Mimura, S.Ueno, H.Inaguma, S.Sakai, and T.Kawahara.
- Organizer
  IEEE Spoken Language Technology Workshop (SLT)
- Int'l Joint Research
[Presentation] Improving OOV detection and resolution with external language models in acoustic-to-word ASR.2018
- Author(s)
  H.Inaguma, M.Mimura, S.Sakai, and T.Kawahara.
- Organizer
  IEEE Spoken Language Technology Workshop (SLT)
- Int'l Joint Research
[Presentation] Human-like conversational robot.2018
- Author(s)
  T.Kawahara.
- Organizer
  APSIPA ASC
- Int'l Joint Research / Invited
[Presentation] Forward-backward attention decoder.2018
- Author(s)
  M.Mimura, S.Sakai, and T.Kawahara.
- Organizer
  INTERSPEECH
- Int'l Joint Research
[Presentation] Spoken dialogue system for a human-like conversational robot ERICA.2018
- Author(s)
  T.Kawahara.
- Organizer
  Int'l Workshop Spoken Dialogue Systems (IWSDS)
- Int'l Joint Research / Invited
[Presentation] Acoustic-to-word attention-based model complemented with character-level CTC-based model.2018
- Author(s)
  S.Ueno, H.Inaguma, M.Mimura, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Int'l Joint Research
[Presentation] Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization.2018
- Author(s)
  Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Int'l Joint Research
[Presentation] Unsupervised beamforming based on multichannel nonnegative matrix factorization for noisy speech recognition.2018
- Author(s)
  K.Shimada, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Int'l Joint Research
[Presentation] An end-to-end approach to joint social signal detection and automatic speech recognition.2018
- Author(s)
  H.Inaguma, M.Mimura, K.Inoue, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Int'l Joint Research
[Presentation] 音声認識の方法論の変遷と展望～Acoustic-to-Wordモデルを中心に～.2018
- Author(s)
  河原達也
- Organizer
  研究報告音声言語情報処理（SLP）
- Invited
[Presentation] End-to-End音声合成を用いた単語単位End-to-End音声認識のデータ拡張.2018
- Author(s)
  上乃聖, 三村正人, 坂井信輔, 河原達也.
- Organizer
  情報処理学会研究会SIG-SLP
[Presentation] アンドロイドERICAによる人間レベルの音声対話.2018
- Author(s)
  河原達也.
- Organizer
  人工知能学会研究会SIG-SLUD
- Invited
[Presentation] 独立低ランク行列分析を用いたフルランク空間共分散モデルに基づくブラインド音源分離2018
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2018年秋季研究発表会
[Presentation] A spoofing benchmark for the 2018 voice conversion challenge: leveraging from spoofing countermeasures for speech artifact assessment2018
- Author(s)
  T. Kinnunen, J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, Z. Ling
- Organizer
  Odyssey 2018
- Int'l Joint Research
[Presentation] The voice conversion challenge 2018: promoting development of parallel and nonparallel methods2018
- Author(s)
  J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, T. Kinnunen, Z. Ling
- Organizer
  Odyssey 2018
- Int'l Joint Research
[Presentation] The NU non-parallel voice conversion system for the voice conversion challenge 20182018
- Author(s)
  Y. Wu, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  Odyssey 2018
- Int'l Joint Research
[Presentation] NU voice conversion system for the voice conversion challenge 20182018
- Author(s)
  P.L. Tobing, Y. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  Odyssey 2018
- Int'l Joint Research
[Presentation] Collapsed segment detection and reduction for WaveNet vocoder2018
- Author(s)
  Y. Wu, K. Kobayashi, T. Hayashi, P.L. Tobing, T. Toda
- Organizer
  INTERSPEECH 2018
- Int'l Joint Research
[Presentation] An evaluation of deep spectral mappings and WaveNet vocoder for voice conversion2018
- Author(s)
  P.L. Tobing, T. Hayashi, Y. Wu, K. Kobayashi, T. Toda
- Organizer
  IEEE SLT 2018
- Int'l Joint Research
[Presentation] Prosody-aware subword embedding considering Japanese intonation systems and its application to DNN-based multi-dialect speech synthesis2018
- Author(s)
  Takanori Akiyama, Shinnosuke Takamichi, and Hiroshi Saruwatari
- Organizer
  APSIPA ASC
- Int'l Joint Research
[Presentation] コンピュータによる自動通訳を目指して2018
- Author(s)
  中村　哲
- Organizer
  日本通訳翻訳学会　第19回年次大会
- Invited
[Remarks] 科研費基盤(S): 次世代音声翻訳の研究
- URL
  https://ahcweb01.naist.jp/research/kakenhi-ngst/

2018 Fiscal Year Annual Research Report

Next generation speech translation research

Principal Investigator

中村 哲 奈良先端科学技術大学院大学, データ駆動型サイエンス創造センター, 教授 (30263429)

Current Status of Research Progress

Reason

Research Products

[Journal Article] Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception2019

Author(s)

Journal Title

DOI

[Journal Article] Electroencephalogram-Based Single Trial Detection of Language Expectation Violations in Listening to Speech2019

Author(s)

Journal Title

DOI

[Journal Article] Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation2018

Author(s)

Journal Title

DOI

[Journal Article] Quality Prediction of Synthesized Speech Based on Tensor Structured EEG Signals2018

Author(s)

Journal Title

DOI

[Journal Article] Bayesian multichannel audio source separation based on integrated source and spatial models.2018

Author(s)

Journal Title

DOI

[Journal Article] Voice Animator: Automatic Lip-Synching in Limited Animation by Audio2018

Author(s)

Journal Title

DOI

[Journal Article] Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition2018

Author(s)

Journal Title

DOI

[Journal Article] Sequence-to-Sequence Models for Emphasis Speech Translation2018

Author(s)

Journal Title

DOI

[Journal Article] Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling2018

Author(s)

Journal Title

DOI

[Journal Article] 音声翻訳システムにおける音声変換の利用2018

Author(s)

Journal Title

DOI

[Presentation] カリキュラムラーニングを用いた音声翻訳の学習戦略の提案2019

Author(s)

Organizer

[Presentation] Machine Speech Chainに基づく半教師あり学習を用いた日英コードスイッチング音声の認識2019

Author(s)

Organizer

[Presentation] Affect-sensitive Dialogue Response Generation for Positive Emotion Elicitation2019

Author(s)

Organizer

[Presentation] Enhancing Neural Machine Translation with Image-based Paraphrase Augmentation2019

Author(s)

Organizer

[Presentation] Speaker and Emotion Recognition of TV-Series Data Using Multimodal and Multitask Deep Learning2019

Author(s)

Organizer

[Presentation] Unifying Speech Recognition and Generation with Machine Speech Chain2019

Author(s)

Organizer

[Presentation] 英日同時通訳におけるニューラル機械翻訳の検討2019

Author(s)

Organizer

[Presentation] 単語分散表現を使った誤差によるニューラル機械翻訳の学習2019

Author(s)

Organizer

[Presentation] マルチソースニューラル機械翻訳における翻訳時の原言語欠落補完2019

Author(s)

Organizer

[Presentation] End-to-end Learning of Segmented Robot Behaviors and Descriptions2019

Author(s)

Organizer

[Presentation] 音声認識を用いた字幕作成システムの改良.2019

Author(s)

Organizer

中村哲奈良先端科学技術大学院大学, データ駆動型サイエンス創造センター, 教授 (30263429)