Next generation speech translation research

Research Project

Project/Area Number	17H06101
Research Category	Grant-in-Aid for Scientific Research (S)
Allocation Type	Single-year Grants
Research Field	Perceptual information processing
Research Institution	Nara Institute of Science and Technology
Principal Investigator	Nakamura Satoshi 奈良先端科学技術大学院大学, データ駆動型サイエンス創造センター, 教授 (30263429)
Co-Investigator(Kenkyū-buntansha)	河原達也京都大学, 情報学研究科, 教授 (00234104) 猿渡洋東京大学, 大学院情報理工学系研究科, 教授 (30324974) 戸田智基名古屋大学, 情報基盤センター, 教授 (90403328) 森島繁生早稲田大学, 理工学術院, 教授 (10200411) 高道慎之介東京大学, 大学院情報理工学系研究科, 助教 (90784330) 須藤克仁奈良先端科学技術大学院大学, 先端科学技術研究科, 准教授 (00396152) SAKTI Sakriani (サクリアニサクティ) 奈良先端科学技術大学院大学, 先端科学技術研究科, 特任准教授 (00395005) 吉野幸一郎奈良先端科学技術大学院大学, 先端科学技術研究科, 助教 (70760148) 田中宏季奈良先端科学技術大学院大学, 先端科学技術研究科, 助教 (10757834) 松本裕治奈良先端科学技術大学院大学, 先端科学技術研究科, 教授 (10211575)
Project Period (FY)	2017-05-31 – 2022-03-31
Project Status	Discontinued (Fiscal Year 2021)
Budget Amount *help	¥204,230,000 (Direct Cost: ¥157,100,000、Indirect Cost: ¥47,130,000) Fiscal Year 2021: ¥37,830,000 (Direct Cost: ¥29,100,000、Indirect Cost: ¥8,730,000) Fiscal Year 2020: ¥36,400,000 (Direct Cost: ¥28,000,000、Indirect Cost: ¥8,400,000) Fiscal Year 2019: ¥38,610,000 (Direct Cost: ¥29,700,000、Indirect Cost: ¥8,910,000) Fiscal Year 2018: ¥46,280,000 (Direct Cost: ¥35,600,000、Indirect Cost: ¥10,680,000) Fiscal Year 2017: ¥45,110,000 (Direct Cost: ¥34,700,000、Indirect Cost: ¥10,410,000)
Keywords	自動音声同時通訳 / 音声翻訳 / 逐次音声認識・音声合成 / 逐次機会翻訳 / パラ言語音声翻訳 / 音声顔画像翻訳
Outline of Final Research Achievements	With the goal to develop a speech translation system having an equal ability to human simultaneous interpreters, we conducted research on automatic incremental speech translation with consideration of sentence structure difference between languages, paralinguistic speech translation to extract, preserve and reproduce speaker’s emotion, emphasis, and individuality, as well as video caption translation by using lip sync for videos. Moreover, we created 330 hours corpora of JP-EN speech translation. As an achievement of the research, we established the basic technologies for the English-Japanese simultaneous speech translation, the paralinguistic speech translation, and the video caption translation, and built up a prototype containing these technologies.
Academic Significance and Societal Importance of the Research Achievements	日本の社会、企業の国際化、オンライン会議の急増により、外国人とのコミュニケーションが必要な場面が急増している。人間の同時通訳者のような自動通訳が実現できれば社会、経済活動が促進できる。音声から音声への同時通訳では、文字を読む必要がないだけでなく、感情や強調なども伝えることができるため、より自然なコミュニケーションができる。本研究では自動音声同時通訳、感情、顔表情を保持して翻訳する翻訳技術を開発し、このニーズに応えることができる。
Assessment Rating	Verification Result (Rating) A	Assessment Rating	Result (Rating) A: Progress in the research is steadily towards the initial goal. Expected research results are expected.

Report

(10 results)

2022 Research Progress Assessment (Verification Result) ( PDF )
2021 Final Research Report ( PDF )
2020 Annual Research Report Abstract(Research Progress Assessment) ( PDF ) Research Progress Assessment (Result) ( PDF )
2019 Annual Research Report
2018 Annual Research Report
2017 Abstract ( PDF ) Comments on the Screening Results ( PDF ) Annual Research Report

Research Products
(291 results)

All 2021 2020 2019 2018 2017 2016 Other

All Journal Article (48 results) (of which Int'l Joint Research: 38 results, Peer Reviewed: 47 results, Open Access: 35 results, Acknowledgement Compliant: 1 results) Presentation (239 results) (of which Int'l Joint Research: 127 results, Invited: 15 results) Book (1 results) Remarks (2 results) Funded Workshop (1 results)

[Journal Article] Acoustic model-based subword tokenization and prosodic-context extraction without language knowledge for text-to-speech synthesis2021
- Author(s)
  Masashi Aso, Shinnosuke Takamichi, Norihiro Takamune, and Hiroshi Saruwatari
- Journal Title
  
  Elsevier Speech Communication
  
  Volume: 125 Pages: 53-60
- DOI
  10.1016/j.specom.2020.09.003
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Perceptual-similarity-aware deep speaker representation learning for multi-speaker generative modeling2021
- Author(s)
  Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 1033-1048
- DOI
  10.1109/taslp.2021.3059114
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Pretraining techniques for sequenceto-sequence voice conversion2021
- Author(s)
  Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 745-755
- DOI
  10.1109/taslp.2021.3049336
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Quasi-periodic parallel WaveGAN: a non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network2021
- Author(s)
  Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 792-806
- DOI
  10.1109/taslp.2021.3051765
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Quasi-periodic WaveNet: an autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network2021
- Author(s)
  Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 1134-1148
- DOI
  10.1109/taslp.2021.3061245
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] End-to-End Image-to-Speech Generation for Untranscribed Unknown Languages2021
- Author(s)
  Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE Access
  
  Volume: 9 Pages: 55144-55154
- DOI
  10.1109/access.2021.3071541
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] 単言語話者のための日英コードスイッチング音声の認識と翻訳2021
- Author(s)
  中山佐保子、サクティサクリアニ、中村哲
- Journal Title
  
  情報処理学会論文誌
  
  Volume: Vol.62 No.3 Pages: 903-914
- NAID
  120007089951
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Journal Article] ReMOT: A Model-agnostic Refinement for Multiple Object Tracking2021
- Author(s)
  Fan Yang, Xin Chang, Sakriani Sakti, Yang Wu, Satoshi Nakamura
- Journal Title
  
  Image and Vision Computing
  
  Volume: 106 Pages: 1-9
- DOI
  10.1016/j.imavis.2020.104091
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Selective Attention Measurement of Experienced Simultaneous Interpreters using EEG Phase-locked Response2021
- Author(s)
  Haruko Yagura, Hiroki Tanaka, Taiki Kinoshita, Hiroki Watanabe, Shunnosuke Motomura, Katsuhito Sudoh, Satoshi Nakamura
- Journal Title
  
  Frontiers in Human Neuroscience
  
  Volume: -
- DOI
  10.3389/fnhum.2021.581525
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation2020
- Author(s)
  Johanes Effendi, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E103.D Issue: 3 Pages: 674-683
- DOI
  10.1587/transinf.2019EDP7065
- NAID
  130007804146
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2020-03-01
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Recurrent Neural Network Compression Based on Low-Rank Tensor Representation2020
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E103.D Issue: 2 Pages: 435-449
- DOI
  10.1587/transinf.2019EDP7040
- NAID
  130007793590
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2020-02-01
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Dialog Management of Healthcare Consulting System by Utilizing Deceptive Information2020
- Author(s)
  The Tung Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  Transactions of the Japanese Society for Artificial Intelligence
  
  Volume: 35 Issue: 1 Pages: DSI-C_1-12
- DOI
  10.1527/tjsai.DSI-C
- NAID
  130007779287
- ISSN
  1346-0714, 1346-8030
- Year and Date
  2020-01-01
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Phase Reconstruction from Amplitude Spectrograms Based on Directional-Statistics Deep Neural Networks2020
- Author(s)
  Shinnosuke Takamichi, Yuki Saito, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari
- Journal Title
  
  Elsevier Signal Processing
  
  Volume: 169 Pages: 1-12
- DOI
  10.1016/j.sigpro.2019.107368
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Blind Speech Extraction Based on Rank-Constrained Spatial Covariance Matrix Estimation With Multivariate Generalized Gaussian Distribution2020
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, and Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 1948-1968
- DOI
  10.1109/taslp.2020.3003165
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Independent deeply learned matrix analysis with automatic selection of stable microphone-wise update and fast sourcewise update of demixing matrix2020
- Author(s)
  Naoki Makishima, Yoshiki Mitsui, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo
- Journal Title
  
  Signal Processing
  
  Volume: 178 Pages: 1-12
- DOI
  10.1016/j.sigpro.2020.107753
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Non-parallel voice conversion system with WaveNet vocoder and collapsed speech suppression2020
- Author(s)
  Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
- Journal Title
  
  IEEE Access
  
  Volume: 8 Pages: 62094-62106
- DOI
  10.1109/access.2020.2984007
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder2020
- Author(s)
  Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
- Journal Title
  
  APSIPA Transactions on Signal and Information Processing
  
  Volume: 9 Issue: 1 Pages: 1-14
- DOI
  10.1017/atsip.2020.24
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Tackling Perception Bias in Unsupervised Phoneme Discovery Using DPGMM-RNN Hybrid Model and Functional Load2020
- Author(s)
  Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: Vol. 29 Pages: 348-362
- DOI
  10.1109/taslp.2020.3042016
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval2020
- Author(s)
  Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE Transaction on Multimedia
  
  Volume: - Pages: 1-1
- DOI
  10.1109/tmm.2020.3009476
- NAID
  120006900505
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] End-to-end Speech Translation with Transcoding by Multi-task Learning for Distant Language Pairs2020
- Author(s)
  Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech and Language Processing
  
  Volume: Vol: 28, No. 1 Pages: 1342-1355
- DOI
  10.1109/taslp.2020.2986886
- NAID
  120006847154
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Machine Speech Chain2020
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: － Pages: 976-989
- DOI
  10.1109/taslp.2020.2977776
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Multi-Source Neural Machine Translation with Missing Data2020
- Author(s)
  Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: Vol. 28 Pages: 569-580
- DOI
  10.1109/taslp.2019.2959224
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Real-time Rendering of Layered Materials with Anisotropic Normal Distributions2020
- Author(s)
  Yamaguchi Tomoya, Yatagawa Tatsuya, Tokuyoshi Yusuke, Morishima Shigeo
- Journal Title
  
  Computational Visual Media
  
  Volume: 6 Issue: 1 Pages: 29-36
- DOI
  10.1007/s41095-019-0154-z
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Analysis of Conversational Listening Skills toward Agent-based Social Skills Training2020
- Author(s)
  Hiroki Tanaka, Hidemi Iwasaka, Hideki Negoro, Satoshi Nakamura
- Journal Title
  
  Journal on Multimodal User Interfaces
  
  Volume: 14 Issue: 1 Pages: 73-82
- DOI
  10.1007/s12193-019-00313-y
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Cross-lingual transfer learning of non-native acoustic modeling for pronunciation error detection and diagnosis2020
- Author(s)
  R.Duan, T.Kawahara, M.Dantsuji, and H.Nanjo
- Journal Title
  
  IEEE/ACM Trans. Audio, Speech & Language Process
  
  Volume: Vol.28, No.1 Pages: 391-401
- DOI
  10.1109/taslp.2019.2955858
- NAID
  120006817577
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception2019
- Author(s)
  Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E102.D Issue: 2 Pages: 383-391
- DOI
  10.1587/transinf.2018EDP7293
- NAID
  130007586194
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2019-02-01
- Related Report
  2019 Annual Research Report 2018 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Synchronization between overt speech envelope and EEG oscillations during imagined speech2019
- Author(s)
  Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  Neuroscience Research
  
  Volume: Volume 153 Pages: 48-55
- DOI
  10.1016/j.neures.2019.04.004
- NAID
  120006847155
- Related Report
  2020 Annual Research Report 2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Voice conversion with CycleRNN-based spectral mapping and finely-tuned WaveNet vocoder2019
- Author(s)
  P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Journal Title
  
  IEEE Access
  
  Volume: Vol. 7, No. 1 Pages: 171114-171125
- DOI
  10.1109/access.2019.2955978
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Towards Machine Speech-to-speech Translation2019
- Author(s)
  Satoshi Nakamura, Katsuhito Sudoh, Sakriani Sakti
- Journal Title
  
  Interpreting Technologies, Revista Tradumatica
  
  Volume: No.17 Issue: 17 Pages: 81-87
- DOI
  10.5565/rev/tradumatica.238
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation2019
- Author(s)
  Makishima Naoki、Mogami Shinichi、Takamune Norihiro、Kitamura Daichi、Sumino Hayato、Takamichi Shinnosuke、Saruwatari Hiroshi、Ono Nobutaka
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 27 Issue: 10 Pages: 1601-1615
- DOI
  10.1109/taslp.2019.2925450
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] End-to-End Speech Recognition Sequence Training with Reinforcement Learning2019
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE Access
  
  Volume: Volume: 7 Pages: 79758-79769
- DOI
  10.1109/access.2019.2922617
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Positive Emotion Elicitation in Chat-Based Dialogue Systems2019
- Author(s)
  Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech and Language Processing
  
  Volume: Volume: 27, Issue: 4 Issue: 4 Pages: 866-877
- DOI
  10.1109/taslp.2019.2900910
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Real‐time Indirect Illumination of Emissive Inhomogeneous Volumes using Layered Polygonal Area Lights2019
- Author(s)
  Kuge Takahiro、Yatagawa Tatsuya、Morishima Shigeo
- Journal Title
  
  Computer Graphics Forum
  
  Volume: 38 Issue: 7 Pages: 449-460
- DOI
  10.1111/cgf.13851
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Electroencephalogram-Based Single Trial Detection of Language Expectation Violations in Listening to Speech2019
- Author(s)
  Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakti Sakriani, Satoshi Nakamura
- Journal Title
  
  Frontiers in Computational Neuroscience
  
  Volume: 13 Pages: 1-11
- DOI
  10.3389/fncom.2019.00015
- Related Report
  2019 Annual Research Report 2018 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Semi-supervised multichannel speech enhancement with a deep speech prior2019
- Author(s)
  K.Sekiguchi, Y.Bando, A.A.Nugraha, K.Yoshii, and T.Kawahara
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: Vol.27, No.12 Issue: 12 Pages: 2197-2212
- DOI
  10.1109/taslp.2019.2944348
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition2019
- Author(s)
  K.Shimada, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, and T.Kawahara
- Journal Title
  
  IEEE/ACM Trans. Audio, Speech & Language Processing
  
  Volume: 27 Issue: 5 Pages: 960-971
- DOI
  10.1109/taslp.2019.2907015
- NAID
  120006621539
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Application of voice conversion to speech-to-speech translation2018
- Author(s)
  高道慎之介, 戸田智基
- Journal Title
  
  THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN
  
  Volume: 74 Issue: 9 Pages: 535-538
- DOI
  10.20697/jasj.74.9_535
- NAID
  130007606922
- ISSN
  0369-4232, 2432-2040
- Year and Date
  2018-09-01
- Related Report
  2018 Annual Research Report
- Open Access
[Journal Article] Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition2018
- Author(s)
  Nurul Lubis, Dessi Lestari, Sakriani Sakti, Ayu Purwarianti, and Satoshi Nakamura
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E101.D Issue: 8 Pages: 2092-2100
- DOI
  10.1587/transinf.2017EDP7362
- NAID
  130007429560
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2018-08-01
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation2018
- Author(s)
  Daichi Kitamura, Shinichi Mogami, Yoshiki Mitsui, Norihiro Takamune, Hiroshi Saruwatari, Nobutaka Ono, Yu Takahashi, and Kazunobu Kondo
- Journal Title
  
  EURASIP Journal on Advances in Signal Processing
  
  Volume: - Issue: 1 Pages: 1-28
- DOI
  10.1186/s13634-018-0549-5
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Quality Prediction of Synthesized Speech Based on Tensor Structured EEG Signals2018
- Author(s)
  Hayato Maki, Sakriani Sakti, Hiroki Tanaka, Satoshi Nakamura
- Journal Title
  
  PloS One
  
  Volume: 13 Issue: 6 Pages: 1-13
- DOI
  10.1371/journal.pone.0193521
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Bayesian multichannel audio source separation based on integrated source and spatial models.2018
- Author(s)
  K.Itakura, Y.Bando, E.Nakamura, K.Itoyama, K.Yoshii, and T.Kawahara
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 26 Issue: 4 Pages: 831-846
- DOI
  10.1109/taslp.2017.2789320
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Voice Animator: Automatic Lip-Synching in Limited Animation by Audio2018
- Author(s)
  Shoichi Furukawa, Tsukasa Fukusato, Shugo Yamaguchi, Shigeo Morishima
- Journal Title
  
  ?Lecture Notes in Computer Science?book series (LNCS, volume 10714)
  
  Volume: 10714 Pages: 153-171
- DOI
  10.1007/978-3-319-76270-8_12
- ISBN
  9783319762692, 9783319762708
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Sequence-to-Sequence Models for Emphasis Speech Translation2018
- Author(s)
  Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 26 Issue: 10 Pages: 1873-1883
- DOI
  10.1109/taslp.2018.2846402
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling2018
- Author(s)
  Michael Heck, Sakriani Sakti, Satoshi Nakamura
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 26 Issue: 11 Pages: 2027-2042
- DOI
  10.1109/taslp.2018.2852500
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks2018
- Author(s)
  Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processin
  
  Volume: 26 Issue: 1 Pages: 84-96
- DOI
  10.1109/taslp.2017.2761547
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Detecting Dementia through Interactive Computer Avatars2017
- Author(s)
  Hiroki Tanaka, Hiroyoshi Adachi, Norimichi Ukita, Manabu Ikeda, Hiroaki Kazui, Takashi Kudo, Satoshi Nakamura
- Journal Title
  
  IEEE Journal of Translational Engineering in Health and Medicine,
  
  Volume: 5 Pages: 1-11
- DOI
  10.1109/jtehm.2017.2752152
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Articulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning2017
- Author(s)
  R.Duan, T.Kawahara, M.Dantsuji, and J.Zhang
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E100.D Issue: 9 Pages: 2174-2182
- DOI
  10.1587/transinf.2017EDP7019
- NAID
  130006038443
- ISSN
  0916-8532, 1745-1361
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Preserving Word-level Emphasis in Speech-to-speech Translation2016
- Author(s)
  Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti and Satoshi Nakamura
- Journal Title
  
  IEEE Transactions on Audio, Speech and Language Processing
  
  Volume: vol. 25 no.3 Issue: 3 Pages: 544-556
- DOI
  10.1109/taslp.2016.2643280
- NAID
  120006226308
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
[Presentation] ランク制約付き空間共分散行列推定法における補助関数法に基づく雑音欠落ランク空間基底に対する新しい更新則2021
- Author(s)
  近藤祐斗, 久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] スタガードモデル化三重対角型共分散行列を用いた独立半正定値テンソル分析によるブラインド音源分離2021
- Author(s)
  近藤樹、高宗典玄、北村大地、猿渡洋、池下林太郎、中谷智広
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 経験ベイズ独立深層学習行列分析による多チャネル音源分離2021
- Author(s)
  蓮実拓也，中村友彦，高宗典玄，猿渡洋，北村大地，高橋祐，近藤多伸
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 独立深層学習テンソル分析に基づく多チャネル音源分離2021
- Author(s)
  成澤直輝，池下林太郎，高宗典玄，北村大地，中村友彦，猿渡洋，中谷智広
- Organizer
  日本音響学会2021春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 大規模言語モデルによる未観測文の生成機構を持つEnd-to-Endインクリメンタル音声合成2021
- Author(s)
  佐伯高明，高道慎之介，猿渡洋
- Organizer
  音声研究会 (SP)
- Related Report
  2020 Annual Research Report
[Presentation] テキスト音声合成のためのポストフィルタ用WaveNetボコーダの学習条件に関する評価2021
- Author(s)
  安原和輝, Yi-Chiao Wu, Patrick Lumban Tobing, 松永悟行, 大谷大和, 戸田智基
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Transformer-based Direct Speech-to-speech Translation with Transcoder2021
- Author(s)
  Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE Spoken Language Technology Workshop
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Incorporating Discriminative DPGMM Posteriorgrams for Low-resource ASR2021
- Author(s)
  Bin Wu, Sakriani Sakti and Satoshi Nakamura
- Organizer
  IEEE Spoken Language Technology Workshop
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 言語情報とパラ言語情報を考慮したニューラル音声翻訳2021
- Author(s)
  徳山太顕, Sakriani Sakti, 須藤克仁, 中村哲
- Organizer
  言語処理学会第27回年次大会(NLP2021)
- Related Report
  2020 Annual Research Report
[Presentation] マルチリンガルマシーンスピーチチェーンを用いたゼロショットコードスイッチングの音声認識と音声合成2021
- Author(s)
  中山佐保子, チャンドラアンドロス, サクティサクリアニ, 中村哲
- Organizer
  日本音響学会 2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] MFCC-DPGMM Features for Enhancing Low-Resource ASR2021
- Author(s)
  Bin Wu, Sakriani Sakti and Satoshi Nakamura
- Organizer
  The 2021 Spring meeting of the Acoustical Society of Japan
- Related Report
  2020 Annual Research Report
[Presentation] Real-time Neural Machine Speech Chain2021
- Author(s)
  Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, and Satoshi Nakamura
- Organizer
  The 2021 Spring meeting of the Acoustical Society of Japan
- Related Report
  2020 Annual Research Report
[Presentation] Improving ASR with Multimodal Machine Chain2021
- Author(s)
  Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  The 2021 Spring meeting of the Acoustical Society of Japan
- Related Report
  2020 Annual Research Report
[Presentation] Positional Encoding への摂動付与による長さ制御を用いた非自己回帰型機械翻訳のための知識蒸留2021
- Author(s)
  岡佑依, 須藤克仁, 中村哲
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Annual Research Report
[Presentation] 音声認識仮説の曖昧性を考慮するMulti-task End-to-End音声翻訳2021
- Author(s)
  胡尤佳, 須藤克仁, Sakriani Sakti, 中村哲
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Annual Research Report
[Presentation] 文法誤り訂正モデルのエラー分析に基づく疑似データ生成の効果検証2021
- Author(s)
  土肥康輔, 須藤克仁, 中村哲
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Annual Research Report
[Presentation] 文脈文アノテーションによるドキュメント機械翻訳の精度向上に関する研究2021
- Author(s)
  安本玄樹, 須藤克仁, 中村哲
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Annual Research Report
[Presentation] 人手書き起こしの知識を用いた音声認識誤りに頑健な機械翻訳2021
- Author(s)
  福田りょう, 須藤克仁, 中村哲
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Annual Research Report
[Presentation] 分割統治的ニューラル機械翻訳2021
- Author(s)
  加納保昌, 須藤克仁, 中村哲
- Organizer
  言語処理学会第27回年次大会
- Related Report
  2020 Annual Research Report
[Presentation] CTC-synchronous training for monotonic attention model2020
- Author(s)
  H.Inaguma, M.Mimura, and T.Kawahara
- Organizer
  INTERSPEECH 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Distilling the knowledge of BERT for sequence-to-sequence ASR2020
- Author(s)
  H.Futami, H.Inaguma, S.Ueno, M.Mimura, S.Sakai, and T.Kawahara
- Organizer
  INTERSPEECH 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] End-to-end text-to-speech synthesis with unaligned multiple language units based on attention2020
- Author(s)
  Masashi Aso, Shinnosuke Takamichi, and Hiroshi Saruwatari
- Organizer
  Interspeech 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 音源分離のための周波数間相関を考慮した多変量複素Gauss分布に基づく深層学習による分散共分散行列推定の検討2020
- Author(s)
  成澤直輝，高宗典玄，北村大地，中村友彦，猿渡洋
- Organizer
  日本音響学会2020秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Cross-lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space2020
- Author(s)
  Detai Xin, Yuki Saito, Shinnosuke Takamichi, Tomoki Koriyama, and Hiroshi Saruwatari
- Organizer
  Interspeech 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] A cyclical post-filtering approach to mismatch refinement of neural vocoder for text-to-speech systems2020
- Author(s)
  Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Cyclic spectral modeling for unsupervised unit discovery into voice conversion with excitation and waveform modeling2020
- Author(s)
  Patrick Lumban Tobing, Tomoki Hayashi, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Voice Conversion Challenge 2020: Intra-lingual semi-parallel and crosslingual voice conversion2020
- Author(s)
  Zhao Yi, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhenhua Ling, Tomoki Toda
- Organizer
  Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Predictions of subjective ratings and spoofing assessments of Voice Conversion Challenge 2020 submissions2020
- Author(s)
  Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhenhua Ling, Junichi Yamagishi, Zhao Yi, Xiaohai Tian, Tomoki Toda
- Organizer
  Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Baseline system of Voice Conversion Challenge 2020 with cyclic variational autoencoder and parallel WaveGAN2020
- Author(s)
  Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Toda
- Organizer
  Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] The sequence-to-sequence baseline for the Voice Conversion Challenge 2020: cascading ASR and TTS2020
- Author(s)
  Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda
- Organizer
  Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] The NU voice conversion system for the Voice Conversion Challenge 2020: on the effectiveness of sequence-tosequence models and autoregressive neural vocoders2020
- Author(s)
  Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda
- Organizer
  Joint workshop for the Blizzard Challenge and Voice Conversion Challenge 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Song2Face: Synthesizing Singing Facial Animation from Audio2020
- Author(s)
  Shohei Iwase, Takuya Kato, Shugo Yamaguchi, Yukitaka Tsuchiya, Shigeo Morishima
- Organizer
  SIGGRAPH Asia 2020 Technical Communications
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Style Controllable Facial Animation Synthesis from SInging Audio2020
- Author(s)
  Shohei Iwase, Takuya Kato, Shugo Yamaguchi, Yukitaka Tsuchiya, Shigeo Morishima
- Organizer
  Visual Computer 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Do We Need Sound for Sound Source Localization?2020
- Author(s)
  Takashi Oya, Shohei Iwase, Ryota Natsume, Takahiro Itazuri, Shugo Yamaguchi, Shigeo Morishima
- Organizer
  Asian Conference on Computer Vision (ACCV), 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation2020
- Author(s)
  Mayuko Okamoto, Sakriani Sakti and Satoshi Nakamura
- Organizer
  Oriental COCOSDA
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time2020
- Author(s)
  Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Augmenting Images for ASR and TTS throughSingle-loop and Dual-loop Multimodal Chain Framework2020
- Author(s)
  Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Neural Speech Completion2020
- Author(s)
  Kazuki Tsunematsu, Johanes Effendi, Sakriani Sakti, and Satoshi Nakamura
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge2020
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura.
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units2020
- Author(s)
  Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux
- Organizer
  INTERSPEECH
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation2020
- Author(s)
  Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Yang Wu, Sakriani Sakti, and Satoshi Nakamura
- Organizer
  BMTT MOTChallenge Workshop of CVPR
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis2020
- Author(s)
  Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 文の構造を考慮した適切な韻律の音声合成2020
- Author(s)
  国広有衣子, サクティサクリアニ, 須藤克仁, 中村哲
- Organizer
  第133回音声言語情報処理研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 音声の破損により失った文字情報を復元する音声認識2020
- Author(s)
  東佑樹，Sakriani Sakti，中村哲
- Organizer
  第133回音声言語情報処理研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Combining Audio and Brain Activity for Predicting Speech Quality2020
- Author(s)
  Ivan Halim Parmonangan
- Organizer
  Interspeech
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Linguistic Features during Speech Utterances in the Context of Social Skills Training2020
- Author(s)
  Hiroki Tanaka
- Organizer
  EMBC
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Analysis of selective attention processing on experienced simultaneous interpreters using EEG phase synchronization2020
- Author(s)
  Haruko Yagura
- Organizer
  EMBC
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Sequential Attention-based Detection of Semantic Incongrueties from EEG While Listening to Speech2020
- Author(s)
  Shunnosuke Motomura
- Organizer
  EMBC
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Incorporating Noisy Length Constraints into Transformer with Length-aware Positional Encodings2020
- Author(s)
  Yui Oka, Katsuki Chousa, Katsuhito Sudoh and Satoshi Nakamura
- Organizer
  the 28th International Conference on Computational Linguistics(COLING 2020
- Related Report
  2020 Annual Research Report
[Presentation] NAIST's Machine Translation Systems for IWSLT 2020 Conversational Speech Translation Task2020
- Author(s)
  Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura
- Organizer
  the 17th International Conference on Spoken Language Translation (IWSLT)
- Related Report
  2020 Annual Research Report
[Presentation] 漸進的な音声認識・機械翻訳・テキスト音声合成に基づく音声から音声への同時翻訳2020
- Author(s)
  中村哲，Novitasari Sashi，帖佐克己，柳田智也，二又航介，須藤克仁，Sakti Sakriani
- Organizer
  日本音響学会2020春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 様々な合成単位におけるEnd-to-end 逐次音声合成の検討2020
- Author(s)
  柳田智也, サクティサクリアニ, 中村哲
- Organizer
  日本音響学会2020春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 独立深層学習行列分析におけるマイクロホン毎及び音源毎の座標降下法に基づく分離行列更新法の周波数別自動選択法2020
- Author(s)
  牧島直輝，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会2020春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 音響モデル尤度に基づくsubword 分割の韻律推定精度における評価2020
- Author(s)
  阿曽真至，高道慎之介，高宗典玄，猿渡洋
- Organizer
  日本音響学会2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] Neural Incremental Speech Recognition Through Attention Transfer2020
- Author(s)
  Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning2020
- Author(s)
  Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] Neural Machine Translation Improvement by Acoustic Embedding2020
- Author(s)
  叶高朋, サクティサクリアニ, 中村哲
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] Speech-to-Speech Translation without Text2020
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] Positional Encoding出力長制御を用いた英日ニューラル機械翻訳の検討2020
- Author(s)
  岡佑依, 帖佐克己, 須藤克仁, 中村哲
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 教師なし機械翻訳に基づく話し言葉翻訳へのドメイン適応の検討2020
- Author(s)
  福田りょう, 須藤克仁, 中村哲
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 漸進的な音声認識・機械翻訳・テキスト音声合成に基づく音声から音声への同時翻訳2020
- Author(s)
  須藤克仁, Sashi Novitasari, 帖佐克己, 柳田智也, 二又航介, Sakriani Sakti, 中村哲
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 英日同時通訳システムのための疑似同時通訳コーパス自動生成手法の提案2020
- Author(s)
  二又航介, 須藤克仁, 中村哲
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 自動字幕作成システムにおけるモデルの拡張2020
- Author(s)
  秋田祐哉, 上乃聖, 三村正人, 河原達也
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] ストリーミング注意機構型sequence-to-sequenceモデルによる講演音声認識2020
- Author(s)
  稲熊寛文, 三村正人, 河原達也
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] Neural Machine Translation with Acoustic Embedding2019
- Author(s)
  Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Zero-shot Code-switching ASR and TTS with Multilingual Machine Speech Chain2019
- Author(s)
  Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Listening while Speaking: Improving ASR through Multimodal Chain2019
- Author(s)
  Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Speech-to-speech Translation between Untranscribed Unknown Languages2019
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Dialogue Model and Response Generation for Emotion Improvement Elicitation2019
- Author(s)
  Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  the 3rd Conversational AI workshop - NeurIPS 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Hierarchical Tensor Fusion Network for Deception Handling Negotiation Dialog Model2019
- Author(s)
  Nguyen The Tung, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura
- Organizer
  the 3rd Conversational AI workshop - NeurIPS 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Investigation of shallow WaveNet vocoder with Laplacian distribution output2019
- Author(s)
  P.L. Tobing, T. Hayashi, T. Toda
- Organizer
  IEEE Workshop Automatic Speech Recognition & Understanding (ASRU)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Multilingual end-to-end speech translation2019
- Author(s)
  H.Inaguma, K.Duh, T.Kawahara, and S.Watanabe
- Organizer
  IEEE Workshop Automatic Speech Recognition & Understanding (ASRU)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Acceleration of rank-constrained spatial covariance matrix estimation for blind speech extraction2019
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari
- Organizer
  APSIPA Annual Summit and Conference 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Robust demixing filter update algorithm based on microphone-wise coordinate descent for independent deeply learned matrix analysis2019
- Author(s)
  Naoki Makishima, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo
- Organizer
  APSIPA Annual Summit and Conference 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Recent Advances in Speech Processing and Machine Translation Research at NAIST2019
- Author(s)
  Satoshi Nakamura
- Organizer
  International Conference on Artificial Intelligence and Speech Technology (AIST 2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Recognition and Translation of Code-switching Speech Utterances2019
- Author(s)
  Sahoko Nakayama, Takatomo Kano, Andros Tjandra, Sakriani Sakti, and Satoshi Nakamura
- Organizer
  Oriental COCOSDA 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization2019
- Author(s)
  Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Hao Li, Angjoo Kanazawa
- Organizer
  ICCV 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Automatic Sign Dance Synthesis from Gesture-based Sign Language2019
- Author(s)
  Naoya Iwamoto, Hubert P. H. Shum, Wakana Asahina, Shigeo Morishima
- Organizer
  MIG 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Phoneme Level Speaking Rate Variation on Waveform Generation using GAN-TTS2019
- Author(s)
  Mayuko Okamoto, Sakriani Sakti, and Satoshi Nakamura
- Organizer
  Oriental COCOSDA 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Detecting Dementia from Face in Human-Agent Interaction2019
- Author(s)
  Hiroki Tanaka, Hiroyoshi Adachi, Hiroaki Kazui, Manabu Ikeda, Takashi Kudo, Satoshi Nakamura
- Organizer
  Adjunct of the 2019 International Conference on Multimodal Interaction (ICMI)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Detecting Syntactic Violations from Single-trial EEG using Recurrent Neural Networks2019
- Author(s)
  Shunnosuke Motomura, Hiroki Tanaka, Satoshi Nakamura
- Organizer
  Adjunct of the 2019 International Conference on Multimodal Interaction (ICMI)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Measuring Affective Sharing between Two People by EEG Hyperscanning2019
- Author(s)
  Taiki Kinoshita, Hiroki Tanaka, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  Adjunct of the 2019 International Conference on Multimodal Interaction (ICMI)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Sequence-to-sequence Learning via Attention Transfer for Incremental Speech Recognition2019
- Author(s)
  Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  Interspeech 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 20192019
- Author(s)
  Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizou Li, Satoshi Nakamura
- Organizer
  Interspeech 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] The Zero Resource Speech Challenge 2019: TTS Without T2019
- Author(s)
  Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux
- Organizer
  Interspeech2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework2019
- Author(s)
  Tomoya Yanagita, Sakriani Sakti and Satoshi Nakamura
- Organizer
  SSW
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Speech Quality Evaluation of Synthesized Japanese Speech Using EEG2019
- Author(s)
  Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura
- Organizer
  Interspeech 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Refined WaveNet vocoder for variational autoencoder based voice conversion2019
- Author(s)
  W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang
- Organizer
  27th European Signal Processing Conference (EUSIPCO2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Non-parallel voice conversion with cyclic variational autoencoder2019
- Author(s)
  P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  Interspeech 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Improving variational autoencoder based voice conversion by conditioning on F0 and fully convolutional networks2019
- Author(s)
  W.-C. Huang, Y.-C. Wu, C.-C. Lo, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang
- Organizer
  Interspeech 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Subword tokenization based on DNN-based acoustic model for end-to-end prosody generation2019
- Author(s)
  Masashi Aso, Shinnosuke Takamichi, Norihiro Takamune and Hiroshi Saruwatari
- Organizer
  The 10th ISCA SSW
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Column-wise update algorithm for independent deeply learned matrix analysis2019
- Author(s)
  Naoki Makishima, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo, Hiroaki Nakajima
- Organizer
  International Congress on Acoustics (ICA2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Efficient full-rank spatial covariance estimation using independent low-rank matrix analysis for blind source separation2019
- Author(s)
  Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari
- Organizer
  27th European Signal Processing Conference (EUSIPCO2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] EEG Analysis towards Evaluating Synthesized Speech Quality2019
- Author(s)
  Ivan Halim Parmonangan, Hiroki Tanaka, Sakti Sakriani, Shinnosuke Takamichi, Satoshi Nakamura
- Organizer
  IEEE Engineering in Medicine and Biology Society
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Generalized-Gaussian-distribution-based independent deeply learned matrix analysis for multichannel audio source separation2019
- Author(s)
  Naoki Makishima, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo, and Hiroaki Nakajima
- Organizer
  International Congress and Exhibition on Noise Control Engineering (INTERNOISE2019)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] SiCloPe: Silhouette-Based Clothed People2019
- Author(s)
  Ryota Natsume, Shunsuke Saito, Zeng Huang, Weikai Chen, Chongyang Ma, Hao Li, Shigeo Morishima
- Organizer
  CVPR 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Cross-lingual speech-based ToBI label generation using bidirectional LSTM2019
- Author(s)
  Marco Vetter, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] End-to-end feedback loss in speech chain framework via straight-through estimator2019
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition2019
- Author(s)
  Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura
- Organizer
  IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Multi-speaker sequence-to-sequence speech synthesis for data augmentation in acoustic-to-word speech recognition2019
- Author(s)
  S.Ueno, M.Mimura, S.Sakai, and T.Kawahara
- Organizer
  IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Transfer learning of language-independent end-to-end ASR with language model fusion2019
- Author(s)
  H.Inaguma, J.Cho, M.K.Baskar, T.Kawahara, and S.Watanabe
- Organizer
  IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Voice conversion with cyclic recurrent neural network and fine-tuned WaveNet vocoder2019
- Author(s)
  P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] GPU Smoke Simulation on Compressed DCT Space2019
- Author(s)
  Daichi Ishida, Ryoichi Ando, Shigeo Morishima
- Organizer
  Eurographics 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] 階層的Tensor Fusion を用いた交渉対話における嘘検出2019
- Author(s)
  Nguyen The Tung, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura
- Organizer
  SIG-SLUD
- Related Report
  2019 Annual Research Report
[Presentation] 脳波による聴覚定常反応を用いた同時通訳中の認知負荷の検証2019
- Author(s)
  矢倉晴子, 田中宏季, 木下泰輝, 渡部宏樹, 本村駿乃介, 須藤克仁, 中村哲
- Organizer
  聴覚研究会資料
- Related Report
  2019 Annual Research Report
[Presentation] ブラインド音声抽出のための多変量複素一般化Gauss 分布に基づくランク制約付き空間共分散行列推定法及びその高速化2019
- Author(s)
  久保優騎高宗典玄北村大地猿渡洋
- Organizer
  信学技報
- Related Report
  2019 Annual Research Report
[Presentation] 脳波信号の2名同時計測による感情共有の測定2019
- Author(s)
  木下泰輝, 田中宏季, 吉野幸一郎, 中村哲
- Organizer
  第9回社会神経科学研究会
- Related Report
  2019 Annual Research Report
[Presentation] 漸進的な音声認識・機械翻訳・テキスト音声合成に基づく音声から音声への同時翻訳2019
- Author(s)
  Sashi Novitasari, 帖佐克己, 柳田智也, 二又航介, 須藤克仁, Sakriani Sakti, 中村哲
- Organizer
  情報処理学会第242回自然言語処理研究会
- Related Report
  2019 Annual Research Report
[Presentation] End-to-End型テキスト音声合成におけるWaveNetボコーダの学習に関する調査2019
- Author(s)
  安原和輝, 林知樹, 戸田智基
- Organizer
  音講論
- Related Report
  2019 Annual Research Report
[Presentation] ランク制約付き空間共分散モデル推定法の逆行列展開による高速化2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2019秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 独立深層学習行列分析におけるマイクロホン毎の座標降下法に基づく分離行列更新2019
- Author(s)
  牧島直輝，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸
- Organizer
  日本音響学会2019秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] スタイル変換技術による対訳コーパスから同時通訳コーパスへの拡張2019
- Author(s)
  二又航介, 須藤　克仁, 中村哲
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 英日同時翻訳のためのConnectionist Temporal Classificationを用いたニューラル機械翻訳2019
- Author(s)
  帖佐克己, 須藤克仁, 中村哲
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 言語横断な言語モデルによる原言語情報を活用した機械翻訳評価2019
- Author(s)
  高橋洸丞, 須藤克仁, 中村哲
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] End-to-End型テキスト音声合成におけるWaveNetボコーダの学習についての調査2019
- Author(s)
  安原和輝, 林知樹, 戸田智基
- Organizer
  信学技報
- Related Report
  2019 Annual Research Report
[Presentation] 入力音声に続く文章の予測2019
- Author(s)
  恒松和輝, サクリアニサクティ, 中村哲
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 授業アーカイブの翻訳字幕自動作成システムの試作2019
- Author(s)
  須藤克仁，林輝昭，西村優汰，中村哲
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] ニューラルネットワークによる単一試行脳波信号を用いた音声文中の統語誤り検出2019
- Author(s)
  本村駿乃介, 田中宏季, 中村哲
- Organizer
  電子情報通信学会技術研究報告 NC/IBISML
- Related Report
  2019 Annual Research Report
[Presentation] 音素単位で話速制御を行う GAN-TT2019
- Author(s)
  岡本真由子, サクリアニサクティ, 中村哲
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 民話を対象としたアイヌ語音声コーパスとend-to-end音声認識2019
- Author(s)
  松浦孝平, 上乃聖, 三村正人, 坂井信輔, 河原達也
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] 音声波形を入力とする単語単位end-to-end音声認識2019
- Author(s)
  上乃聖, 三村正人, 坂井信輔, 河原達也
- Organizer
  情報処理学会研究報告
- Related Report
  2019 Annual Research Report
[Presentation] Machine Speech Chain for Lifelong Learning2019
- Author(s)
  Satoshi Nakamura
- Organizer
  Life Long Learning for Spoken Language Systems Workshop
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Semi-supervised Learning by Machine Speech Chain for Multilingual2019
- Author(s)
  Satoshi Nakamura, Sakriani Sakti and Katsuhito Sudoh
- Organizer
  The International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] End-to-end approach to ASR, TTS and Speech Translation2019
- Author(s)
  Satoshi Nakamura
- Organizer
  Task Force on Speech, Dialogue and Auditory Processing of CCF, China Computer Federation
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] 自動音声翻訳から自動音声通訳へ2019
- Author(s)
  中村　哲
- Organizer
  第123回音楽情報科学・第127回音声言語情報処理合同研究発表会
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] Toward Automatic Speech Interpretation2019
- Author(s)
  Satoshi Nakamura
- Organizer
  CLI9
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] カリキュラムラーニングを用いた音声翻訳の学習戦略の提案2019
- Author(s)
  叶高朋, Sakriani Sakti, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] Machine Speech Chainに基づく半教師あり学習を用いた日英コードスイッチング音声の認識2019
- Author(s)
  中山佐保子, Andros Tjandra, Sakriani Sakti, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] Affect-sensitive Dialogue Response Generation for Positive Emotion Elicitation2019
- Author(s)
  Nurul Lubis, Sakriani Sakti, Koichiro Yoshino and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] Enhancing Neural Machine Translation with Image-based Paraphrase Augmentation2019
- Author(s)
  Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] Speaker and Emotion Recognition of TV-Series Data Using Multimodal and Multitask Deep Learning2019
- Author(s)
  Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Lestari and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] Unifying Speech Recognition and Generation with Machine Speech Chain2019
- Author(s)
  Andros Tjandra, Sakriani Sakti and Satoshi Nakamura
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] 英日同時通訳におけるニューラル機械翻訳の検討2019
- Author(s)
  帖佐克己, 須藤克仁, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] 単語分散表現を使った誤差によるニューラル機械翻訳の学習2019
- Author(s)
  帖佐克己, 須藤克仁, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] マルチソースニューラル機械翻訳における翻訳時の原言語欠落補完2019
- Author(s)
  西村優汰, 須藤克仁, Graham Neubig, 中村哲
- Organizer
  言語処理学会第25回年次大会（NLP2019)
- Related Report
  2018 Annual Research Report
[Presentation] End-to-end Learning of Segmented Robot Behaviors and Descriptions2019
- Author(s)
  Kohei Wakimoto, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  SIG-SLUD
- Related Report
  2018 Annual Research Report
[Presentation] 音声認識を用いた字幕作成システムの改良.2019
- Author(s)
  秋田祐哉, 上乃聖, 三村正人, 河原達也.
- Organizer
  情報処理学会研究会SIG-AAC
- Related Report
  2018 Annual Research Report
[Presentation] 時変複素一般化ガウス分布に基づく独立深層学習行列分析2019
- Author(s)
  牧島直輝，高宗典玄，北村大地，猿渡洋，高橋祐，近藤多伸，中嶋広明
- Organizer
  日本音響学会2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] 教師あり及び半教師あり条件下における独立深層学習行列分析の実験的評価2019
- Author(s)
  牧島直輝, 最上伸一, 高宗典玄, 高道慎之介, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, and 中嶋広明
- Organizer
  日本音響学会2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] 乗算型更新式に基づくランク制約付き空間共分散モデルの推定2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] ブラインド音源分離における多変量複素Student’s t 分布に基づくランク制約付き空間共分散モデルの推定2019
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  2018年3月度応用音響研究会
- Related Report
  2018 Annual Research Report
[Presentation] Reducing mismatch of WaveNet vocoder for variational autoencoder based voice conversion2019
- Author(s)
  W.-C. Huang, Y.-C. Wu, H.-T. Hwang, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda, Y. Tsao, H.-M. Wang
- Organizer
  日本音響学会2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Voice conversion with cyclic recurrent neural network for WaveNet fine-tuning2019
- Author(s)
  P.L. Tobing, Y.-C. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  日本音響学会2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Independent deeply learned matrix analysis for multichannel audio source separation2018
- Author(s)
  Shinichi Mogami, Hayato Sumino, Daichi Kitamura, Norihiro Takamune, Shinnosuke Takamichi and Hiroshi Saruwatari
- Organizer
  European Signal Processing Conference (EUSIPCO)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Vectorwise Coordinate Descent Algorithm for Spatially Regularized Independent Low-Rank Matrix Analysis2018
- Author(s)
  Yoshiki Mitsui, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, and Kazunobu Kondo
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Sequence-to-Sequence ASR Optimization via Reinforcement Learning2018
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Graph regularized tensor factorization for single-trial EEG analysis2018
- Author(s)
  Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Organizer
  2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] 半教師あり独立深層学習行列分析におけるデータ拡張に基づく音源モデル適応2018
- Author(s)
  牧島直輝，高宗典玄，高道慎之介，北村大地，猿渡洋，高橋祐，近藤多伸，中嶋広明
- Organizer
  日本音響学会2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] ヘビーテイル生成モデルに基づく独立深層学習行列分析による多チャネル音源分離2018
- Author(s)
  牧島直輝, 最上伸一, 高宗典玄, 北村大地, 猿渡洋, 高橋祐, 近藤多伸, 中嶋広明
- Organizer
  信号処理シンポジウム
- Related Report
  2018 Annual Research Report
[Presentation] Construction of English-French Multimodal Affective Conversational Corpus from Drama TV Series2018
- Author(s)
  S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari, S. Nakamura
- Organizer
  LREC 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Multi-modal Muti-task Deep Learning for Speaker and Emotion Recognition of TV-series Data2018
- Author(s)
  S. Novitasari, Q.-T. Do, S. Sakti, D. Lestari, S. Nakamura
- Organizer
  Oriental COCOSDA 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Japanese-English Code-Switching Speech Data Construction2018
- Author(s)
  S. Nakayama, T. Kano, Q.-T Do, S. Sakti, S. Nakamura
- Organizer
  Oriental COCOSDA 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Single-trial Detection of Semantic Anomalies from EEG during Listening to Spoken Sentences2018
- Author(s)
  Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura
- Organizer
  International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Compressing End-to-End ASR Networks by Tensor-Train Decomposition2018
- Author(s)
  T. Mori, A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  Interspeech 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load2018
- Author(s)
  B. Wu, S. Sakti, S. Nakamura
- Organizer
  SLTU 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Incremental TTS for Japanese Language2018
- Author(s)
  T. Yanagita, S. Sakti, S. Nakamura
- Organizer
  Interspeech 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Machine Speech Chain with One-shot Speaker Adaptation2018
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  Interspeech 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS2018
- Author(s)
  S. Nakayama, A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  IEEE SLT
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model2018
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  IEEE SLT
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Toward Multi-features Emphases Speech Translation: Assessment of Human Emphases Production and Perception with Speech and Text Clues2018
- Author(s)
  Q.-T. Do, S. Sakti, S. Nakamura
- Organizer
  IEEE SLT
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Using Spoken Word Posterior Features in Neural Machine Translation2018
- Author(s)
  K. Osamura, T. Kano, S. Sakti, S. Nakamura
- Organizer
  IWSLT 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Multi-paraphrase Augmentation to Leverage Neural Caption Translation2018
- Author(s)
  J. Effendi, S. Sakti, K. Sudoh, S. Nakamura
- Organizer
  IWSLT 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Toward Machine Speech Chain with Semi-supervised Learning by ASR-TTS coupling and Next Generation Speech-to-speech Translation2018
- Author(s)
  Satoshi Nakamura
- Organizer
  LISTEN Workshop/ Summer School
- Related Report
  2018 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Machine Speech Chain with Deep Learning2018
- Author(s)
  Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
- Organizer
  日本音響学会2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Multimodal Database of Negative Emotion Recovery in Dyadic Interactions: Construction and Analysis2018
- Author(s)
  Nurul Lubis, Michael Heck, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  日本音響学会2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] 日英コードスイッチング音声データの構築2018
- Author(s)
  中山佐保子, ドクオックチュオン, サクティサクリアニ, 中村哲
- Organizer
  日本音響学会2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Visual Description Paraphrase Corpus Creation with Various Elementary Operations2018
- Author(s)
  Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
- Organizer
  日本音響学会2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Impact of deception information on negotiation dialog management: A case study on doctor-patient conversations2018
- Author(s)
  Nguyen The Tung, Koichiro Yoshino, Sakti Sakriani, Satoshi Nakamura
- Organizer
  International Workshop on Spoken Dialogue System Technology (IWSDS 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Utilizing deception information for dialog management of doctor-patient conversations2018
- Author(s)
  Tung The Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura
- Organizer
  第32回人工知能学会全国大会
- Related Report
  2018 Annual Research Report
[Presentation] 人物設定付き対話収集ツールの構築2018
- Author(s)
  杉山享志朗, 吉野幸一郎, 中村哲
- Organizer
  SIG-SLUD
- Related Report
  2018 Annual Research Report
[Presentation] Listening Skills Assessment through Computer Agents2018
- Author(s)
  Hiroki Tanaka, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura
- Organizer
  ACM International Conference on Multimodal Interaction (ICMI)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Leveraging sequence-to-sequence speech synthesis for enhancing acoustic-to-word speech recognition.2018
- Author(s)
  M.Mimura, S.Ueno, H.Inaguma, S.Sakai, and T.Kawahara.
- Organizer
  IEEE Spoken Language Technology Workshop (SLT)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Improving OOV detection and resolution with external language models in acoustic-to-word ASR.2018
- Author(s)
  H.Inaguma, M.Mimura, S.Sakai, and T.Kawahara.
- Organizer
  IEEE Spoken Language Technology Workshop (SLT)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Human-like conversational robot.2018
- Author(s)
  T.Kawahara.
- Organizer
  APSIPA ASC
- Related Report
  2018 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Forward-backward attention decoder.2018
- Author(s)
  M.Mimura, S.Sakai, and T.Kawahara.
- Organizer
  INTERSPEECH
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Spoken dialogue system for a human-like conversational robot ERICA.2018
- Author(s)
  T.Kawahara.
- Organizer
  Int'l Workshop Spoken Dialogue Systems (IWSDS)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Acoustic-to-word attention-based model complemented with character-level CTC-based model.2018
- Author(s)
  S.Ueno, H.Inaguma, M.Mimura, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization.2018
- Author(s)
  Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Unsupervised beamforming based on multichannel nonnegative matrix factorization for noisy speech recognition.2018
- Author(s)
  K.Shimada, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] An end-to-end approach to joint social signal detection and automatic speech recognition.2018
- Author(s)
  H.Inaguma, M.Mimura, K.Inoue, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE-ICASSP
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] 音声認識の方法論の変遷と展望～Acoustic-to-Wordモデルを中心に～.2018
- Author(s)
  河原達也
- Organizer
  研究報告音声言語情報処理（SLP）
- Related Report
  2018 Annual Research Report
- Invited
[Presentation] End-to-End音声合成を用いた単語単位End-to-End音声認識のデータ拡張.2018
- Author(s)
  上乃聖, 三村正人, 坂井信輔, 河原達也.
- Organizer
  情報処理学会研究会SIG-SLP
- Related Report
  2018 Annual Research Report
[Presentation] アンドロイドERICAによる人間レベルの音声対話.2018
- Author(s)
  河原達也.
- Organizer
  人工知能学会研究会SIG-SLUD
- Related Report
  2018 Annual Research Report
- Invited
[Presentation] 独立低ランク行列分析を用いたフルランク空間共分散モデルに基づくブラインド音源分離2018
- Author(s)
  久保優騎, 高宗典玄, 北村大地, 猿渡洋
- Organizer
  日本音響学会2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] A spoofing benchmark for the 2018 voice conversion challenge: leveraging from spoofing countermeasures for speech artifact assessment2018
- Author(s)
  T. Kinnunen, J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, Z. Ling
- Organizer
  Odyssey 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] The voice conversion challenge 2018: promoting development of parallel and nonparallel methods2018
- Author(s)
  J. Lorenzo-Trueba, J. Yamagishi, T. Toda, D. Saito, F. Villavicencio, T. Kinnunen, Z. Ling
- Organizer
  Odyssey 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] The NU non-parallel voice conversion system for the voice conversion challenge 20182018
- Author(s)
  Y. Wu, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  Odyssey 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] NU voice conversion system for the voice conversion challenge 20182018
- Author(s)
  P.L. Tobing, Y. Wu, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  Odyssey 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Collapsed segment detection and reduction for WaveNet vocoder2018
- Author(s)
  Y. Wu, K. Kobayashi, T. Hayashi, P.L. Tobing, T. Toda
- Organizer
  INTERSPEECH 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] An evaluation of deep spectral mappings and WaveNet vocoder for voice conversion2018
- Author(s)
  P.L. Tobing, T. Hayashi, Y. Wu, K. Kobayashi, T. Toda
- Organizer
  IEEE SLT 2018
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Prosody-aware subword embedding considering Japanese intonation systems and its application to DNN-based multi-dialect speech synthesis2018
- Author(s)
  Takanori Akiyama, Shinnosuke Takamichi, and Hiroshi Saruwatari
- Organizer
  APSIPA ASC
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] コンピュータによる自動通訳を目指して2018
- Author(s)
  中村　哲
- Organizer
  日本通訳翻訳学会　第19回年次大会
- Related Report
  2018 Annual Research Report
- Invited
[Presentation] 独立深層学習行列分析に基づく多チャネル音源分離2018
- Author(s)
  角野隼斗, 北村大地, 高宗典玄, 高道慎之介, 猿渡洋, 小野順貴
- Organizer
  日本音響学会 2018年春季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Detecting Suppression of Negative Emotion by Time Series Change of Cerebral Blood Flow using fNIRS2018
- Author(s)
  Masahiro Honda, Hiroki Tanaka , Sakti Sakriani, Satoshi Nakamura
- Organizer
  IEEE International Conference on Biomedical and Health Informatics (BHI)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Distilling Knowledge from a Multi-scale deep CNN Ensemble for Robust and Light-weight Acoustic Modeling2018
- Author(s)
  Michael Heck, Masayuki Suzuki, Takashi Fukuda, Gakuto Kurata, Satoshi Nakamura
- Organizer
  第120回音声言語情報処理研究会 (IPSJ SIG-SLP)
- Related Report
  2017 Annual Research Report
[Presentation] 音声認識単語仮説の曖昧性を考慮するニューラル機械翻訳2018
- Author(s)
  長村佳歩，叶高朋，SakrianiSakti，須藤克仁，中村哲
- Organizer
  言語処理学会第24回年次大会(NLP2018)
- Related Report
  2017 Annual Research Report
[Presentation] 原言語側の欠落を考慮したMulti-Source NMT2018
- Author(s)
  西村優汰, 須藤克仁, 中村哲
- Organizer
  言語処理学会第24回年次大会(NLP2018)
- Related Report
  2017 Annual Research Report
[Presentation] エージェントによる非定型質問への応答からの認知症検出2018
- Author(s)
  宇城毅犠，田中宏季，足立浩祥，數井裕光，池田学，工藤喬，中村哲
- Organizer
  IPSJ SIG
- Related Report
  2017 Annual Research Report
[Presentation] EEGを用いた合成音声に対する体感品質予想2018
- Author(s)
  真木勇人, Sakriani Sakti, 田中宏季, 中村哲
- Organizer
  電子情報通信学会MEとバイオサイバネティックス研究会（MBE）
- Related Report
  2017 Annual Research Report
[Presentation] 電極配置のグラフ構造を利用したテンソル分解による単一試行EEG解析2018
- Author(s)
  真木勇人, 田中宏季, Sakriani Sakti, 中村哲
- Organizer
  電子情報通信学会MEとバイオサイバネティックス研究会（MBE）
- Related Report
  2017 Annual Research Report
[Presentation] 生体信号からの感情コンピューティングと自閉症支援2018
- Author(s)
  田中宏季, 寺澤直人, 本田将大, 真木勇人, サクリアニサクティ
- Organizer
  第13回日本感性工学会春季大会
- Related Report
  2017 Annual Research Report
[Presentation] マルチチャネル非負値行列因子分解に基づくビームフォーミングを用いた雑音環境下音声認識.2018
- Author(s)
  島田一希, 坂東宜昭, 三村正人, 糸山克寿, 吉井和佳, 河原達也.
- Organizer
  電子情報通信学会SP
- Related Report
  2017 Annual Research Report
[Presentation] CTCによる文字単位のモデルを併用したattentionによる単語単位の end-to-end音声認識.2018
- Author(s)
  上乃聖, 稲熊寛文, 三村正人, 河原達也.
- Organizer
  情報処理学会SIG-SLP
- Related Report
  2017 Annual Research Report
[Presentation] 独立深層学習行列分析に基づく多チャネル音源分離の実験的評価2018
- Author(s)
  北村大地, 角野隼斗, 高宗典玄, 高道慎之介, 猿渡洋, 小野順貴
- Organizer
  電子情報通信学会技術研究報告音声研究会 (SP)
- Related Report
  2017 Annual Research Report
[Presentation] Development of NU voice conversion system 20182018
- Author(s)
  P.L. Tobing, Y.-C. Wu，T. Hayashi，K. Kobayashi，T. Toda
- Organizer
  電子情報通信学会技術研究報告音声研究会 (SP)
- Related Report
  2017 Annual Research Report
[Presentation] Development of NU non-parallel voice conversion system 20182018
- Author(s)
  Y.-C. Wu, P.L. Tobing, T. Hayashi, K. Kobayashi, T. Toda
- Organizer
  電子情報通信学会技術研究報告音声研究会 (SP)
- Related Report
  2017 Annual Research Report
[Presentation] Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation2017
- Author(s)
  Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
- Organizer
  Interspeech 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Toward Expressive Speech Translation: A Unified Seq-to-Seq LSTMs Approach for Translating Words and Emphasis2017
- Author(s)
  Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura
- Organizer
  Interspeech 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Subject-independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response during Speech Perception2017
- Author(s)
  Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamurau
- Organizer
  Interspeech 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Ensembles of Multi-scale VGG Acoustic Models2017
- Author(s)
  Michael Heck, Masayuki Suzuki, Takashi Fukuda, Gakuto Kurata, Satoshi Nakamura
- Organizer
  Interspeech 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Recognizing Emotionally Coloured Dialogue Speech using Speaker-Adapted DNN-CNN Bottleneck Features2017
- Author(s)
  K. Mukaihara, S. Sakti, S. Nakamura
- Organizer
  SPECOM 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Feature Optimized DPGMM Clustering for Unsupervised Subword Modeling: A Contribution to ZEROSPEECH 20172017
- Author(s)
  M. Heck, S. Sakti, S. Nakamura
- Organizer
  ASRU 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Attention-based Wav2Text with Feature Transfer2017
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  ASRU 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Listening while Speaking: Speech Chain by Deep Learning2017
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  ASRU 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Local Monotonic Attention Mechanism for End-to-end Speech and Language Processing2017
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  IJCNLP 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Neural Machine Translation via Binary Code Prediction2017
- Author(s)
  Yusuke Oda, Philip Arthur, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura
- Organizer
  55th Annual Meeting of the Association for Computational Linguistics (ACL) (Long Papers)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] End-to-end Speech Recognition with Local Monotonic Attention2017
- Author(s)
  A. Tjandra, S. Sakti, S. Nakamura
- Organizer
  NIPS Workshop
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] 日本語インクリメンタル音声合成システム実装のための言語特徴の検討2017
- Author(s)
  柳田智也, S. Sakti, 中村哲
- Organizer
  情報処理学会音声言語情報処理研究会
- Related Report
  2017 Annual Research Report
[Presentation] テンソルトレイン分解によるEnd-to-End自動音声認識モデルの圧縮2017
- Author(s)
  森巧磨. Andros Tjandra, Sakriani Sakti, 中村哲
- Organizer
  情報処理学会自然言語処理研究会
- Related Report
  2017 Annual Research Report
[Presentation] Tracking Liking State in Brain Activity while Watching Multiple Movies2017
- Author(s)
  Naoto Terasawa, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Organizer
  19th ACM International Conference on Multimodal Interaction (ICMI'17)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] EEG-based Emotional State Tracking during Watching Movie considering Self-Assessment Manikin2017
- Author(s)
  Naoto Terasawa, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura
- Organizer
  39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC2017)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Creation of a Multi-paraphrase Corpus based on Various Elementary Operations2017
- Author(s)
  Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
- Organizer
  Oriental COCOSDA 2017
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] 音声文聴取時における意味違反が生じた際の脳波自動判別2017
- Author(s)
  田中宏季, 渡部宏樹, 真木勇人, Sakriani Sakti, 中村哲
- Organizer
  電子情報通信学会技術研究報告ヒューマン情報処理研究会 (HIP)
- Related Report
  2017 Annual Research Report
[Presentation] Knowledge Distillation from a Multi-scale VGG Ensemble for Acoustic Modeling2017
- Author(s)
  Michael Heck, Masayuki Suzuki, Takashi Fukuda, Gakuto Kurata and Satoshi Nakamura
- Organizer
  日本音響学会2017年秋季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Dialogue Modeling for Eliciting Positive Emotion2017
- Author(s)
  Nurul Lubis, Sakriani Sakti, Koichiro Yoshino and Satoshi Nakamura
- Organizer
  日本音響学会2017年秋季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Joint Translation of Words and Emphasis in Speech-to-Speech Translation using Sequence-to-Sequence Models2017
- Author(s)
  Quoc Truong Do, Sakriani Sakti and Satoshi Nakamura
- Organizer
  日本音響学会2017年秋季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] カリキュラムラーニングを用いた日英直接翻訳システムの提案2017
- Author(s)
  叶高朋, Sakriani Sakti, 中村哲
- Organizer
  日本音響学会2017年秋季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] Tensor Train based RNN Compression for Polyphonic Music Modelling2017
- Author(s)
  Andros Tjandra, Sakriani Sakti and Satoshi Nakamura
- Organizer
  日本音響学会2017年秋季研究発表会
- Related Report
  2017 Annual Research Report
[Presentation] 音声翻訳研究のこれから2017
- Author(s)
  中村哲
- Organizer
  日本音響学会2017年秋季研究発表会
- Related Report
  2017 Annual Research Report
- Invited
[Presentation] Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks.2017
- Author(s)
  M.Mimura, S.Sakai, and T.Kawahara
- Organizer
  IEEE Workshop Automatic Speech Recognition & Understanding (ASRU)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Automatic meeting transcription system for the Japanese Parliament (Diet).2017
- Author(s)
  T.Kawahara
- Organizer
  APSIPA ASC
- Related Report
  2017 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Modeling difficulties of second language learners using speech technology.2017
- Author(s)
  T.Kawahara
- Organizer
  Seoul International Conference on Speech Sciences (SICSS)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Semi-blind speech enhancement based on recurrent neural network for source separation and dereverberation.2017
- Author(s)
  M.Wake, Y.Bando, M.Mimura, K.Itoyama, K.Yoshii, and T.Kawahara.
- Organizer
  IEEE Machine Learning for Signal Processing Workshop (MLSP)
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Combined multi-channel NMF-based robust beamforming for noisy speech recognition.2017
- Author(s)
  M.Mimura, Y.Bando, K.Shimada, S.Sakai, K.Yoshii, and T.Kawahara.
- Organizer
  INTERSPEECH
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] Social signal detection in spontaneous dialogue using bidirectional LSTM-CTC.2017
- Author(s)
  H.Inaguma, K.Inoue, M.Mimura, and T.Kawahara.
- Organizer
  INTERSPEECH
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
[Presentation] 雑音環境下音声認識のための多チャネル非負値行列因子分解に基づく教師なしビームフォーマ.2017
- Author(s)
  島田一希, 坂東宜昭, 三村正人, 糸山克寿, 吉井和佳, 河原達也.
- Organizer
  電子情報通信学会SP
- Related Report
  2017 Annual Research Report
[Presentation] 再帰型ニューラルネットワークを用いたセミブラインド音声分離・強調.2017
- Author(s)
  和気雅弥, 坂東宜昭, 三村正人, 糸山克寿, 吉井和佳, 河原達也.
- Organizer
  電子情報通信学会SP
- Related Report
  2017 Annual Research Report
[Presentation] 深層生成モデルを事前分布に用いた教師なし音声強調.2017
- Author(s)
  坂東宜昭, 三村正人, 糸山克寿, 吉井和佳, 河原達也.
- Organizer
  電子情報通信学会SP
- Related Report
  2017 Annual Research Report
[Presentation] End-to-endモデルによるsocial signals検出および音声認識との統合.2017
- Author(s)
  稲熊寛文, 井上昂治, 三村正人, 河原達也.
- Organizer
  情報処理学会SIG-SLP
- Related Report
  2017 Annual Research Report
[Book] 次世代音声言語研究シンポジウム2019講演資料集2019
- Author(s)
  中村哲, 須藤克仁, Sakriani Sakti, 田中宏季, 河原達也, 猿渡洋, 森島繁生, 戸田智基, 高道慎之介, Graham Neubig, Alex Waibel, 松下佳世, 山田優
- Total Pages
  195
- Publisher
  －
- Related Report
  2019 Annual Research Report
[Remarks] 科研費基盤(S): 次世代音声翻訳の研究
- URL
  https://ahcweb01.naist.jp/research/kakenhi-ngst/
- Related Report
  2019 Annual Research Report 2018 Annual Research Report
[Remarks] 次世代音声言語研究シンポジウム2019
- URL
  https://ahcweb01.naist.jp/s2s-symposium-2019/
- Related Report
  2019 Annual Research Report
[Funded Workshop] Symposium on Next Generation Spoken Language Research 20192019
- Related Report
  2019 Annual Research Report

Next generation speech translation research

Principal Investigator

Nakamura Satoshi 奈良先端科学技術大学院大学, データ駆動型サイエンス創造センター, 教授 (30263429)

¥204,230,000 (Direct Cost: ¥157,100,000、Indirect Cost: ¥47,130,000)

Verification Result (Rating)

Result (Rating)

Report

Research Products

[Journal Article] Acoustic model-based subword tokenization and prosodic-context extraction without language knowledge for text-to-speech synthesis2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Perceptual-similarity-aware deep speaker representation learning for multi-speaker generative modeling2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Pretraining techniques for sequenceto-sequence voice conversion2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Quasi-periodic parallel WaveGAN: a non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Quasi-periodic WaveNet: an autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] End-to-End Image-to-Speech Generation for Untranscribed Unknown Languages2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] 単言語話者のための日英コードスイッチング音声の認識と翻訳2021

Author(s)

Journal Title

NAID

Related Report

[Journal Article] ReMOT: A Model-agnostic Refinement for Multiple Object Tracking2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Selective Attention Measurement of Experienced Simultaneous Interpreters using EEG Phase-locked Response2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation2020

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Recurrent Neural Network Compression Based on Low-Rank Tensor Representation2020

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Dialog Management of Healthcare Consulting System by Utilizing Deceptive Information2020

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Phase Reconstruction from Amplitude Spectrograms Based on Directional-Statistics Deep Neural Networks2020

Author(s)

Journal Title