• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Developing Low-Resource Multilingual Machine Speech Chain for Breaking Language Barriers

Research Project

Project/Area Number 23K21681
Project/Area Number (Other) 21H03467 (2021-2023)
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeMulti-year Fund (2024)
Single-year Grants (2021-2023)
Section一般
Review Section Basic Section 61010:Perceptual information processing-related
Research InstitutionNara Institute of Science and Technology (2024)
Japan Advanced Institute of Science and Technology (2021-2023)

Principal Investigator

SAKTI Sakriani  奈良先端科学技術大学院大学, 先端科学技術研究科, 教授 (00395005)

Co-Investigator(Kenkyū-buntansha) 中村 哲  奈良先端科学技術大学院大学, 先端科学技術研究科, 教授 (30263429)
Project Period (FY) 2024-04-01 – 2026-03-31
Project Status Granted (Fiscal Year 2024)
Budget Amount *help
¥17,160,000 (Direct Cost: ¥13,200,000、Indirect Cost: ¥3,960,000)
Fiscal Year 2025: ¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000)
Fiscal Year 2024: ¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000)
Fiscal Year 2023: ¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000)
Fiscal Year 2022: ¥3,250,000 (Direct Cost: ¥2,500,000、Indirect Cost: ¥750,000)
Fiscal Year 2021: ¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000)
Keywords低資源音声技術 / 多言語音声認識 / 多言語音声合成 / 音声翻訳 / Machine Speech Chain
Outline of Research at the Start

海外からの居住者および観光客との言葉の壁は深刻な問題となっている。いくつかの音声翻訳サービスが実用化されているが、高精度の翻訳性能を実現するために、広範な音声と対応する書き起こしデータを使用する教師あり学習ディープラーニングに基づいた音声翻訳の開発が必須である。一方、人間は機械学習のように大量のデータを使わなくとも、日常生活において自然に言語を習得できる。本研究では、人間の言語習得プロセス、特にSpeech Chain メカニズムに基づいて、多言語の言語習得のための新しいディープラーニングの教師なしおよび半教師あり学習メカニズムを提案する。

Outline of Annual Research Achievements

危機管理のグローバル化や、大型国際イベントの開催などにより、海外からの居住者および観光客との言葉の壁は深刻な問題となっている。いくつかの音声翻訳サービスが実用化されているが、高精度の翻訳性能を実現するためには、広範な音声と対応する書き起こしデータを使用する教師あり学習によるディープラーニングに基づいた音声翻訳の開発が必須である。
本研究では、人間の言語習得プロセス、特にSpeech Chainメカニズムに基づき、多言語の言語習得のための新しいディープラーニングの教師なしおよび半教師あり学習メカニズムを提案する。以下の課題を構成して取り組む。
課題1:人間の言語処理および認知に関する文献調査および検証、課題2:リソースの少ない言語の音声およびテキストデータの収集、課題3:多言語Machine Speech Chainフレームワークの開発、「話しながら聞いて多言語を学ぶ」を実行する(オフライン半教師あり学習、課題4:多言語Machine Speech Chainフレームワークの改善、リアルタイム学習(オフラインとオンライン学習)を実行する、課題5:多言語Machine Speech Chainフレームワークの改善、自己Lifelong学習(オンライン学習)を実行する、課題6:多言語Machine Speech Chainフレームワーク内に機械翻訳を組み込む、課題7:音声翻訳のため、多言語Machine Speech Chainフレームワークを開発し、「話しながら聞いて翻訳する」を実行する(オフライン半教師あり学習とオンライン自己Lifelong学習)
令和6年度末までに、課題1-6を完了し、多言語Machine Speech Chainフレームワーク内への機械翻訳の統合を達成した。これまでに、招待(基調)講演6件、学術論文4編、査読付き国際会議論文13件、国内会議論文12件を発表した.

Current Status of Research Progress
Current Status of Research Progress

2: Research has progressed on the whole more than it was originally planned.

Reason

令和6年度末までに、計画どおり、課題1-6:多言語Machine Speech Chainフレームワークへの機械翻訳機能の統合を完了した。具体的には、Machine Speech Chainフレームワークに着想を得たSimultaneous Speech TranslationのためのContrastive Feedback Mechanismを提案し、トップ国際会議であるINTERSPEECHで発表した。
さらに、課題7:「話しながら聞いて翻訳する」を実現するためのフレームワークの高度化(オフライン半教師あり学習とオンライン自己Lifelong学習)に着手し、**Gradient Episodic Memoryを用いたMachine Speech Chainにおける継続学習(Continual Learning)**を提案し、O-COCOSDAで発表した。
しかし、異なる多言語にまたがる大量のデータをカバーするには、アノテーション付き音声データの不足により依然として困難が残る。そのため、未知の未翻訳言語に対応するためにビジュアルグラウンディングモデルを活用した手法をさらに強化し、IEEE Access誌に論文を発表した。
また、多言語システムの研究を支えるため、インドネシアの大学およびベトナムの研究機関との連携も継続しており、関連する研究成果を国際会議に投稿した。

Strategy for Future Research Activity

令和7年度では、以下の課題に取り組む。
課題4:多言語Machine Speech Chainフレームワークの改善とリアルタイム学習(オフラインおよびオンライン学習)の開発に関して、インクリメンタルMachine Speech Chainの実験を継続します。特に、Machine Speech Chainの性能を向上させ、対応可能な言語の範囲をさらに拡大することを目指する。
課題5:多言語Machine Speech Chainフレームワークの改善と自己Lifelong学習(オンライン学習)に関して、実験を継続する。
課題6:多言語Machine Speech Chainフレームワーク内に機械翻訳を統合する。
課題7:「話しながら聞いて翻訳する」を実現するために、音声翻訳に対応したMachine Speech Chainフレームワークを高度化します(オフライン半教師あり学習およびオンライン自己Lifelong学習)。
課題7に特に重点を置きつつ、課題4-6;7の強化も引き続き進めていく。

Report

(4 results)
  • 2024 Research-status Report
  • 2023 Annual Research Report
  • 2022 Annual Research Report
  • 2021 Annual Research Report
  • Research Products

    (121 results)

All 2025 2024 2023 2022 2021 Other

All Int'l Joint Research (5 results) Journal Article (42 results) (of which Int'l Joint Research: 7 results,  Peer Reviewed: 31 results,  Open Access: 21 results) Presentation (74 results) (of which Int'l Joint Research: 40 results,  Invited: 20 results)

  • [Int'l Joint Research] Bandung Institute of Technology/University of Indonesia(インドネシア)

    • Related Report
      2024 Research-status Report
  • [Int'l Joint Research] Institute of Information Technology(ベトナム)

    • Related Report
      2024 Research-status Report
  • [Int'l Joint Research] Bandung Institute of Technology/University of Indonesia(インドネシア)

    • Related Report
      2023 Annual Research Report
  • [Int'l Joint Research] Institute of Information Technology(ベトナム)

    • Related Report
      2023 Annual Research Report
  • [Int'l Joint Research] Bandung Institute of Technology/University of Indonesia(インドネシア)

    • Related Report
      2022 Annual Research Report
  • [Journal Article] Zero-Shot Cross-Lingual Text-to-Speech With Style-Enhanced Normalization and Auditory Feedback Training Mechanism2025

    • Author(s)
      Tran Chung、Luong Chi Mai、Sakti Sakriani
    • Journal Title

      IEEE Transactions on Audio, Speech and Language Processing

      Volume: 33 Pages: 1479-1492

    • DOI

      10.1109/taslpro.2025.3548429

    • Related Report
      2024 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages2025

    • Author(s)
      Thanh Nguyen Luan、Sakti Sakriani
    • Journal Title

      IEEE Access

      Volume: 13 Pages: 8638-8648

    • DOI

      10.1109/access.2025.3527012

    • Related Report
      2024 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Neural End-To-End Speech Translation Leveraged by ASR Posterior Distribution2024

    • Author(s)
      KO Yuka、SUDOH Katsuhito、SAKTI Sakriani、NAKAMURA Satoshi
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E107.D Issue: 10 Pages: 1322-1331

    • DOI

      10.1587/transinf.2023EDP7249

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2024-10-01
    • Related Report
      2024 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Applying Syntax-Prosody Mapping Hypothesis and Boundary-Driven Theory to Neural Sequence-to-Sequence Speech Synthesis2024

    • Author(s)
      Furukawa Kei、Kishiyama Takeshi、Nakamura Satoshi、Sakti Sakriani
    • Journal Title

      IEEE Access

      Volume: 12 Pages: 160896-160917

    • DOI

      10.1109/access.2024.3487053

    • Related Report
      2024 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] NAIST Simultaneous Speech Translation System for IWSLT 20242024

    • Author(s)
      Ko Yuka、Fukuda Ryo、Nishikawa Yuta、Kano Yasumasa、Yanagita Tomoya、Doi Kosuke、Makinae Mana、Tan Haotian、Sakai Makoto、Sakti Sakriani、Sudoh Katsuhito、Nakamura Satoshi
    • Journal Title

      Proc. of IWSLT

      Volume: 1 Pages: 170-182

    • DOI

      10.18653/v1/2024.iwslt-1.23

    • Related Report
      2024 Research-status Report
  • [Journal Article] The NAIST System for the CHiME-8 NOTSOFAR-1 Task2024

    • Author(s)
      Hirano Yuta、Nguyen Mau、Azuma Kakeru、Saragih Jan Meyer、Sakti Sakriani
    • Journal Title

      Proc. of the 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024)

      Volume: 1 Pages: 59-63

    • DOI

      10.21437/chime.2024-13

    • Related Report
      2024 Research-status Report
  • [Journal Article] Contrastive Feedback Mechanism for Simultaneous Speech Translation2024

    • Author(s)
      Tan Haotian、Sakti Sakriani
    • Journal Title

      Proc. of INTERSPEECH

      Volume: 1 Pages: 852-856

    • DOI

      10.21437/interspeech.2024-2426

    • Related Report
      2024 Research-status Report
  • [Journal Article] Leveraging the Multilingual Indonesian Ethnic Languages Dataset In Self-Supervised Models for Low-Resource ASR Task2023

    • Author(s)
      Sakti Sakriani, Titalim Benita Angela
    • Journal Title

      Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

      Volume: Vol. 1 Pages: 1314-1321

    • DOI

      10.1109/asru57964.2023.10389730

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Speech Recognition and Meaning Interpretation: Towards Disambiguation of Structurally Ambiguous Spoken Utterances in Indonesian2023

    • Author(s)
      Widiaputri Ruhiyah, Purwarianti Ayu, Lestari Dessi, Azizah Kurniawati, Tanaya Dipta、Sakti Sakriani
    • Journal Title

      Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)

      Volume: Vol. 1 Pages: 16813-16824

    • DOI

      10.18653/v1/2023.emnlp-main.1045

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models2023

    • Author(s)
      Ika Hartanti Bella Septina, Tanaya Dipta, Azizah Kurniawati, Lestari Dessi Puji、Purwarianti Ayu、Sakti Sakriani
    • Journal Title

      Proceeding of the Conference of the Oriental COCOSDA

      Volume: Vol. 1 Pages: 1-6

    • DOI

      10.1109/o-cocosda60357.2023.10482965

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Exploring Difficulties Encountered by Professional Interpreters in Japanese-to-English and English-to-Japanese Simultaneous Translation2023

    • Author(s)
      Xi Hang, Sakti Sakriani
    • Journal Title

      Proceeding of the Conference of the Oriental COCOSDA

      Volume: Vol. 1 Pages: 1-6

    • DOI

      10.1109/o-cocosda60357.2023.10482968

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] STEN-TTS: Improving Zero-shot Cross-Lingual Transfer for Multi-Lingual TTS with Style-Enhanced Normalization Diffusion Framework2023

    • Author(s)
      Tran Chung, Luong Chi Mai, Sakti Sakriani
    • Journal Title

      Proceedings of the INTERSPEECH

      Volume: Vol. 1 Pages: 4464-4468

    • DOI

      10.21437/interspeech.2023-2243

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Unsupervised Learning of Discrete Latent Representations with Data-Adaptive Dimensionality from Continuous Speech Streams2023

    • Author(s)
      Takahashi Shun, Sakti Sakriani
    • Journal Title

      Proceedings of the INTERSPEECH

      Volume: Vol. 1 Pages: 416-420

    • DOI

      10.21437/interspeech.2023-1321

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Low-Resource Japanese-English Speech-to-Text Translation Leveraging Speech-Text Unified-model Representation Learning2023

    • Author(s)
      Tran Tu Dinh, Sakti Sakriani
    • Journal Title

      Proceedings of the INTERSPEECH Satellite Workshop - the ELRA/ISCA Special Interest Group on Under-resourced Languages (SIGUL)

      Volume: Vol. 1 Pages: 78-82

    • DOI

      10.21437/sigul.2023-17

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] VGSAlign: Bilingual Speech Alignment of Unpaired and Untranscribed Languages using Self-Supervised Visually Grounded Speech Models2023

    • Author(s)
      Nguyen Luan Thanh, Sakti Sakriani
    • Journal Title

      Proceedings of the INTERSPEECH Satellite Workshop - the ELRA/ISCA Special Interest Group on Under-resourced Languages (SIGUL)

      Volume: Vol. 1 Pages: 53-57

    • DOI

      10.21437/sigul.2023-12

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] An Isotropy Analysis for Self-Supervised Acoustic Unit Embeddings on the Zero Resource Speech Challenge 2021 Framework2023

    • Author(s)
      Chen Jianan, Sakti Sakriani
    • Journal Title

      Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

      Volume: Vol. 1 Pages: 1-5

    • DOI

      10.1109/icassp49357.2023.10095119

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition2023

    • Author(s)
      Novitasari Sashi、Sakti Sakriani、Nakamura Satoshi
    • Journal Title

      Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

      Volume: Vol. 1 Pages: 1-5

    • DOI

      10.1109/icassp49357.2023.10096128

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Investigation of Cross-Lingual Mismatch in Low-resource ASR for Indonesian Ethnic Languages2023

    • Author(s)
      Sakti Sakriani, Titalim Benita Angela
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 761-762

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Maintaining Personal Styles in Multilingual TTS with STEN Approach in Diffusion Framework2023

    • Author(s)
      Tran Chung, Luong Chi Mai, Sakti Sakriani
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 775-776

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Non-Parallel Limited Data Emotion Voice Conversion with Variance Adapter and Non-Autoregressive Decoder2023

    • Author(s)
      Zhang Zhanhang, Sakti Sakriani
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 1013-1014

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Deep Sequential Generative Modeling for Unsupervised Learning of Linguistic Representations from Speech Streams2023

    • Author(s)
      Takahashi Shun, Sakti Sakriani
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 825-826

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Perceived Challenges in Simultaneous Japanese-English Translation2023

    • Author(s)
      Xi Hang, Sakti Sakriani
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 827-828

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Utilizing Self-Supervised Visually Grounded Speech Models for Aligning Unpaired and Untranscribed Bilingual Speech2023

    • Author(s)
      Nguyen Luan Thanh, Sakti Sakriani
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 829-830

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Generating Textual Prosody based on ASR2023

    • Author(s)
      Liu Mingxi, Sakti Sakriani
    • Journal Title

      Proceedings of the ASJ Spring Meeting

      Volume: Vol. 1 Pages: 831-832

    • Related Report
      2023 Annual Research Report
  • [Journal Article] Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input2023

    • Author(s)
      Yanagita Tomoya、Sakti Sakriani、Nakamura Satoshi
    • Journal Title

      IEEE Access

      Volume: 11 Pages: 22355-22363

    • DOI

      10.1109/access.2023.3251657

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation2023

    • Author(s)
      Chevi Rendi、Prasojo Radityo Eko、Aji Alham Fikri、Tjandra Andros、Sakti Sakriani
    • Journal Title

      Proceeding of the IEEE Spoken Language Technology Workshop (SLT) 2023

      Volume: 1 Pages: 970-976

    • DOI

      10.1109/slt54892.2023.10023322

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Language technology impact on linguistic diversity2023

    • Author(s)
      Sakti Sakriani
    • Journal Title

      In Book: "State of the art of indigenous languages in research: a collection of selected research papers," UNESCO Open Access Repository

      Volume: 1 Pages: 341-348

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Cyclic Partially-aligned Transformer for Visually Connected Speech-to-text Mapping2023

    • Author(s)
      Johanes Effendi、Sakti Sakriani、Nakamura Satoshi
    • Journal Title

      Proceeding of the Acoustical Society of Japan (ASJ)

      Volume: 1 Pages: 1-2

    • Related Report
      2022 Annual Research Report
  • [Journal Article] Synthesis Unit for Japanese Incremental Text-to-Speech2022

    • Author(s)
      柳田 智也、サクテイ サクリアニ、中村 哲
    • Journal Title

      情報処理学会論文誌

      Volume: 63 Issue: 4 Pages: 1149-1158

    • DOI

      10.20729/00217617

    • Year and Date
      2022-04-15
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Machine Speech Chain Approach for Dynamically Adaptive Lombard TTS in Static and Dynamic Noise Environments2022

    • Author(s)
      Novitasari Sashi、Sakti Sakriani、Nakamura Satoshi
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 30 Pages: 2673-2688

    • DOI

      10.1109/taslp.2022.3196879

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Tackling multiple object tracking with complicated motions ? Re-designing the integration of motion and appearance2022

    • Author(s)
      Yang Fan、Wang Zheng、Wu Yang、Sakti Sakriani、Nakamura Satoshi
    • Journal Title

      Image and Vision Computing

      Volume: 124 Pages: 104514-104514

    • DOI

      10.1016/j.imavis.2022.104514

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing2022

    • Author(s)
      Qi Heli、Novitasari Sashi、Sakti Sakriani、Nakamura Satoshi
    • Journal Title

      Proceeding of the INTERSPEECH 2022

      Volume: 1 Pages: 3413-3417

    • DOI

      10.21437/interspeech.2022-11169

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed
  • [Journal Article] NAIST Simultaneous Speech-to-Text Translation System for IWSLT 20222022

    • Author(s)
      Fukuda Ryo、Ko Yuka、Kano Yasumasa、Doi Kosuke、Tokuyama Hirotaka、Sakti Sakriani、Sudoh Katsuhito、Nakamura Satoshi
    • Journal Title

      Proceeding of the International Conference on Spoken Language Translation (IWSLT)

      Volume: 1 Pages: 286-292

    • DOI

      10.18653/v1/2022.iwslt-1.25

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Modeling Unsupervised Empirical Adaptation by DPGMM and DPGMM-RNN Hybrid Model to Extract Perceptual Features for Low-resource ASR2022

    • Author(s)
      Bin Wu, Sakriani Sakti, Jinsong Zhang, and Satoshi Nakamura
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: Vol. 30 Pages: 901-916

    • DOI

      10.1109/taslp.2022.3150220

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Neural Incremental Speech Recognition Toward Real-Time Machine Speech Translation2021

    • Author(s)
      Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura,
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E104.D Issue: 12 Pages: 2195-2208

    • DOI

      10.1587/transinf.2021EDP7014

    • NAID

      130008123347

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2021-12-01
    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain2021

    • Author(s)
      Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E104.D Issue: 10 Pages: 1661-1677

    • DOI

      10.1587/transinf.2021EDP7005

    • NAID

      130008095601

    • ISSN
      0916-8532, 1745-1361
    • Year and Date
      2021-10-01
    • Related Report
      2021 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing2021

    • Author(s)
      Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      IEEE Access

      Volume: 9 Pages: 70286-70299

    • DOI

      10.1109/access.2021.3077886

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis2021

    • Author(s)
      Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceeding of the Oriental COCOSDA 2021

      Volume: 1 Pages: 206-211

    • DOI

      10.1109/o-cocosda202152914.2021.9660456

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation2021

    • Author(s)
      Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura
    • Journal Title

      Proceeding of the INTERSPEECH 2021

      Volume: 1 Pages: 2262-2266

    • DOI

      10.21437/interspeech.2021-1020

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer2021

    • Author(s)
      Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceeding of the INTERSPEECH 2021

      Volume: 1 Pages: 2257-2261

    • DOI

      10.21437/interspeech.2021-970

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages2021

    • Author(s)
      Shun Takahashi, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceeding of the INTERSPEECH 2021

      Volume: 1 Pages: 1559-1563

    • DOI

      10.21437/interspeech.2021-1340

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder2021

    • Author(s)
      Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura
    • Journal Title

      Proceeding of the INTERSPEECH 2021

      Volume: 1 Pages: 4124-4128

    • DOI

      10.21437/interspeech.2021-946

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access
  • [Presentation] Language Technologies for All Mother Languages: Opportunities and Challenges2025

    • Author(s)
      Sakriani Sakti
    • Organizer
      The International Mother Language Day (IMLD)
    • Related Report
      2024 Research-status Report
    • Invited
  • [Presentation] Machine Speech Chain: Modeling Human Speech Perception and Production with Auditory Feedback Mechanism2025

    • Author(s)
      Sakriani Sakti
    • Organizer
      The NUS Computer Science Research Week
    • Related Report
      2024 Research-status Report
    • Invited
  • [Presentation] 日英コードスイッチングが社会的なヒューマンロボットインタラクションに及ぼす影響2025

    • Author(s)
      中村 佳登, メフムード ファイサル, サクティ サクリアニ
    • Organizer
      SIG-SLUD
    • Related Report
      2024 Research-status Report
  • [Presentation] 音声翻訳フレームワークによる吃音音声の自動音声認識に対する課題への取り組み2025

    • Author(s)
      久保田 なつみ, サクティ サクリアニ
    • Organizer
      SIG-SLUD
    • Related Report
      2024 Research-status Report
  • [Presentation] 音声信号から文字記号を創り出す―深層ベイズに基づく教師なし表現学習によるアプローチ―2025

    • Author(s)
      髙橋 舜, 金崎 朝子, 須田 仁志, サクティ サクリアニ
    • Organizer
      NLP
    • Related Report
      2024 Research-status Report
  • [Presentation] 音声認識出力の曖昧性を考慮したMulti-task End-to-end音声翻訳と曖昧性の高い音声入力に対する頑健性の分析2025

    • Author(s)
      胡 尤佳, 須藤 克仁, 中村 哲, サクティ サクリアニ
    • Organizer
      NLP
    • Related Report
      2024 Research-status Report
  • [Presentation] Disambiguating Ambiguous Indonesian Utterances with ASR and Meaning Interpretation2025

    • Author(s)
      R. F. Widiaputri, A. Purwarianti, D. P. Lestari, K. Azizah, D. Tanaya, S. Sakti
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] Multitask Training of Multi-channel Speaker Separation and Room Acoustic Parameter Estimation2025

    • Author(s)
      R. Hartanto, S. Sakti, K. Shinoda
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] 中間 CTC 目標を活用した多言語 ASR におけるコードスイッチングの向上2025

    • Author(s)
      東 翔, サクティ サクリアニ
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] Flow Matchingによる周波数領域でのフローマッチングを用いた高速ニューラルボコーダー2025

    • Author(s)
      W. Yingjie, S. Sakti
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] Improving Simultaneous Speech Translation with a Contrastive Feedback Mechanism2025

    • Author(s)
      T. Haotian, S. Sakti
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] The NAIST System for the CHiME-8 Distant Meeting Transcription Challenge2025

    • Author(s)
      Y. Hirano, M. Nguyen, K. Azuma, J. M. Saragih, S. Sakti
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] 音声認識誤りが ChatGPT の翻訳に与える影響の調査2025

    • Author(s)
      安藤 宏祐, 平野 雄太, 佐藤 颯空, サクティ サクリアニ
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] 拡散モデルベース DNN 音声合成のバックボーンに着目した軽量化とカーネル形状変化の影響2025

    • Author(s)
      佐藤 颯空, サクティ サクリアニ
    • Organizer
      ASJ Spring Meeting
    • Related Report
      2024 Research-status Report
  • [Presentation] Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities2024

    • Author(s)
      Aulia Adila, Dessi Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti
    • Organizer
      O-COCOSDA
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID2024

    • Author(s)
      Ahmad Alfani Handoyo, Chung Tran, Dessi Puji Lestari, Sakriani Sakti
    • Organizer
      O-COCOSDA
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] Continual Learning in Machine Speech Chain Using Gradient Episodic Memory2024

    • Author(s)
      Geoffrey Tyndall, Kurniawati Arizah, Dipta Tanaya, Ayu Purwarianti, Dessi Puji Lestari, Sakriani Sakti
    • Organizer
      O-COCOSDA
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] An Evaluation of Neural Vocoder-Based Voice Cloning System for Dysphonia Speech Disorder2024

    • Author(s)
      Dhiya Dewangga, Dessi Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti
    • Organizer
      O-COCOSDA
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] Chunk Size Scheduling for Optimizing the Quality-Latency Trade-off in Simultaneous Speech Translation2024

    • Author(s)
      Iqbal Pahlevi Amin, Haotian Tan, Kurniawati Azizah, Sakriani Sakti
    • Organizer
      O-COCOSDA
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] A Feedback-Driven Self-Improvement Strategy and Emotion-Aware Vocoder for Emotional Voice Conversion2024

    • Author(s)
      Zhanhang Zhang, Sakriani Sakti
    • Organizer
      O-COCOSDA
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] Contrastive Feedback Mechanism for Simultaneous Speech Translation2024

    • Author(s)
      Haotian Tan, Sakriani Sakti
    • Organizer
      INTERSPEECH
    • Related Report
      2024 Research-status Report
  • [Presentation] The NAIST System for the CHiME-8 NOTSOFAR-1 Task2024

    • Author(s)
      Yuta Hirano, Mau Nguyen, Kakeru Azuma, Jan Meyer Saragih, Sakriani Sakti
    • Organizer
      The 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024)
    • Related Report
      2024 Research-status Report
  • [Presentation] Machine Speech Chain with Emotion Recognition2024

    • Author(s)
      Akeyla Pradia Naufal, Dessi Puji Lestari, Ayu Purwarianti, Kurniawati Azizah, Dipta Tanaya, Sakriani Sakti
    • Organizer
      The 11th International Conference on Advanced Informatics: Concept, Theory and Application (ICAICTA)
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] NAIST Simultaneous Speech Translation System for IWSLT 20242024

    • Author(s)
      Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Haotian Tan, Makoto Sakai, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura
    • Organizer
      IWSLT
    • Related Report
      2024 Research-status Report
  • [Presentation] Refining rtMRI Landmark-Based Vocal Tract Contour Labels with FCN-Based Smoothing and Point-to-Curve Projection2024

    • Author(s)
      Mushaffa Rasyid Ridha, Sakriani Sakti
    • Organizer
      LREC-COLING
    • Related Report
      2024 Research-status Report
  • [Presentation] Indonesian-English Code-Switching Speech Recognition using the Machine Speech Chain based Semi-Supervised Learning2024

    • Author(s)
      Rais Vaza Man Tazakka,, Dessi Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti
    • Organizer
      SIGUL
    • Related Report
      2024 Research-status Report
    • Int'l Joint Research
  • [Presentation] Multilingual Self-Supervised Visually Grounded Speech Models2024

    • Author(s)
      Huynh Phuong Thanh Nguyen, Sakriani Sakti
    • Organizer
      SIGUL
    • Related Report
      2024 Research-status Report
  • [Presentation] Machine Speech Chain: From Human Auditory Feedback Principles to Language Technology Empowering Indigenous Communities2024

    • Author(s)
      Sakriani Sakti
    • Organizer
      The 21st International Conference on Natural Language Processing (ICON)
    • Related Report
      2024 Research-status Report
    • Invited
  • [Presentation] Leveraging the Foundational Speech Chain Models to Empower Low-Resource Languages2024

    • Author(s)
      Sakriani Sakti
    • Organizer
      The IX International Scientific Conference 溺odern Problems of Applied Mathematics and Information Technologies Al-Khwarizmi
    • Related Report
      2024 Research-status Report
    • Invited
  • [Presentation] Machine Speech Chain: A Deep Learning Approach for Modeling Human Speech Perception and Production with Auditory Feedback Mechanism for Low-Resource Languages2024

    • Author(s)
      Sakriani Sakti
    • Organizer
      The 11th International Conference on Computer, Control, Informatics and its Applications (IC3INA)
    • Related Report
      2024 Research-status Report
    • Invited
  • [Presentation] Language Technology for All: Leveraging Foundational Speech Models to Empower Low-Resource Languages2024

    • Author(s)
      Sakriani Sakti
    • Organizer
      The INTERSPEECH Workshop on Synthetic Data Transformative Role in Foundational Speech Models (SynData4GenA)
    • Related Report
      2024 Research-status Report
    • Invited
  • [Presentation] Communicative Intelligent Systems towards Society 5.02023

    • Author(s)
      Sakti Sakriani
    • Organizer
      Sarasehan Nasional Pendidikan Tinggi Informatika dan Pemberian Tribute kepada Penggagas dan Pendidik Senior Teknik Informatika ITB
    • Related Report
      2023 Annual Research Report 2022 Annual Research Report
    • Invited
  • [Presentation] Language Technology for All: From the indigenous community perspectives2023

    • Author(s)
      Sakti Sakriani
    • Organizer
      Data, Technologies and Benchmarks for the Spoken Languages of the World" Meeting, IEEE SLT
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Language Technology for All: From the technology and indigenous community perspectives2023

    • Author(s)
      Sakti Sakriani
    • Organizer
      the 25th Conference of the Oriental COCOSDA
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Leveraging the Multilingual Indonesian Ethnic Languages Dataset In Self-Supervised Models for Low-Resource ASR Task2023

    • Author(s)
      Titalim Benita Angela
    • Organizer
      IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Speech Recognition and Meaning Interpretation: Towards Disambiguation of Structurally Ambiguous Spoken Utterances in Indonesian2023

    • Author(s)
      Widiaputri Ruhiyah
    • Organizer
      the Conference on Empirical Methods in Natural Language Processing (EMNLP)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models2023

    • Author(s)
      Ika Hartanti Bella Septina、Sakti Sakriani
    • Organizer
      the Oriental COCOSDA
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Exploring Difficulties Encountered by Professional Interpreters in Japanese-to-English and English-to-Japanese Simultaneous Translation2023

    • Author(s)
      Xi Hang、Sakti Sakriani
    • Organizer
      the Oriental COCOSDA
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] STEN-TTS: Improving Zero-shot Cross-Lingual Transfer for Multi-Lingual TTS with Style-Enhanced Normalization Diffusion Framework2023

    • Author(s)
      Tran Chung, Sakti Sakriani
    • Organizer
      INTERSPEECH
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Unsupervised Learning of Discrete Latent Representations with Data-Adaptive Dimensionality from Continuous Speech Streams2023

    • Author(s)
      Takahashi Shun、Sakti Sakriani
    • Organizer
      INTERSPEECH
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Low-Resource Japanese-English Speech-to-Text Translation Leveraging Speech-Text Unified-model Representation Learning2023

    • Author(s)
      Tran Tu Dinh、Sakti Sakriani
    • Organizer
      the INTERSPEECH Satellite Workshop - the ELRA/ISCA Special Interest Group on Under-resourced Languages (SIGUL)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] VGSAlign: Bilingual Speech Alignment of Unpaired and Untranscribed Languages using Self-Supervised Visually Grounded Speech Models2023

    • Author(s)
      Nguyen Luan Thanh、Sakti Sakriani
    • Organizer
      the INTERSPEECH Satellite Workshop - the ELRA/ISCA Special Interest Group on Under-resourced Languages (SIGUL)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] An Isotropy Analysis for Self-Supervised Acoustic Unit Embeddings on the Zero Resource Speech Challenge 2021 Framework2023

    • Author(s)
      Chen Jianan、Sakti Sakriani
    • Organizer
      the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition2023

    • Author(s)
      Novitasari Sashi、Sakti Sakriani、Nakamura Satoshi
    • Organizer
      the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Investigation of Cross-Lingual Mismatch in Low-resource ASR for Indonesian Ethnic Languages2023

    • Author(s)
      Benita Angela Titalim
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Maintaining Personal Styles in Multilingual TTS with STEN Approach in Diffusion Framework2023

    • Author(s)
      Tran Chung
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Non-Parallel Limited Data Emotion Voice Conversion with Variance Adapter and Non-Autoregressive Decoder2023

    • Author(s)
      Zhang Zhanhang
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Deep Sequential Generative Modeling for Unsupervised Learning of Linguistic Representations from Speech Streams2023

    • Author(s)
      Takahashi Shun
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Perceived Challenges in Simultaneous Japanese-English Translation2023

    • Author(s)
      Xi Hang
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Utilizing Self-Supervised Visually Grounded Speech Models for Aligning Unpaired and Untranscribed Bilingual Speech2023

    • Author(s)
      Sakti Sakriani
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Generating Textual Prosody based on ASR2023

    • Author(s)
      Liu Mingxi
    • Organizer
      the ASJ Spring Meeting
    • Related Report
      2023 Annual Research Report
  • [Presentation] Language Technology for All: From the indigenous community perspectives2023

    • Author(s)
      Sakti Sakriani
    • Organizer
      "Data, Technologies and Benchmarks for the Spoken Languages of the World" Meeting of IEEE SLT
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation2023

    • Author(s)
      Chevi Rendi、Prasojo Radityo Eko、Aji Alham Fikri、Tjandra Andros、Sakti Sakriani
    • Organizer
      IEEE Spoken Language Technology Workshop (SLT) 2023
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Cyclic Partially-aligned Transformer for Visually Connected Speech-to-text Mapping2023

    • Author(s)
      Effendi Johanes、Sakti Sakriani、Nakamura Satoshi
    • Organizer
      Acoustical Society of Japan (ASJ)
    • Related Report
      2022 Annual Research Report
  • [Presentation] Language Technology for All: From the technology and indigenous community perspectives2022

    • Author(s)
      Sakti Sakriani
    • Organizer
      the 25th Conference of the Oriental COCOSDA
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Semi-supervised Learning for Low-resource Multilingual and Multimodal Speech Processing with Machine Speech Chain2022

    • Author(s)
      Sakti Sakriani
    • Organizer
      "Data Collection, Bias, and Ethical Concerns in Speech Processing," Speech for Social Good - INTERSPEECH Satellite Workshop
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Semi-supervised Learning for Low-resource Multilingual and Multimodal Speech Processing with Machine Speech Chain2022

    • Author(s)
      Sakti Sakriani
    • Organizer
      HiTZ Language Technology Webinar
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Data Collection, Bias, and Ethical Concerns in Speech Processing2022

    • Author(s)
      Sakti Sakriani
    • Organizer
      Speech for Social Good - INTERSPEECH Satellite Workshop
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing2022

    • Author(s)
      Qi Heli、Novitasari Sashi、Sakti Sakriani、Nakamura Satoshi
    • Organizer
      INTERSPEECH 2022
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research
  • [Presentation] NAIST Simultaneous Speech-to-Text Translation System for IWSLT 20222022

    • Author(s)
      Fukuda Ryo、Ko Yuka、Kano Yasumasa、Doi Kosuke、Tokuyama Hirotaka、Sakti Sakriani、Sudoh Katsuhito、Nakamura Satoshi
    • Organizer
      International Conference on Spoken Language Translation (IWSLT)
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Self-Adaptive Machine Speech Chain in Noisy Environment2022

    • Author(s)
      Sakriani Sakti
    • Organizer
      the AAAI workshop on Self-supervised Learning for Audio and Speech Processing
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis2021

    • Author(s)
      Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      Oriental COCOSDA 2021
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation2021

    • Author(s)
      Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura
    • Organizer
      INTERSPEECH 2021
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer2021

    • Author(s)
      Johanes Effendi, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      INTERSPEECH 2021
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages2021

    • Author(s)
      Shun Takahashi, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      INTERSPEECH 2021
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder2021

    • Author(s)
      Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      INTERSPEECH 2021
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Improving Intelligibility of Synthesized Speech in Noisy Condition with Dynamically Adaptive Machine Speech Chain2021

    • Author(s)
      Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura
    • Organizer
      SIG-SLP 2021
    • Related Report
      2021 Annual Research Report
  • [Presentation] ゼロ資源状況におけるサブワード単位の獲得にむけて グラフニューラルネットワークを用いた手法2021

    • Author(s)
      高橋 舜, サクティ サクリアニ, 中村 哲
    • Organizer
      2021年度 人工知能学会全国大会 (第35回)
    • Related Report
      2021 Annual Research Report
  • [Presentation] 局所的な句構造の情報を用いた ニューラル音声合成2021

    • Author(s)
      海木 延佳, サクティ サクリアニ, 中村 哲
    • Organizer
      音学シンポジウム2021
    • Related Report
      2021 Annual Research Report
  • [Presentation] Machine Speech Chain: A Deep Learning Approach for Training and Inference through Feedback Loop2021

    • Author(s)
      Sakriani Sakti
    • Organizer
      IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Machine Speech Chain: A Deep Learning Approach for Modeling Human Speech Perception and Production with Auditory Feedback Mechanism2021

    • Author(s)
      Sakriani Sakti
    • Organizer
      the ITB Seminar
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Listening while Speaking and Visualizing: A Semi-supervised Approach with Multimodal Machine Speech Chain2021

    • Author(s)
      Sakriani Sakti
    • Organizer
      the SoCS International Seminar
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Listening while Speaking and Visualizing: A Semi-supervised Approach with Multimodal Machine Speech Chain2021

    • Author(s)
      Sakriani Sakti
    • Organizer
      International Conference of Artificial Intelligence and Speech Technology (AIST-3)
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Listening while Speaking and Visualizing: A Semi-supervised Approach with Multimodal Machine Speech Chain2021

    • Author(s)
      Sakriani Sakti
    • Organizer
      YRRSDS
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited

URL: 

Published: 2021-04-28   Modified: 2025-12-26  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi