
Image to Sound Mapping Method Using Spectral Phase on Long-Term Fourier Transform

Research Project

Project/Area Number 17K00234
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Multi-year Fund
Section General
Research Field Perceptual information processing
Research Institution Kyoto Sangyo University (2018-2020)
Osaka University (2017)

Principal Investigator

KAWAMURA Arata  Kyoto Sangyo University, Faculty of Information Science and Engineering, Professor (60362646)

Project Period (FY) 2017-04-01 – 2021-03-31
Project Status Completed (Fiscal Year 2020)
Budget Amount
¥3,510,000 (Direct Cost: ¥2,700,000, Indirect Cost: ¥810,000)
Fiscal Year 2019: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2018: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2017: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
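Keywords Image-to-sound conversion / Spectrogram / Phase spectrum / Long-term Fourier transform / Iterative phase retrieval / Information systems / Image / Speech and other recognition / Information and communication engineering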
Outline of Final Research Achievements

In this study, we proposed an image-to-sound mapping method. The technique treats an image as a spectrogram and maps it to a sound by taking the inverse Fourier transform of that spectrogram. Embedding the image destroys the speech spectral amplitude, so we compensate for the loss in speech quality by using the speech spectral phase obtained with the LTFT (Long-Term Fourier Transform); the speech spectral phase on the LTFT carries speech intelligibility. The proposed method synthesizes a speech signal whose spectrogram consists of the original image and the speech spectral phase on the LTFT. The synthesized speech signal is emitted from a loudspeaker and received by a microphone on a mobile device. The received signal is transformed into a spectrogram, which directly displays the transmitted image. The proposed method requires no special transformation technique other than the Fourier transform.
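
The core idea above, treating an image as a spectrogram and recovering it on the receiving side with a Fourier transform, can be illustrated with a minimal NumPy sketch. It is not the project's implementation. The function names `image_to_sound` and `sound_to_image`, the non-overlapping rectangular framing, and the random placeholder phase are all assumptions made for illustration. In particular, the sketch does not compute an LTFT speech phase; it only shows that the magnitude (the image) survives whatever phase is attached. The acoustic path between loudspeaker and microphone is not modeled.

```python
import numpy as np

def image_to_sound(image, phase=None):
    """Map an image (rows = frequency bins, columns = time frames) to a waveform.

    Hypothetical helper: each column is used as the magnitude spectrum of one
    non-overlapping frame, and `phase` stands in for a speech-derived phase.
    """
    image = np.asarray(image, dtype=float)
    n_bins, n_frames = image.shape
    if phase is None:
        phase = np.zeros_like(image)
    phase = np.array(phase, dtype=float)  # copy so the caller's array is untouched
    # The DC and Nyquist bins must be real for a real-valued frame.
    phase[0, :] = 0.0
    phase[-1, :] = 0.0
    spectrum = image * np.exp(1j * phase)
    frame_len = 2 * (n_bins - 1)
    # Inverse real FFT of every column gives one time-domain frame per column.
    frames = np.fft.irfft(spectrum, n=frame_len, axis=0)
    # Concatenate the frames in time order.
    return frames.T.reshape(-1)

def sound_to_image(signal, n_bins):
    """Recover the image as the magnitude spectrogram of the received signal."""
    signal = np.asarray(signal, dtype=float)
    frame_len = 2 * (n_bins - 1)
    n_frames = len(signal) // frame_len
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len).T
    return np.abs(np.fft.rfft(frames, axis=0))

# Usage: a random non-negative "image" and a random placeholder phase.
rng = np.random.default_rng(0)
image = rng.random((17, 8))                      # 17 frequency bins, 8 frames
placeholder_phase = rng.uniform(-np.pi, np.pi, image.shape)
signal = image_to_sound(image, placeholder_phase)
recovered = sound_to_image(signal, n_bins=image.shape[0])
```

With matching frame boundaries on both sides, the recovered magnitude equals the embedded image regardless of the phase used, which is why the phase is free to carry speech-related information. A practical system would likely need overlapping windows and frame synchronization, which this sketch leaves out.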

Academic Significance and Societal Importance of the Research Achievements

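In this research, synthesized speech with an embedded image is emitted from a loudspeaker or similar device, and the image is recovered from the speech on the receiving side. Once this technology is complete, image information can be conveyed together with the verbal information carried by the speech. Reception also becomes possible in places without WiFi coverage, and the reception range can be controlled by adjusting the loudspeaker volume. The possible applications are wide-ranging: embedding evacuation routes or photographs of a disaster site in emergency broadcasts from disaster-prevention loudspeakers, embedding weather maps in radio weather forecasts, embedding the picture from the relevant page in a picture-book reading, embedding products or sales-floor maps in time-limited sale announcements, and embedding translation information in audio announcements on buses and trains abroad.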

Report

(5 results)
  • 2020 Annual Research Report   Final Research Report ( PDF )
  • 2019 Research-status Report
  • 2018 Research-status Report
  • 2017 Research-status Report
  • Research Products

    (5 results)


Journal Article (3 results) (of which Int'l Joint Research: 2 results, Peer Reviewed: 3 results) Presentation (2 results) (of which Int'l Joint Research: 1 result)

  • [Journal Article] An Efficient Image to Sound Mapping Method Preserving Speech Spectral Envelope (2020)

    • Author(s)
      HOSODA Yuya, KAWAMURA Arata, IIGUNI Youji
    • Journal Title

      IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

      Volume: E103.A Issue: 3 Pages: 629-630

    • DOI

      10.1587/transfun.2019EAL2139

    • NAID

      130007803998

    • ISSN
      0916-8508, 1745-1337
    • Year and Date
      2020-03-01
    • Related Report
      2019 Research-status Report
    • Peer Reviewed
  • [Journal Article] On Sound Signal Processing in Image to Sound Mapping Technique (2017)

    • Author(s)
      Arata Kawamura
    • Journal Title

      Elsevier Applied Acoustics

      Volume: 117 Pages: 1-11

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] An efficient image to sound mapping method using speech spectral phase and multi-column image (2017)

    • Author(s)
      Arata Kawamura, Hiro Igarashi, and Youji Iiguni
    • Journal Title

      IEICE Trans. Fundamentals

      Volume: E100-A Pages: 893-895

    • NAID

      130005397016

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Int'l Joint Research
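  • [Presentation] Image embedding into speech spectrograms using iterative phase retrieval (2020)

    • Author(s)
      Arata Kawamura
    • Organizer
      IEICE Technical Committee on Signal Processing, technical report meeting materials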
    • Related Report
      2019 Research-status Report
  • [Presentation] Image-to-sound transformation using inpainting technique (2018)

    • Author(s)
      Yuya Hosoda, Arata Kawamura, and Youji Iiguni
    • Organizer
      The 2018 International Symposium on Nonlinear Theory and its Applications
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research


Published: 2017-04-28   Modified: 2022-01-27  
