Robust F0 estimation based on time-varying complex speech analysis and its application for IP telephony and musical signal

Research Project

Project/Area Number	20500158
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Perception information processing/Intelligent robotics
Research Institution	University of the Ryukyus
Principal Investigator	FUNAKI Keiichi University of the Ryukyus, Information Processing Center, Lecturer (30315486)
Project Period (FY)	2008 – 2010
Project Status	Completed (Fiscal Year 2010)
Budget Amount	¥3,770,000 (Direct Cost: ¥2,900,000, Indirect Cost: ¥870,000); Fiscal Year 2010: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000); Fiscal Year 2009: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000); Fiscal Year 2008: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000)
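Keywords	speech information processing / signal processing / F0 estimation of speech / analytic signal / complex speech analysis / robust analysis / speech coding / iLBC / pitch estimation of musical sounds / formant estimation / G.711.1 / ALS / formant estimation of speech / noise robustness / time-varying complex speech analysis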
Research Abstract	F0 estimation plays an important role in speech processing. One of the authors previously proposed time-varying complex speech analysis for the analytic speech signal, together with an F0 estimation method based on it: the complex residual is obtained by the speech analysis, and F0 is estimated by peak-picking the autocorrelation of the residual weighted by the reciprocal of the corresponding AMDF (average magnitude difference function). This is called the frame-based method. In this study, we propose a faster and more accurate F0 estimation algorithm. In this method, F0 and F1 are pre-selected using an F0 and F1 contour estimation method based on the time-varying complex analysis, called the sample-based method. The final selection of F0 is then made by the frame-based method, with the F0 search range shortened according to the estimated F0 and F1. In this two-stage estimation, the shortened range yields more accurate estimates at a lower computational cost. Furthermore, to investigate the efficacy of the time-varying analysis, the frame-based method is evaluated separately for frames categorized into four modes according to voicing strength. The experimental results confirm that the time-varying analysis performs better for strongly voiced frames.
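
The frame-based step described in the abstract, peak-picking an autocorrelation weighted by the reciprocal of the AMDF, can be illustrated with a minimal Python sketch. It is not the project's implementation: it operates on a plain real-valued frame rather than on the complex residual from the time-varying complex analysis, and the function names, the `eps` regularizer, the default 50 to 400 Hz search range, and the ±20% band used to mimic the shortened second-stage range are all assumptions made for illustration.

```python
import math


def weighted_autocorr_f0(frame, fs, f0_min=50.0, f0_max=400.0, eps=1e-3):
    """Estimate F0 in Hz by peak-picking autocorrelation / (AMDF + eps).

    frame: list of real samples (assumption: stands in for the residual).
    fs: sampling rate in Hz.
    f0_min, f0_max: search range in Hz (hypothetical defaults).
    eps: small constant that keeps the weight finite when AMDF is near zero.
    """
    n = len(frame)
    lag_min = max(1, int(fs / f0_max))
    lag_max = min(n - 1, int(fs / f0_min))
    if lag_min > lag_max:
        raise ValueError("frame too short or search range empty")

    best_lag, best_score = lag_min, -math.inf
    for lag in range(lag_min, lag_max + 1):
        m = n - lag  # number of overlapping sample pairs at this lag
        # Biased autocorrelation: a plain sum, which mildly favors short lags.
        acf = sum(frame[i] * frame[i + lag] for i in range(m))
        # Average magnitude difference function: small at the true period.
        amdf = sum(abs(frame[i] - frame[i + lag]) for i in range(m)) / m
        score = acf / (amdf + eps)
        if score > best_score:
            best_lag, best_score = lag, score
    return fs / best_lag


def narrowed_range(f0_pre, rel=0.2):
    """Hypothetical second-stage range: a +/- rel band around a pre-selected F0."""
    return f0_pre * (1.0 - rel), f0_pre * (1.0 + rel)
```

With a pre-selected F0 of 105 Hz, `narrowed_range` limits the search to roughly 84 to 126 Hz, so far fewer lags are scored than in the full 50 to 400 Hz search. This mirrors the idea of the two-stage scheme only loosely, since the abstract does not state how the estimated F0 and F1 set the range.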

Report

(4 results)

2010 Annual Research Report, Final Research Report (PDF)
2009 Annual Research Report
2008 Annual Research Report

Research Products
(42 results)

Journal Article (2 results) (of which Peer Reviewed: 2 results), Presentation (39 results), Remarks (1 result)

[Journal Article] Speech Enhancement based on Iterative Wiener Filter using Complex LPC Speech Analysis (2009)
- Author(s)
  Keiichi Funaki
- Journal Title
  Recent Advances in Signal Processing, Ashraf A Zaher (Ed.), IN-TECH BOOK, ISBN: 978-953-307-002-5
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] Speech Enhancement based on Iterative Wiener Filter using Complex LPC Speech Analysis (2009)
- Author(s)
  Keiichi Funaki
- Journal Title
  
  Recent Advances in Signal Processing, INTECH BOOK, ISBN 978-953-307-002-5 http://sciyo.com/books/show/title/recent-advances-in-signal-processing 1
  
  Pages: 251-266
- Related Report
  2009 Annual Research Report
- Peer Reviewed
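[Presentation] A Study on Improving F0 Estimation of Speech Using Time-Varying Complex Speech Analysis (2011)
- Author(s)
  Keiichi Funaki, 比嘉健人
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  Waseda University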
- Year and Date
  2011-03-09
- Related Report
  2010 Final Research Report
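[Presentation] A Study on Improving F0 Estimation of Speech Using Time-Varying Complex Speech Analysis (2011)
- Author(s)
  Keiichi Funaki, 比嘉健人
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  Waseda University (Tokyo)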
- Year and Date
  2011-03-09
- Related Report
  2010 Annual Research Report
[Presentation] F0 Contour Estimation Using ELS-Based Robust Time-Varying Complex Speech Analysis (2011)
- Author(s)
  Keiichi Funaki
- Organizer
  IEEE DSP/SPE Workshop
- Place of Presentation
  Sedona (Arizona, USA)
- Year and Date
  2011-01-08
- Related Report
  2010 Annual Research Report
[Presentation] F0 contour estimation using ELS-based robust time-varying complex speech analysis (2011)
- Author(s)
  Keiichi Funaki
- Organizer
  IEEE DSP/SPE workshop, Sedona
- Place of Presentation
  Arizona, USA
- Year and Date
  2011-01-06
- Related Report
  2010 Final Research Report
[Presentation] A Study of Fundamental Frequency Estimation Using Time-Varying Complex Speech Analysis (2010)
- Author(s)
  Keiichi Funaki, 比嘉健人
- Organizer
  IEICE SIP Symposium
- Place of Presentation
  Nara Women's University
- Year and Date
  2010-11-24
- Related Report
  2010 Final Research Report
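[Presentation] A Study of Fundamental Frequency Estimation Using Time-Varying Complex Speech Analysis (2010)
- Author(s)
  Keiichi Funaki, 比嘉健人
- Organizer
  IEICE SIP Symposium
- Place of Presentation
  Nara Women's University (Nara City)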
- Year and Date
  2010-11-24
- Related Report
  2010 Annual Research Report
[Presentation] On Evaluation of the F0 estimation based on time-varying complex speech analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Interspeech 2010
- Place of Presentation
  Makuhari (Chiba Prefecture)
- Year and Date
  2010-09-27
- Related Report
  2010 Annual Research Report
[Presentation] On Evaluation of the F0 estimation based on time-varying complex speech analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Proc. Interspeech 2010
- Place of Presentation
  Makuhari Messe International Conference Hall
- Year and Date
  2010-09-26
- Related Report
  2010 Final Research Report
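[Presentation] Evaluation of F0 Estimation Using Time-Varying Complex Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Kansai University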
- Year and Date
  2010-09-16
- Related Report
  2010 Final Research Report
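[Presentation] Evaluation of F0 Estimation Using Time-Varying Complex Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Kansai University (Osaka Prefecture)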
- Year and Date
  2010-09-16
- Related Report
  2010 Annual Research Report
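[Presentation] On a Speech Coding Scheme Using Time-Varying Complex Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Kansai University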
- Year and Date
  2010-09-14
- Related Report
  2010 Final Research Report
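[Presentation] On a Speech Coding Scheme Using Time-Varying Complex Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Kansai University (Osaka Prefecture)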
- Year and Date
  2010-09-14
- Related Report
  2010 Annual Research Report
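[Presentation] A Study of an Improved ITU-T G.711.1 Scheme Based on Robust Complex AR Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  University of Electro-Communications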
- Year and Date
  2010-03-10
- Related Report
  2010 Final Research Report
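[Presentation] A Study of an Improved ITU-T G.711.1 Scheme Based on Robust Complex AR Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  University of Electro-Communications (Chofu, Tokyo)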
- Year and Date
  2010-03-10
- Related Report
  2009 Annual Research Report
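[Presentation] A Study of an IETF iLBC-Compatible Scheme Based on Robust Time-Varying Complex AR Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  University of Electro-Communications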
- Year and Date
  2010-03-08
- Related Report
  2010 Final Research Report
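[Presentation] A Study of an IETF iLBC-Compatible Scheme Based on Robust Time-Varying Complex AR Speech Analysis (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  University of Electro-Communications (Chofu, Tokyo)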
- Year and Date
  2010-03-08
- Related Report
  2009 Annual Research Report
[Presentation] Evaluation of robust complex AR analysis on MPEG-4 ALS for noisy speech (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  Proc. IASTED SPPRA2010, Innsbruck
- Place of Presentation
  Austria
- Year and Date
  2010-02-19
- Related Report
  2010 Final Research Report
[Presentation] Evaluation of robust complex AR analysis on MPEG-4 ALS for noisy speech (2010)
- Author(s)
  Keiichi Funaki
- Organizer
  IASTED SPPRA2010
- Place of Presentation
  Innsbruck (Austria)
- Year and Date
  2010-02-19
- Related Report
  2009 Annual Research Report
[Presentation] Evaluation of Complex LPC analysis on lossless compression of Finger Print Image using MPEG-4 ALS (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Proc. SISA 2009
- Place of Presentation
  Kansai University, Osaka
- Year and Date
  2009-10-23
- Related Report
  2010 Final Research Report
[Presentation] Evaluation of Complex LPC Analysis on Lossless Compression of Finger Print Image Using MPEG-4 ALS (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  IEICE SISA2009
- Place of Presentation
  Kansai University (Osaka)
- Year and Date
  2009-10-23
- Related Report
  2009 Annual Research Report
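[Presentation] Robust Time-Varying Complex AR Speech Analysis and Its Application to Speech Processing (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  College of Engineering, Nihon University (Tokyo)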
- Year and Date
  2009-09-18
- Related Report
  2009 Annual Research Report
[Presentation] A Study on the Properties of Hypercomplex Analytic Signals (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Nihon University
- Year and Date
  2009-09-17
- Related Report
  2010 Final Research Report
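[Presentation] A Study on the Properties of Hypercomplex Analytic Signals (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  College of Engineering, Nihon University (Tokyo)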
- Year and Date
  2009-09-17
- Related Report
  2009 Annual Research Report
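[Presentation] Robust Time-Varying Complex AR Speech Analysis and Its Application to Speech Processing (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Nihon University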
- Year and Date
  2009-09-16
- Related Report
  2010 Final Research Report
[Presentation] Evaluation of Audio Lossless Coding (ALS) Based on Robust Complex AR Analysis (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  IEICE Technical Meeting on Speech
- Place of Presentation
  Hokkaido University
- Year and Date
  2009-06-25
- Related Report
  2010 Final Research Report
[Presentation] Evaluation of Audio Lossless Coding (ALS) Based on Robust Complex AR Analysis (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  IEICE Technical Meeting on Speech
- Place of Presentation
  Hokkaido University (Sapporo)
- Year and Date
  2009-06-25
- Related Report
  2009 Annual Research Report
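[Presentation] Improvement of MPEG-4 Audio Lossless Coding (ALS) Using Complex Linear Prediction Analysis (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  Tokyo Institute of Technology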
- Year and Date
  2009-03-18
- Related Report
  2010 Final Research Report
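[Presentation] Improvement of MPEG-4 Audio Lossless Coding (ALS) Using Complex Linear Prediction Analysis (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  Tokyo Institute of Technology, Tokyo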
- Year and Date
  2009-03-18
- Related Report
  2008 Annual Research Report
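[Presentation] Formant Estimation Using Time-Varying Complex Speech Analysis (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  Tokyo Institute of Technology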
- Year and Date
  2009-03-17
- Related Report
  2010 Final Research Report
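[Presentation] Formant Estimation Using Time-Varying Complex Speech Analysis (2009)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Spring Meeting
- Place of Presentation
  Tokyo Institute of Technology, Tokyo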
- Year and Date
  2009-03-17
- Related Report
  2008 Annual Research Report
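[Presentation] Improvement of an F0 Contour Estimation Method Using Time-Varying Complex Speech Analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Kyushu University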
- Year and Date
  2008-09-11
- Related Report
  2010 Final Research Report
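[Presentation] Improvement of an F0 Contour Estimation Method Using Time-Varying Complex Speech Analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustical Society of Japan Autumn Meeting
- Place of Presentation
  Kyushu University Ohashi Campus, Fukuoka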
- Year and Date
  2008-09-11
- Related Report
  2008 Annual Research Report
[Presentation] Speech Enhancement based on Iterative Wiener Filter using Complex Speech Analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  EUSIPCO-2008
- Place of Presentation
  Lausanne, Switzerland
- Year and Date
  2008-08-28
- Related Report
  2008 Annual Research Report
[Presentation] Speech Enhancement based on Iterative Wiener Filter using Complex Speech Analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  Proc. EUSIPCO-2008
- Place of Presentation
  Lausanne, Switzerland
- Year and Date
  2008-08-27
- Related Report
  2010 Final Research Report
[Presentation] F0 estimation based on robust ELS complex speech analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  EUSIPCO-2008
- Place of Presentation
  Lausanne, Switzerland
- Year and Date
  2008-08-26
- Related Report
  2008 Annual Research Report
[Presentation] F0 estimation based on robust ELS complex speech analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  Proc. EUSIPCO-2008
- Place of Presentation
  Lausanne, Switzerland
- Year and Date
  2008-08-25
- Related Report
  2010 Final Research Report
[Presentation] F0 contour estimation based on time-varying complex speech analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  Proc. Acoustics'08, J. Acoust. Soc. Am. 123, 3735
- Place of Presentation
  Paris
- Year and Date
  2008-07-04
- Related Report
  2010 Final Research Report
[Presentation] F0 contour estimation based on time-varying complex speech analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  Acoustics'08
- Place of Presentation
  Paris, France
- Year and Date
  2008-07-03
- Related Report
  2008 Annual Research Report
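[Presentation] A Study on F0 Contour Estimation Based on Robust Time-Varying Complex Speech Analysis (2008)
- Author(s)
  Keiichi Funaki
- Organizer
  IEICE SIP Technical Meeting
- Place of Presentation
  Hokkaido University, Sapporo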
- Year and Date
  2008-06-26
- Related Report
  2008 Annual Research Report
[Remarks] Website, etc.
- Related Report
  2010 Final Research Report
