Forensic intelligibility enhancement of recorded speech in high noise environments

Research Project

Project/Area Number	24710195
Research Category	Grant-in-Aid for Young Scientists (B)
Allocation Type	Multi-year Fund
Research Field	Social systems engineering/Safety system
Research Institution	National Research Institute of Police Science
Principal Investigator	Makinae Hisanori National Research Institute of Police Science, Department of Fourth Forensic Science, Senior Researcher (20415441)
Research Collaborator	SATO Masamichi
Project Period (FY)	2012-04-01 – 2017-03-31
Project Status	Completed (Fiscal Year 2016)
Budget Amount	¥4,420,000 (Direct Cost: ¥3,400,000, Indirect Cost: ¥1,020,000); Fiscal Year 2015: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000); Fiscal Year 2014: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000); Fiscal Year 2013: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000); Fiscal Year 2012: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
Keywords	Reliability engineering / Forensic science
Outline of Final Research Achievements	In forensic casework, speech samples are often recorded in very noisy environments, such as a store or a car where background music is playing, so their signal-to-noise ratio (SNR) is severely degraded. This study proposes speech enhancement methods that are effective for such samples. Some of the proposed methods use the source signal of the noise as a reference signal, because real-time processing is unnecessary for forensic purposes and the source recording of the background music is easy to obtain. Non-negative matrix factorization (NMF) or sinusoidal modeling was used for signal representation. Experiments showed that the proposed methods are effective on severely degraded signals. In addition to these signal-processing methods, a preliminary examination was carried out to investigate the applicability of speech recognition technology.
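The outline does not give the details of the proposed algorithms. As an illustration only, the following is a minimal sketch of one common way to use a noise reference with NMF: noise bases are learned from the magnitude spectrogram of the reference (for example, the original background music track), then held fixed while speech bases and all activations are fitted to the noisy recording. The function names `nmf` and `enhance`, the ranks, the Euclidean multiplicative updates, and the soft mask are all assumptions made for this sketch, not the project's actual method.

```python
import numpy as np

EPS = 1e-9  # small constant to avoid division by zero


def nmf(V, rank, n_iter=200, seed=0):
    """Plain NMF, V ~ W @ H, using Euclidean multiplicative updates."""
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((rank, V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def enhance(V_mix, V_noise_ref, noise_rank=8, speech_rank=16,
            n_iter=200, seed=0):
    """Estimate the speech magnitude spectrogram from a noisy one.

    V_mix       : non-negative array (freq x frames), noisy recording
    V_noise_ref : non-negative array (freq x frames), noise reference
    Both are assumed to be magnitude spectrograms on the same frequency grid.
    """
    rng = np.random.default_rng(seed)

    # Step 1 (assumption): learn noise bases from the reference signal.
    W_n, _ = nmf(V_noise_ref, noise_rank, n_iter, seed)

    # Step 2: fit the mixture with W_n fixed; update only W_s and H.
    W_s = rng.random((V_mix.shape[0], speech_rank)) + EPS
    H = rng.random((noise_rank + speech_rank, V_mix.shape[1])) + EPS
    for _ in range(n_iter):
        W = np.hstack([W_n, W_s])
        H *= (W.T @ V_mix) / (W.T @ W @ H + EPS)
        H_s = H[noise_rank:]
        W_s *= (V_mix @ H_s.T) / (W @ H @ H_s.T + EPS)

    # Step 3: soft mask built from the speech and noise model parts.
    S = W_s @ H[noise_rank:]
    N = W_n @ H[:noise_rank]
    mask = S / (S + N + EPS)
    return mask * V_mix
```

In practice the two spectrograms would come from a short-time Fourier transform of the recordings, and the masked magnitude would be combined with the phase of the noisy signal before inversion; those steps are left out here to keep the sketch self-contained.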

Report

(6 results)

2016 Annual Research Report / Final Research Report (PDF)
2015 Research-status Report
2014 Research-status Report
2013 Research-status Report
2012 Research-status Report

Research Products
(13 results)

Journal Article (2 results, of which Acknowledgement Compliant: 1 result, Peer Reviewed: 1 result) / Presentation (11 results)

[Journal Article] Sound and Forensic Science (音と法科学), 2016
- Author(s)
  長内　隆、蒔苗　久則、網野　加苗
- Journal Title
  The Journal of the Acoustical Society of Japan (日本音響学会誌)
  Volume: 72 Pages: 74-80
- Related Report
  2015 Research-status Report
- Acknowledgement Compliant
[Journal Article] Construction of a Large-Scale Speech Database for Speaker Recognition in Forensic Science (法科学分野における話者認識のための大規模音声データベースの構築), 2013
- Author(s)
  蒔苗久則、鎌田敏明、長内隆
- Journal Title
  Reports of the National Research Institute of Police Science (科学警察研究所報告)
  Volume: in press
- NAID
  40020008909
- Related Report
  2012 Research-status Report
- Peer Reviewed
[Presentation] Blind Noise Suppression Using a Sinusoidal Model (正弦波モデルを用いたブラインド雑音抑圧), 2016
- Author(s)
  蒔苗久則，網野加苗，鎌田敏明，長内隆
- Organizer
  Japanese Association of Forensic Science and Technology
- Place of Presentation
  Nakano Sunplaza
- Year and Date
  2016-11-10
- Related Report
  2016 Annual Research Report
[Presentation] A Study on Evaluating Suppression Performance for Non-Stationary Noise (非定常雑音の抑圧性能の評価に関する研究), 2015
- Author(s)
  蒔苗久則，鎌田敏明，網野加苗，長内隆
- Organizer
  Japanese Association of Forensic Science and Technology
- Place of Presentation
  Kashiwa-no-ha Conference Center (Kashiwa, Chiba)
- Year and Date
  2015-11-12
- Related Report
  2015 Research-status Report
[Presentation] Suppression of Non-Stationary Noise Using a Sinusoidal Model (正弦波モデルを用いた非定常雑音の抑圧), 2014
- Author(s)
  蒔苗久則，網野加苗，鎌田敏明，長内　隆
- Organizer
  20th Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Hotel Floracion Aoyama
- Year and Date
  2014-11-13 – 2014-11-14
- Related Report
  2014 Research-status Report
[Presentation] Nasality in Speech and Its Contribution to Speaker Individuality, 2014
- Author(s)
  Kanae Amino, Hisanori Makinae, Tatsuya Kitamura
- Organizer
  Interspeech 2014
- Place of Presentation
  Singapore
- Year and Date
  2014-09-14 – 2014-09-18
- Related Report
  2014 Research-status Report
[Presentation] Nasality in Oral Sounds? -Perception and Analysis of Oro-Nasal Signals-, 2014
- Author(s)
  Kanae Amino, Hisanori Makinae, Tatsuya Kitamura
- Organizer
  Acoustical Society of Japan 2014 Spring Meeting
- Place of Presentation
  Nihon University (Chiyoda, Tokyo)
- Related Report
  2013 Research-status Report
[Presentation] Reproducibility of Vowel Devoicing Within the Same Speaker (同一話者内における母音の無声化の再現性), 2014
- Author(s)
  網野加苗、蒔苗久則、鎌田敏明、長内隆
- Organizer
  Acoustical Society of Japan 2014 Spring Meeting
- Place of Presentation
  Nihon University (Chiyoda, Tokyo)
- Related Report
  2013 Research-status Report
[Presentation] Effects of the Birthplaces of Speakers and Their Parents on Vowel Devoicing (本人および両親の出身地が母音の無声化に与える影響), 2013
- Author(s)
  網野加苗、蒔苗久則、鎌田敏明、長内　隆
- Organizer
  Acoustical Society of Japan 2013 Autumn Meeting
- Place of Presentation
  Toyohashi University of Technology (Toyohashi, Aichi)
- Related Report
  2013 Research-status Report
[Presentation] Suppression of Impulsive Noise Using Non-negative Matrix Factorization (非負値行列因子分解を用いたインパルス性雑音の抑圧), 2013
- Author(s)
  蒔苗久則、網野加苗、鎌田敏明、長内隆
- Organizer
  19th Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Hotel Floracion Aoyama (Minato, Tokyo)
- Related Report
  2013 Research-status Report
[Presentation] Statistical Evaluation of a Speaker Verification Method Using Formant Frequencies (フォルマント周波数を用いた話者照合法の統計的評価), 2013
- Author(s)
  四宮康治、蒔苗久則、網野加苗、鎌田敏明、長内隆、伊藤仁
- Organizer
  19th Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Hotel Floracion Aoyama (Minato, Tokyo)
- Related Report
  2013 Research-status Report
[Presentation] A Study of Vowel Devoicing Frequency and Speakers' Birthplaces (母音の無声化頻度と話者の出身地に関する考察), 2013
- Author(s)
  網野加苗
- Organizer
  Acoustical Society of Japan 2012 Spring Meeting
- Place of Presentation
  Tokyo University of Technology (Tokyo)
- Related Report
  2012 Research-status Report
[Presentation] Intelligibility Enhancement for Non-Stationary Noise Using Non-negative Matrix Factorization (非負値行列因子分解を用いた非定常雑音の明瞭化), 2012
- Author(s)
  蒔苗久則
- Organizer
  18th Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Hotel Floracion Aoyama (Tokyo)
- Related Report
  2012 Research-status Report
