Research on speaker recognition method that is robust to the differences in speaking styles and timing of recording speech using the Standardized-Normalization Transformation

Research Project

Project/Area Number	25350488
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	General
Research Field	Social systems engineering/Safety system
Research Institution	National Research Institute of Police Science
Principal Investigator	Osanai Takashi National Research Institute of Police Science, Fourth Department of Forensic Science, Director (70392264)
Research Collaborator	Kamada Toshiaki, Makinae Hisanori, Amino Kanae
Project Period (FY)	2013-04-01 – 2019-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount	¥4,160,000 (Direct Cost: ¥3,200,000, Indirect Cost: ¥960,000); Fiscal Year 2017: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000); Fiscal Year 2016: ¥780,000 (Direct Cost: ¥600,000, Indirect Cost: ¥180,000); Fiscal Year 2015: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000); Fiscal Year 2014: ¥780,000 (Direct Cost: ¥600,000, Indirect Cost: ¥180,000); Fiscal Year 2013: ¥780,000 (Direct Cost: ¥600,000, Indirect Cost: ¥180,000)
Keywords	speaker recognition / speaking style / temporal variation / feature transformation / criminal investigation support
Outline of Final Research Achievements	In speaker recognition, differences between speech samples in recording environment, speaking style, and time of recording are said to be among the factors that degrade authentication performance. We therefore investigated a speaker recognition method that is robust to such differences. The research used several speech databases that we had constructed previously, together with the Standardization-Normalization Transformation, which we proposed in earlier work and which had proved effective in improving speaker recognition performance. The experimental results showed that the Standardization-Normalization Transformation is an effective method for overcoming these differences in recorded speech data.
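Academic Significance and Societal Importance of the Research Achievements	In forensic science, which often deals with uncooperative speakers, there is strong demand for speaker recognition that can adapt to diverse speech materials. For example, in bank transfer fraud cases, establishing that several offences were committed by the same perpetrator requires comparing the perpetrators' voices to judge whether they belong to the same speaker. However, the conversations differ from case to case, and speaking styles often vary as well, for instance when the offender impersonates a related party. The results of this research are expected to make it possible to show that the perpetrators in separate cases are the same person even when the speech materials are diverse.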

Report

(7 results)

2018 Annual Research Report, Final Research Report (PDF)
2017 Research-status Report
2016 Research-status Report
2015 Research-status Report
2014 Research-status Report
2013 Research-status Report

Research Products
(27 results)

Int'l Joint Research (4 results), Journal Article (6 results; of which Peer Reviewed: 5 results, Open Access: 1 result, Acknowledgement Compliant: 1 result), Presentation (17 results; of which Int'l Joint Research: 3 results)

[Int'l Joint Research] Australian National University (Australia)
- Related Report
  2018 Annual Research Report
[Int'l Joint Research] Australian National University (Australia)
- Related Report
  2017 Research-status Report
[Int'l Joint Research] Australian National University (Australia)
- Related Report
  2016 Research-status Report
[Int'l Joint Research] Australian National University (Australia)
- Related Report
  2015 Research-status Report
[Journal Article] Reference data on Japanese vowel devoicing: Effects of speakers' and parents' places of origin and within-speaker reproducibility (2018)
- Author(s)
  Amino Kanae, Makinae Hisanori, Kamada Toshiaki, Osanai Takashi
- Journal Title
  Acoustical Science and Technology
  Volume: 39 Issue: 3 Pages: 207-214
- DOI
  10.1250/ast.39.207
- NAID
  130006730816
- ISSN
  0369-4232, 1346-3969, 1347-5177
- Year and Date
  2018-05-01
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Forensic science of sound (2016)
- Author(s)
  Osanai Takashi, Makinae Hisanori, Amino Kanae
- Journal Title
  The Journal of the Acoustical Society of Japan
  Volume: 72 Pages: 74-80
- Related Report
  2015 Research-status Report
- Peer Reviewed
[Journal Article] Cross-language differences of articulation rate and its transfer into Japanese as a second language (2015)
- Author(s)
  Kanae Amino, Takashi Osanai
- Journal Title
  Forensic Science International
  Volume: 249 Pages: 116-122
- DOI
  10.1016/j.forsciint.2015.01.029
- Related Report
  2015 Research-status Report
- Peer Reviewed / Acknowledgement Compliant
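[Journal Article] Characteristics of formant frequencies extracted from isolated vowels uttered by a large number of speakers (2014)
- Author(s)
  Kamada Toshiaki, Makinae Hisanori, Amino Kanae, Osanai Takashi
- Journal Title
  Reports of the National Research Institute of Police Science
  Volume: 63(1) Pages: 19-23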
- NAID
  40020571764
- Related Report
  2013 Research-status Report
- Peer Reviewed
[Journal Article] Native vs. non-native accent identification using Japanese spoken telephone numbers (2014)
- Author(s)
  Kanae Amino, Takashi Osanai
- Journal Title
  Speech Communication
  Volume: 56 Pages: 70-81
- DOI
  10.1016/j.specom.2013.07.010
- Related Report
  2013 Research-status Report
- Peer Reviewed
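[Journal Article] Trends in speaker recognition in the field of forensic science (2013)
- Author(s)
  Osanai Takashi, Ishihara Shunichi
- Journal Title
  The Journal of the Acoustical Society of Japan
  Volume: 69(7) Pages: 365-370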
- Related Report
  2013 Research-status Report
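[Presentation] Effect of the Standardization-Normalization Transformation in speaker verification using word utterances recorded at different times (2018)
- Author(s)
  Osanai Takashi, Amino Kanae, Makinae Hisanori, Kamada Toshiaki
- Organizer
  24th Annual Meeting of the Japanese Association of Forensic Science and Technology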
- Related Report
  2018 Annual Research Report
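[Presentation] Regional information and speaker classification using vocal tract resonance characteristics (2018)
- Author(s)
  Kamada Toshiaki, Makinae Hisanori, Amino Kanae, Osanai Takashi
- Organizer
  24th Annual Meeting of the Japanese Association of Forensic Science and Technology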
- Related Report
  2018 Annual Research Report
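[Presentation] Influence of standard-language forms on the estimation of regional origin from linguistic forms (2018)
- Author(s)
  Amino Kanae, Makinae Hisanori, Kamada Toshiaki, Osanai Takashi
- Organizer
  24th Annual Meeting of the Japanese Association of Forensic Science and Technology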
- Related Report
  2018 Annual Research Report
[Presentation] Exploring sub-band cepstral distances for more robust speaker classification (2018)
- Author(s)
  Takashi Osanai, Yuko Kinoshita, Frantz Clermont
- Organizer
  17th Australasian International Conference on Speech Science and Technology
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Forensic voice comparison using sub-band cepstral distances as features: A first attempt with vowels from 306 Japanese speakers under channel mismatch conditions (2018)
- Author(s)
  Yuko Kinoshita, Takashi Osanai, Frantz Clermont
- Organizer
  17th Australasian International Conference on Speech Science and Technology
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
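[Presentation] Preliminary study on the influence of speaking style in speaker verification (2017)
- Author(s)
  Osanai Takashi, Amino Kanae, Makinae Hisanori, Kamada Toshiaki
- Organizer
  23rd Annual Meeting of the Japanese Association of Forensic Science and Technology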
- Related Report
  2017 Research-status Report
[Presentation] Sub-band cepstral variability within and between speakers under microphone and mobile conditions: A preliminary investigation (2016)
- Author(s)
  Frantz Clermont, Yuko Kinoshita, Takashi Osanai
- Organizer
  16th Australasian International Conference on Speech Science & Technology
- Place of Presentation
  Sydney, Australia
- Year and Date
  2016-12-07
- Related Report
  2016 Research-status Report
- Int'l Joint Research
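[Presentation] Effect of the Standardization-Normalization Transformation in speaker verification using word utterances recorded in different environments (2016)
- Author(s)
  Osanai Takashi, Amino Kanae, Makinae Hisanori, Kamada Toshiaki
- Organizer
  22nd Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Nakano Sunplaza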
- Year and Date
  2016-11-10
- Related Report
  2016 Research-status Report
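[Presentation] Comparison of static and dynamic features in speaker recognition (2016)
- Author(s)
  Kamada Toshiaki, Makinae Hisanori, Amino Kanae, Osanai Takashi
- Organizer
  22nd Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Nakano Sunplaza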
- Year and Date
  2016-11-10
- Related Report
  2016 Research-status Report
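[Presentation] Blind noise suppression using a sinusoidal model (2016)
- Author(s)
  Makinae Hisanori, Amino Kanae, Kamada Toshiaki, Osanai Takashi
- Organizer
  22nd Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Nakano Sunplaza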
- Year and Date
  2016-11-10
- Related Report
  2016 Research-status Report
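[Presentation] Effect of the Standardization-Normalization Transformation in speaker verification using speech recorded in different environments (2015)
- Author(s)
  Osanai Takashi, Amino Kanae, Makinae Hisanori, Kamada Toshiaki
- Organizer
  21st Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Kashiwa-no-ha Conference Center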
- Year and Date
  2015-11-12
- Related Report
  2015 Research-status Report
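[Presentation] Discrimination between synthetic and natural speech by listening (2015)
- Author(s)
  Amino Kanae, Makinae Hisanori, Kamada Toshiaki, Osanai Takashi
- Organizer
  21st Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Kashiwa-no-ha Conference Center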
- Year and Date
  2015-11-12
- Related Report
  2015 Research-status Report
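[Presentation] Study on the evaluation of suppression performance for non-stationary noise (2015)
- Author(s)
  Makinae Hisanori, Amino Kanae, Kamada Toshiaki, Osanai Takashi
- Organizer
  21st Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Kashiwa-no-ha Conference Center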
- Year and Date
  2015-11-12
- Related Report
  2015 Research-status Report
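[Presentation] Comparison of speaker verification performance across different speech databases (2014)
- Author(s)
  Osanai Takashi, Amino Kanae, Kamada Toshiaki, Makinae Hisanori
- Organizer
  20th Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Hotel Floracion Aoyama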
- Year and Date
  2014-11-13 – 2014-11-14
- Related Report
  2014 Research-status Report
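[Presentation] Within-speaker reproducibility of vowel devoicing (2014)
- Author(s)
  Amino Kanae, Makinae Hisanori, Kamada Toshiaki, Osanai Takashi
- Organizer
  2014 Spring Meeting of the Acoustical Society of Japan
- Place of Presentation
  Chiyoda, Tokyo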
- Related Report
  2013 Research-status Report
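[Presentation] Effects of speakers' and parents' places of origin on the vowel devoicing rate (2013)
- Author(s)
  Amino Kanae, Makinae Hisanori, Kamada Toshiaki, Osanai Takashi
- Organizer
  2013 Autumn Meeting of the Acoustical Society of Japan
- Place of Presentation
  Toyohashi, Aichi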
- Related Report
  2013 Research-status Report
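[Presentation] Comparison of gender identification performance among acoustic features for continuous speech (2013)
- Author(s)
  Osanai Takashi, Amino Kanae, Kamada Toshiaki, Makinae Hisanori
- Organizer
  19th Annual Meeting of the Japanese Association of Forensic Science and Technology
- Place of Presentation
  Minato, Tokyo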
- Related Report
  2013 Research-status Report
