2009 Fiscal Year Annual Research Report

Vertically Integrated Research on a Perceptual-Recognition Mobile Processor for Next-Generation Wearable Computers

Research Project

Project/Area Number	18200003
Research Institution	Kobe University
Principal Investigator	Masahiko Yoshimoto (吉本雅彦), Kobe University, Graduate School of Engineering, Professor (30324099)
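Co-Investigator(Kenkyū-buntansha)	Masahiko Tsukamoto (塚本昌彦), Kobe University, Graduate School of Engineering, Professor (60273588); Yasuo Ariki (有木康雄), Kobe University, Research Center for Urban Safety and Security, Professor (10135519); Tetsuya Takiguchi (滝口哲也), Kobe University, Graduate School of Engineering, Lecturer (40397815); Hiroshi Kawaguchi (川口博), Kobe University, Graduate School of Engineering, Associate Professor (00361642)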
Keywords	Wearable / Image recognition / Speech recognition / Network / VLSI / Memory
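Research Abstract	This research aims to establish the fundamental technologies for an ultra-low-power unified perception processor core (Unified Perception Processor: UPP), specialized for next-generation wearable computers, that performs perceptual recognition by integrating visual, speech, and language processing. On image recognition algorithms, we studied segmenting and recognizing the objects that appear in an image, recognizing human actions from still images and video, and estimating 3D human posture from 2D images. These techniques allow a wearable perception device to perform visual recognition correctly in real environments. On spoken language processing algorithms, we studied estimating the direction of arrival of speech, accurately extracting speech features even under noise, and detecting and correcting errors in speech. These techniques allow a wearable perception device to detect speech and recognize it with high accuracy in real environments. At the VLSI architecture layer, we developed a parallel architecture that extracts SIFT features from HDTV images in real time. We designed a VLSI with high power efficiency, completed a prototype (65 nm CMOS), and achieved a 98.6% power reduction relative to conventional technology. We also completed development of a VLSI architecture for real-time continuous speech recognition with a 20,000-word vocabulary. We introduced highly parallel GMM computation, an improved Viterbi algorithm, a cache, and a two-stage pipeline of GMM and Viterbi computation. By implementing the proposed architecture on an FPGA, we reduced the clock frequency required for real-time operation with 20,000 words by 32% compared with the conventional architecture and confirmed operation at 41.71 MHz. Furthermore, we built a rule-based engine that continuously applies image processing to the computer desktop and triggers application actions when predefined patterns appear. Used in a wearable environment together with a wearable camera and an HMD, it makes annotation in real space and services linked to real space easier to build. The image processing currently uses simple template matching, but software-level integration with the SIFT processing engine developed in this research has also been completed. Combining it with the LSI described above to achieve high-speed operation will make practical real-world applications possible. Through the above, we have established the fundamental technologies for realizing an ultra-low-power unified perception processor.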

Research Products
(10 results)

All Journal Article (5 results) (of which Peer Reviewed: 4 results) Presentation (5 results)

[Journal Article] Monaural sound-source-direction estimation using the acoustic transfer function of a parabolic reflection board (2010)
- Author(s)
  Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki
- Journal Title
  Journal of the Acoustical Society of America Volume 127, Issue 2
  Pages: 902-908
- Peer Reviewed
[Journal Article] A Dependable SRAM with 7T/14T Memory Cells (2009)
- Author(s)
  H. Fujiwara, S. Okumura, Y. Iguchi, H. Noguchi, H. Kawaguchi, M. Yoshimoto
- Journal Title
  IEICE Transactions on Electronics vol. E92-C, no. 4
  Pages: 423-432
- Peer Reviewed
[Journal Article] SPEECH FEATURE EXTRACTION USING WEIGHTED HIGHER-ORDER LOCAL AUTO-CORRELATION (2009)
- Author(s)
  Yasuo Ariki, Tetsuya Takiguchi, Takashi Muroi, Ryoichi Takashima
- Journal Title
  Far East Journal of Electronics and Communications Volume 3, Issue 2
  Pages: 125-140
- Peer Reviewed
[Journal Article] Graph Cuts Segmentation by Using Local Texture Features of Multiresolution Analysis (2009)
- Author(s)
  Keita Fukuda, Tetsuya Takiguchi, Yasuo Ariki
- Journal Title
  IEICE Transactions on Information and Systems Vol. E92-D, No. 7
  Pages: 1453-1461
- Peer Reviewed
[Journal Article] (2009)
- Author(s)
  K. Onishi, T. Takiguchi, Y. Ariki (edited by Peng-Yeng Yin)
- Journal Title
  Pattern Recognition, Chapter 16, 3D Human Posture Estimation Using HOG Features of Monocular Images (contributed chapter; I-Tech Education and Publishing)
  Pages: 295-304
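[Presentation] Design and Implementation of an Application Control Framework Based on Situation Recognition Using Image Information (2009)
- Author(s)
  Yusuke Kurita (栗田雄介), Tsutomu Terada (寺田努), Masahiko Tsukamoto (塚本昌彦)
- Organizer
  Wearable Computer Research and Development Organization, Ubiquitous Wearable Workshop 2009
- Place of Presentation
  Greenpia Miki (Hyogo Prefecture)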
- Year and Date
  2009-11-27
[Presentation] Human Action Recognition Using HDP by Integrating Motion and Location Information (2009)
- Author(s)
  Yasuo Ariki, Takuya Tonaru, Tetsuya Takiguchi
- Organizer
  Asian Conference on Computer Vision
- Place of Presentation
  Xi'an, China
- Year and Date
  2009-09-27
[Presentation] Parallelized Viterbi Processor for 5,000-Word Large-Vocabulary Real-Time Continuous Speech Recognition FPGA System (2009)
- Author(s)
  T. Fujinaga, K. Miura, H. Noguchi, H. Kawaguchi, M. Yoshimoto
- Organizer
  Proceedings of ISCA Annual Conference of International Speech Communication Association (Interspeech)
- Place of Presentation
  London, UK
- Year and Date
  2009-09-10
[Presentation] Parallelized Viterbi Processor for 5,000-Word Large-Vocabulary Real-Time Continuous Speech Recognition FPGA System (2009)
- Author(s)
  T. Fujinaga, K. Miura, H. Noguchi, H. Kawaguchi, M. Yoshimoto
- Organizer
  Parallelized Viterbi Processor for 5,000-Word Large-Vocabulary Real-Time Continuous Speech Recognition FPGA System
- Place of Presentation
  London, UK
- Year and Date
  2009-09-10
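[Presentation] Design and Implementation of an Application Control Framework Based on Image Recognition (2009)
- Author(s)
  Yusuke Kurita (栗田雄介), Tsutomu Terada (寺田努), Masahiko Tsukamoto (塚本昌彦)
- Organizer
  IPSJ Multimedia, Distributed, Cooperative, and Mobile Symposium (DICOMO 2009)
- Place of Presentation
  Suginoi Hotel (Oita Prefecture)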
- Year and Date
  2009-07-09
