2003 Fiscal Year Annual Research Report

Research on Automatic Conversion of GDA Document Tags and Development of Its Application Systems

Research Project

Project/Area Number	13558037
Research Institution	KYOTO UNIVERSITY
Principal Investigator	Hiroshi G. Okuno, Kyoto University, Graduate School of Informatics, Professor (60318201)
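Co-Investigator(Kenkyū-buntansha)	Koiti Hasida, National Institute of Advanced Industrial Science and Technology (AIST), Cyber Assist Research Center, Deputy Director; Satoshi Sato, Kyoto University, Graduate School of Informatics, Associate Professor (30205918); Tatsuya Kawahara, Kyoto University, Academic Center for Computing and Media Studies, Professor (00234104); Kazunori Komatani, Kyoto University, Graduate School of Informatics, Research Associate (40362579)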
Keywords	document tags / Global Data Annotation (GDA) / MPEG-7 / semantic structure description scheme / meeting record indexing / Linguistic Description Scheme / MPEG-7 music descriptors / privacy-oriented access mechanism
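Research Abstract	In the final year, as planned, work focused on large-scale data tagging with GDA (Global Data Annotation) and the standardization of GDA, speaker indexing of spoken meeting records, MPEG-7 tagging of music information, and privacy-oriented access methods.
(1) Large-scale data tagging with GDA: Continuing from the previous year, GDA-based tagging of anaphora and coreference in newspaper article data was carried out as contracted work. Detailed GDA tags were assigned to 50 articles selected from ten years of Mainichi Shimbun articles, bringing the total to 200 GDA-tagged articles over three years. Release through AIST is planned.
(2) Incorporating GDA concepts into the MPEG-7 standard: For Linguistic DS, the MPEG-7 semantic structure description scheme (MDS) for linguistic data, a proposal based on GDA and UNL (Universal Networking Language) was incorporated into the WD (Working Draft) of the second edition of MPEG-7. Its consistency with the standardization of language resource management under way in ISO/TC37/SC4 is currently being coordinated.
(3) MPEG-7 tagging of music information: MPEG-7 left the method of describing music data to the user. We pointed out that this approach carries the same inherent problems as ontologies in artificial intelligence, and developed a method for automatically constructing instrument categories for musical instrument sound identification, together with a method for identifying unknown instruments.
(4) Automatic speaker indexing of spoken meeting records: As a form of automatic tagging of meeting records, we addressed recorded meetings in which multiple speakers take turns. By acquiring a many-speaker model through pre-training on a large amount of dialogue data, and by handling technical terms with domain knowledge obtained from the stenographic records of the House of Representatives, we developed high-performance speaker indexing based on speaker recognition.
(5) Privacy-oriented access: Access to linguistic data, meeting records, and music data will increasingly emphasize security, and as a result the privacy protection traditionally valued in libraries may be neglected. To demonstrate the effectiveness of privacy-oriented access, we developed a method based on SPKI (Simple Public Key Infrastructure) and implemented it on a Web server.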

Research Products
(25 results)

All Publications (25 results)

[Publications] 渡邊太郎, 今村, 隅田英一郎, 奥乃博: "階層的アライメントを用いた統計的機械翻訳"電子情報通信学会論文誌. Vol.45, No.4(印刷中). (2004)
[Publications] 秋田祐哉, 河原達也: "多数話者モデルを用いた討論音声の教師なし話者インデキシング"電子情報通信学会論文誌. Vol.87, No.2. 495-503 (2004)
[Publications] 北原鉄朗, 後藤真孝, 奥乃博: "音響的類似性を反映した楽器の階層表現の獲得とそれに基づく未知楽器のカテゴリーレベルの音源同定"情報処理学会論文誌. Vol.45, No.3. 680-689 (2004)
[Publications] 渡邊太郎, 隅田英一郎, 奥乃博: "生成方向を考慮した統計的機械翻訳のためのデコーディングアルゴリズム"情報処理学会論文誌. Vol.44, No.12. 3202-3210 (2003)
[Publications] 山肩洋子, 河原達也, 奥乃博, 美濃導彦: "音声対話システムにおける物体指示のための信念ネットワークを用いた曖昧性の解消"人工知能学会誌. Vol.19, No.1F. 47-56 (2004)
[Publications] 北原鉄朗, 後藤真孝, 奥乃博: "音高による音色変化に着目した楽器音の音源同定:F0依存多次元正規分布に基づく識別手法"情報処理学会論文誌. Vol.44, No.10. 2448-2458 (2003)
[Publications] 駒谷和範, 鹿島博晶, 田中克明, 河原達也: "複合的言語制約に基づくキーフレーズ検出を用いた汎用的なデータベース検索音声対話プラットフォーム"情報処理学会論文誌. Vol.44, No.5. 1333-1342 (2003)
[Publications] 奥乃博, 中臺一博: "ロボット聴覚の課題と現状"情報処理. Vol.44, No.11. 1138-1144 (2003)
[Publications] Kazunori Komatani, S.Ueno, Tatsuya Kawahara, Hiroshi G.Okuno: "User modeling in Spoken Dialogue Systems for Flexible Guidance Generation"Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003). 745-748 (2003)
[Publications] Takamichi Saito, Toshio Kito, Kentaro Umesawa, Hiroshi G.Okuno: "Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server"Proceedings of the Seventeenth International Conference on Advanced Information Networking and Applications (AINA'03). 696-703 (2003)
[Publications] Kazushi Ishihara, Yasushi Tsubota, Hiroshi G.Okuno: "Automatic Transformation of Environmental Sounds into Sound-Imitation Words Based on Japanese Syllable Structure"Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003). 3185-3188 (2003)
[Publications] Kazuhiro Nakadai, D.Matsuura, Hiroshi G.Okuno, Hiroshi Tsujino: "Three Simultaneous Speech Recognition by Integration of Active Audition and Face Recognition for Humanoid"Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003). 2705-2708 (2003)
[Publications] Tatsuya Kawahara, Ryosuke Ito, Kazunori Komatani: "Spoken Dialogue System for Queries on Appliance Manuals using Hierarchical Confirmation Strategy"Proceedings of the Eighth European Conference on Speech Communication and Technology (Eurospeech-2003). 1701-1704 (2003)
[Publications] Yohei Sakuraba, Hiroshi G.Okuno: "Note Recognition of Polyphonic Music by Using Timbre Similarity and Direction Proximity"Proceedings of International Computer Music Conference (ICMC2003). 167-170 (2003)
[Publications] Kazunori Komatani, S.Ueno, Tatsuya Kawahara, Hiroshi G.Okuno: "Flexible Guidance Generation using User Model in Spoken Dialogue Systems"Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL2003). 256-263 (2003)
[Publications] Kazunori Komatani, F.Adachi, S.Ueno, T.Kawahara, H.G.Okuno: "Flexible Spoken Dialogue System based on User Models and Dynamic Generation of VoiceXML Scripts"Proceedings of 4th SIGdial Workshop on Discourse and Dialogue. 87-96 (2003)
[Publications] Tetsuro Kitahara, Masataka Goto, Hiroshi G.Okuno: "Pitch-dependent Musical Instrument Identification and Its Application to Musical Sound Ontology"Developments in Applied Artificial Intelligence. LNAI2718. 112-122 (2003)
[Publications] Tetsuro Kitahara, Masataka Goto, Hiroshi G.Okuno: "Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution"Proceedings of 2003 International Conference on Multimedia and Expo (ICME2003). Vol.III. 405-409 (2003)
[Publications] Hiroshi G.Okuno, Kazuhiro Nakadai, Hiroaki Kitano: "Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction"Developments in Applied Artificial Intelligence. LNAI2718. 662-673 (2003)
[Publications] Tetsuro Kitahara, Masataka Goto, Hiroshi G.Okuno: "Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution"Proceedings of 2003 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2003). Vol.5 Vol.III. 421-424 (2003)
[Publications] Tetsuro Kitahara, Masataka Goto, Hiroshi G.Okuno: "Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution"Proceedings of 2003 International Conference on Acoustics, Speech and Signal Processing (ICASSP'2003). Vol.5 Vol.III. 421-424 (2003)
[Publications] 橋田浩一: "知的符号化"人工知能学会誌. Vol.18, No.3. 251-258 (2003)
[Publications] 橋田浩一: "アノテーションに基づく知的生産支援"第5回知識科学シンポジウム「知的創造のプロセス、場、およびシステム化」. (口頭発表). (2003)
[Publications] 奥乃博: "AI辞典、第2版"共立出版. 544 (2003)
[Publications] Koiti Hasida, John R.Smith (eds.): "Information technology. Multimedia content description interface. Part 5 : Multimedia description schemes AMENDMENT 1 : Multimedia description schemes extensions. FDAM 1"ISO/IEC15938-5. 70 (2004)
