• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A Study of A Classification Method Using Automatic Model Selection For Multimedia Heterogeneous Data

Research Project

Project/Area Number 16300036
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Media informatics/Database
Research InstitutionThe Institute of Statistical Mathematics

Principal Investigator

MATSUI Tomoko  The Institute of Statistical Mathematics, Research Organization and Systems, Associate Professor (10370090)

Co-Investigator(Kenkyū-buntansha) TANABE Kunio  Waseda University, Faculty of Science and Engineering, Professor (50000203)
Project Period (FY) 2004 – 2007
Project Status Completed (Fiscal Year 2007)
Budget Amount *help
¥12,360,000 (Direct Cost: ¥11,400,000、Indirect Cost: ¥960,000)
Fiscal Year 2007: ¥4,160,000 (Direct Cost: ¥3,200,000、Indirect Cost: ¥960,000)
Fiscal Year 2006: ¥2,100,000 (Direct Cost: ¥2,100,000)
Fiscal Year 2005: ¥2,800,000 (Direct Cost: ¥2,800,000)
Fiscal Year 2004: ¥3,300,000 (Direct Cost: ¥3,300,000)
Keywordsmultimedia / classification / model selection / probabilistic model / inductive learning / カーネルマシン / 音声認識 / 話者認識 / 画像認識
Research Abstract

Our objective is to develop a practical classification method bawd on dual penalized regression machines (dPLRMs) for heterogeneous data such as multimedia data including speech, image and text data. dPLRMs are multiclass discrimination machines which has been developed by a investigator of this project, Kunio Tanabe and can handle noisy stochastic data by employing the penalized logistic regression model. We have worked on four specific problems; 1) establishment of a practical classification framework using dPLRMs, 2) investigation on modeling of heterogeneous, variable-length, and massive data sets, 3) investigation of a method to deal with unknown data, and 4) development of the dPLRM software package for wide use. For 1), we established the framework through the experiments to examine the classification and inductive power in dPLRMs with speech and auditory data. For 2), we extended dPLRMs for multiple kernels and investigated a coding method to manage heterogeneous data with different sampling rates. Moreover, we designed a kernel function for time series with variable lengths. For 3), we investigated a method to add a new class for unknown data. For 4), we developed the software package and opened it to the public with a research purpose. Through the 1)-4) investigations, we established the fundamental framework of a classification method based on dPLRMs.

Report

(5 results)
  • 2007 Annual Research Report   Final Research Report Summary
  • 2006 Annual Research Report
  • 2005 Annual Research Report
  • 2004 Annual Research Report
  • Research Products

    (57 results)

All 2008 2007 2006 2005 2004

All Journal Article (57 results) (of which Peer Reviewed: 12 results)

  • [Journal Article] N-best rescoring for speech recognition using penalized logistic regression machines2008

    • Author(s)
      O.Birkenes, T.Matsui, K.Tanabe and T.A.Myrvoll
    • Journal Title

      日本音響学会研究発表会講演論文集 春季 I

      Pages: 5-6

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Speaker Identification Using dPLRM2008

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. The Institute of Statistical Mathematics 53-2

      Pages: 201-210

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] N-besr rescoring for speech recognition using penalized logistic regressin machines2008

    • Author(s)
      O. Birkenes, T. Matsui, K. Tanabe, T. A. Myrvoll
    • Journal Title

      Proc. ASJ Spring Meeting-I

      Pages: 5-6

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] N-best rescoring for speech regocnition using penalized logistic regression machines2008

    • Author(s)
      O. Birkenes, T. Matsui, K. Tanabe and T. A. Myrvoll
    • Journal Title

      日本音響学会2008年春季研究発表会講演論文集 I

    • Related Report
      2007 Annual Research Report
  • [Journal Article] Information fusion using multiple kernel logistic regression with applications to phonetic feature detection2007

    • Author(s)
      T.A.Myrvoll, 松井 知子
    • Journal Title

      日本音響学会研究発表会講演論文集 春季 I

      Pages: 49-50

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Isolated-Word Recognition Using Global Alignment Kernel2007

    • Author(s)
      M.Cuturi, J.-P.Vert, O.Birkenes, 松井 知子
    • Journal Title

      日本音響学会研究発表会講演論文集 春季 I

      Pages: 51-52

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] NII-ISM,Japan at TRECVID 2007:high level feature extraction2007

    • Author(s)
      H.D.Le, S.Satoh, and T.Matsui
    • Journal Title

      Proceedings of TRECVID Workshop 2007

      Pages: 285-292

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] A kernel for time series based on global alignments2007

    • Author(s)
      M.Cuturi, J.-P.Vert, O.Birkenes, and T.Matsui
    • Journal Title

      Proceedings of 2007 ICASSP IEEE International Conference on Acoustics, Speech, and Signal Processing

      Pages: 413-416

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] N-best rescoring for speech recognition using penalized logistic regression machines with garbage class2007

    • Author(s)
      O.Birkenes, T.Matsui, and K.Tanabe, T.A.Myrvoll
    • Journal Title

      Proceedings of 2007 ICASSP IEEE International Conference on Acoustics, Speech, and Signal Processing

      Pages: 449-452

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Information fusion using multiple kernel logistic regression with applications to phonetic feature detection2007

    • Author(s)
      T. A. Myrvoll, T.Matsui
    • Journal Title

      Proc. ASJ Spring Meeting-I

      Pages: 49-50

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Isolated-Word Recognition Using Global Alignment Kernel2007

    • Author(s)
      M. Cuturi, J.-P. Vert, O. Birkenes, T.Matsui
    • Journal Title

      Proc. ASJ Spring Meeting-I

      Pages: 51-52

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] NIT-ISM, Japan at TRECVID 2007 : high level feature extraction2007

    • Author(s)
      H. D. Le, S. Satoh, T. Matsui
    • Journal Title

      Proc. TRECVID Workshop

      Pages: 285-292

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] A kernel for time series based on global alignments2007

    • Author(s)
      M. Cuturi, J.-P. Vert, O. Birkenes, T. Matsui
    • Journal Title

      Proc. ICASSP

      Pages: 413-416

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] N-best rescoring for speech recognition using penalized logistic regression machines with garbage class2007

    • Author(s)
      O. Birkenes, T. Matsui, K. Tanabe, T. A. Myrvoll
    • Journal Title

      Proc. ICASSP

      Pages: 449-452

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] NII-ISM, Japan at TRECVID 2007: high level feature extraction2007

    • Author(s)
      H.D. Le, S. Satoh, and T Matsui
    • Journal Title

      Proceeding of TRECVID 2007 Workshop

    • Related Report
      2007 Annual Research Report
  • [Journal Article] A kernel for time series based on global alignments2007

    • Author(s)
      M. Cuturi, J.-P. Vert, O. Birkenes, and T. Matsui
    • Journal Title

      Proceedings of 2007 ICASSP IEEE International Conference on Acoustics, Speech, and Signal Processing 2

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] N-best restoring for speech recognition using penalized logistic regression machines with garbage class2007

    • Author(s)
      O. Birkenes, T. Matsui, and K. Tanabe
    • Journal Title

      Proceedings of 2007 ICASSP IEEE International Conference on Acoustics, Speech, and Signal Processing 4

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Information fusion using multiple kernel logistic regression with applications to phonetic feature detection2007

    • Author(s)
      T.A.Myrvoll, 松井知子
    • Journal Title

      日本音響学会2007春季研究発表会講演論文集

      Pages: 49-50

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Phoneme recognition using global alignment kernel2007

    • Author(s)
      M.Cuturi, J.-P.Vert, O.Birkenes, 松井知子
    • Journal Title

      日本音響学会2007春季研究発表会講演論文集

      Pages: 51-52

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Comparative Study of Speaker Identification Methods:dPLRM,SVM and GMM2006

    • Author(s)
      T.Matsui and K.Tanabe
    • Journal Title

      IEICE TRANS, INF & SYST., VOL E89-D No.3

      Pages: 1066-1073

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Isolated-Word Recognition with Penalized Logistic Regression Machines2006

    • Author(s)
      O.Birkenes, T.Matsui and K.Tanabe
    • Journal Title

      Proceedings of 2006 ICASSP IEEE International Conference on Acoustics, Speech, and Signal Processing

      Pages: 405-408

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Continuous Speech Recognition with Penalized Logistic Regression Machines2006

    • Author(s)
      O.Birkenes, T.Matsui, K.Tanabe and T.A.Myrvoll
    • Journal Title

      Proceedings of 2006 Norsig Nordic Signal Processing

      Pages: 110-113

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] On a greedy learning algorithm for dPLRM with applications to phonetic feature detection2006

    • Author(s)
      T.A.Myrvoll and T.Matsui
    • Journal Title

      Proceedings of Interspeech 2006-ICSLP

      Pages: 1690-1693

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Shot boundary detection and high-level feature extraction experiments for TRECVID 20062006

    • Author(s)
      M.Naito, K.Matsumoto, M.Shishibori, K.Kita, M.Cuturi, T.Matsui, S.Sato, K.Hoashi, F.Sugaya, and Y.Nakajima
    • Journal Title

      Proceedings of TRECVID Workshop 2006

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Comparative Study of Speaker Identification Methods : dPLRM, SVM and GMM2006

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      IEICE E89-D-3

      Pages: 1066-1073

    • NAID

      110004719382

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Isolated-Word Recognition with Penalized Logistic Regression Machines2006

    • Author(s)
      O. Birkenes, T. Matsui, K. Tanabe
    • Journal Title

      Proc. ICASSP

      Pages: 405-408

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Continuous Speech Recognition with Penalized Logistic Regression Machines2006

    • Author(s)
      O. Birkenes, T. Matsui, K. Tanabe, T. A. Myrvoll
    • Journal Title

      Proc. Norsig

      Pages: 110-113

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] On a greedy learning algorithm for dPLRM with applications to phonetic feature detection2006

    • Author(s)
      T. A. Myrvoll, T. Matsui
    • Journal Title

      Proc. Interspeech

      Pages: 1690-1693

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Shot boundary detection and high-level feature extraction experiments for TRECVID 20062006

    • Author(s)
      M. Naito, K. Matsumoto, M. Shishibori, K. Kita, M. Cuturi, T. Matsui, S. Sato, K. Hoashi, F. Sugaya, Y. Nakajima
    • Journal Title

      Proc. TRECVID Workshop

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Isolated-Word Recognition with Penalized Logistic Regression Machines2006

    • Author(s)
      O.Birkenes, T.Matsui, K.Tanabe
    • Journal Title

      Proceedings of 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing I

      Pages: 405-408

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Continuous Speech Recognition with Penalized Logistic Regression Machines2006

    • Author(s)
      O.Birkenes, T.Matsui, K.Tanabe, T.A.Myrvoll
    • Journal Title

      Proceeding of 7th NORDIC SIGNAL PROCESSING SYMPOSIUM

    • Related Report
      2006 Annual Research Report
  • [Journal Article] On a Greedy Learning Algorithm for dPLRM with Applications to Phonetic Feature Detection2006

    • Author(s)
      T.A.Myrvoll, T.Matsui
    • Journal Title

      Proceedings of Interspeech 2006

      Pages: 1690-1693

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Shot Boundary Detection and High-Level Feature Extraction Experiments for TRECVID 20062006

    • Author(s)
      M.Naito, K.Matsumoto, M.Shishibori, K.Kita, M.Cuturi, T.Matsui, S.Sato, K.Hoashi, F.Sugaya, Y.Nakajima
    • Journal Title

      Proceeding of TRECVID 2006 Workshop

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Comparative Study of Speaker Identification Methods : dPLRM, SVM and GMM2006

    • Author(s)
      T.Matsui, K.Tanabe
    • Journal Title

      IEICE Transactions on Information and Systems E89-D・3

      Pages: 1066-1073

    • NAID

      110004719382

    • Related Report
      2005 Annual Research Report
  • [Journal Article] dPLRMを用いた話者識別2005

    • Author(s)
      松井 知子, 田邉 國士
    • Journal Title

      統計数理 第53巻 2号

      Pages: 201-210

    • NAID

      120006019075

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] dPLRMによる対数パワースペクトルを用いた話者識別2005

    • Author(s)
      松井 知子, 田邉 國士
    • Journal Title

      日本音響学会研究発表会講演論文集 春季 -I

      Pages: 11-12

    • NAID

      10018036951

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] dPLRM-Based Speaker Identification with Log Power Spectrum2005

    • Author(s)
      T.Matsui and K.Tanabe
    • Journal Title

      Proceeding of Interspeech 2005-ICSLP

      Pages: 2017-2020

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] dPLRM:Application to Speaker Identification2005

    • Author(s)
      T.Matsui and K.Tanabe
    • Journal Title

      Proceeding of International Symposium on The Art of Statistical Metaware

      Pages: 171-178

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] 両耳間相関関数を用いない音源方向推定2005

    • Author(s)
      松井 知子, 田邉 國士, 入野 俊夫
    • Journal Title

      日本音響学会研究発表会講演論文集 秋季 -I

      Pages: 713-714

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Speaker IndetificationUsing Log-power spectrum2005

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. ASJ Spring Meeting-I

      Pages: 11-12

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] dPLRM-Based Speaker Identification with Log Power Spectrum2005

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. Interspeech

      Pages: 2017-2020

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] dPLRM : Application to Speaker Identification2005

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. International Symposium on The Art of Statistical Metaware

      Pages: 171-178

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Binaural Sound Source Localization Without Using Interaural Cross-Correlation2005

    • Author(s)
      T. Matsui, K. Tanabe, T. Irino
    • Journal Title

      Proc. ASJ Fall Meeting-I

      Pages: 713-714

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] dPLRM-Based Speaker Identification with Log Power Spectrum2005

    • Author(s)
      T.Matsui, K.Tanabe
    • Journal Title

      Proc.Interspeech 2005

      Pages: 2017-2020

    • Related Report
      2005 Annual Research Report
  • [Journal Article] dPLRM : Application to Speaker Identification2005

    • Author(s)
      T.Matsui, K.Tanabe
    • Journal Title

      Proc.International Symposium on The Art of Statistical Metaware

      Pages: 171-178

    • Related Report
      2005 Annual Research Report
  • [Journal Article] 両耳間相関関数を用いない音源方向推定2005

    • Author(s)
      松井知子, 田邉國士, 入野俊夫
    • Journal Title

      日本音響学会研究発表会講演論文集 秋季・I

      Pages: 713-714

    • Related Report
      2005 Annual Research Report
  • [Journal Article] dPLRMを用いた話者識別2005

    • Author(s)
      松井知子, 潤邉國士
    • Journal Title

      統計数理 53・2

      Pages: 201-210

    • NAID

      120006019075

    • Related Report
      2005 Annual Research Report
  • [Journal Article] dPLRMによる対数パワースペクトルを用いた話者識別2005

    • Author(s)
      松井知子, 田邉國士
    • Journal Title

      日本音響学会研究発表会講演論文集 春季・I

      Pages: 87-88

    • NAID

      10018036951

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 罰金付きロジスティック回帰マシンによる話者識別2004

    • Author(s)
      松井 知子, 田邉 國士
    • Journal Title

      日本音響学会研究発表会講演論文集 秋季 -I

      Pages: 87-88

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Probabilistic Speaker Identification with dual Penalized Logistic Regression Machine2004

    • Author(s)
      T.Matsui and K.Tanabe
    • Journal Title

      Proceeding of Interspeech 2004-ICSLP -III

      Pages: 1797-1800

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Speaker Recognition without Feature Extraction Process2004

    • Author(s)
      T.Matsui and K.Tanabe
    • Journal Title

      Proceeding of Workshop on Statistical Modeling Approach for Speech Recognition

      Pages: 79-84

    • NAID

      110003278791

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2007 Final Research Report Summary
    • Peer Reviewed
  • [Journal Article] Speaker Identification using penalized logistic regression machine2004

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. ASJ Fall Meeting S-I

      Pages: 87-88

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Probabilistic Speaker Identification with dual Penalized Logistic Regression Machine2004

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. ICSLP, III

      Pages: 1797-1800

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] Speaker Recognition without Feature Extraction Process2004

    • Author(s)
      T. Matsui, K. Tanabe
    • Journal Title

      Proc. Workshop on Statistical Modeling Approach for Speech Recognition

      Pages: 79-84

    • NAID

      110003278791

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2007 Final Research Report Summary
  • [Journal Article] 罰金付きロジスティック回帰マシンによる話者識別2004

    • Author(s)
      松井知子, 田邉國士
    • Journal Title

      日本音響学会研究発表会講演論文集 秋季・I

      Pages: 11-12

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Probabilistic Speaker Identification with dual Penalized Logistic Regression Machine2004

    • Author(s)
      T.Matsui, K.Tanabe
    • Journal Title

      Proc.8th International Conference on Spoken Language Processing

      Pages: 1797-1800

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Speaker Recognition without Feature Extraction Process2004

    • Author(s)
      T.Matsui, K.Tanabe
    • Journal Title

      Proc.Workshop on Statisitcal Modeling Approach for Speech Recognition

      Pages: 79-84

    • NAID

      110003278791

    • Related Report
      2004 Annual Research Report

URL: 

Published: 2004-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi