Project/Area Number | 08680333 |
Research Category | Grant-in-Aid for Scientific Research (C) |
Allocation Type | Single-year Grants |
Section | General |
Research Field | Statistical science |
Research Institution | Takachiho University |
Principal Investigator | SIBUYA Masaaki, Takachiho University, Faculty of Commerce, Professor (20146723) |
Project Period (FY) | 1996 – 1998 |
Project Status | Completed (Fiscal Year 1998) |
Budget Amount |
¥2,400,000 (Direct Cost: ¥2,400,000)
Fiscal Year 1998: ¥500,000 (Direct Cost: ¥500,000)
Fiscal Year 1997: ¥800,000 (Direct Cost: ¥800,000)
Fiscal Year 1996: ¥1,100,000 (Direct Cost: ¥1,100,000)
|
Keywords | clustering process / distance between partitions / frequency of frequencies / number of new records / size index / Stirling-Carlitz polynomial / Stirling number of the first kind / the most random partition / Ewens probability distribution / Stirling family of probability distributions / cluster / classification / Stirling number / prediction / center of random partitions |
Research Abstract |
Random partitions appear in the diversity of populations in ecology, the vocabulary of a group in linguistics, and the inelastic collision of particles in dynamics, among other settings, and they are related to Zipf's law in social groups. The theory has recently advanced in population genetics in connection with the random occurrence of new alleles. This project studies statistical problems that arise in applying the model, together with related problems. A central question is the notion of the most random partition of a finite set. This notion can be applied to clustering techniques in multivariate data analysis: a clustering method can be justified only if it classifies meaningful data systematically rather than randomly. A definition is proposed and characterized in several ways. Other topics: Prediction of the number of future new records. Earthquakes are well recorded, and people want safety against the strongest earthquake ever experienced; the problem is how often the record value will be broken. New records partition the sequence of events into intervals, and a model of new records is developed for the prediction. A distance between partitions of a finite set can be defined in analogy with the Hamming distance between binary codes. The distance between two independent random partitions can be used to characterize the random partition. Moreover, the distance can be used to define a center of the random partitions, and hence a sampling distribution of the random partition can be constructed.
|
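The abstract does not spell out the distance between partitions. As a rough illustration only, the sketch below assumes one common Hamming-type choice: encode each partition as a binary vector indexed by unordered pairs of elements (1 if the pair shares a block, 0 otherwise) and count the positions where the two vectors differ. The function names `block_labels` and `partition_distance` are hypothetical, and the project's actual definition may differ.

```python
from itertools import combinations


def block_labels(partition):
    """Map each element to the index of the block that contains it."""
    labels = {}
    for index, block in enumerate(partition):
        for element in block:
            if element in labels:
                raise ValueError(f"element {element!r} appears in more than one block")
            labels[element] = index
    return labels


def partition_distance(p, q):
    """Assumed pair-counting distance, not necessarily the project's definition.

    Counts the unordered pairs of elements that share a block in exactly
    one of the two partitions, i.e. the Hamming distance between the
    pairwise co-membership indicator vectors of p and q.
    """
    labels_p, labels_q = block_labels(p), block_labels(q)
    if labels_p.keys() != labels_q.keys():
        raise ValueError("both partitions must cover the same finite set")
    return sum(
        (labels_p[a] == labels_p[b]) != (labels_q[a] == labels_q[b])
        for a, b in combinations(list(labels_p), 2)
    )


if __name__ == "__main__":
    p = [{1, 2}, {3, 4}]
    q = [{1, 2, 3}, {4}]
    print(partition_distance(p, q))
```

Under this assumed definition the distance is zero exactly when the two partitions coincide, and on a set of n elements it is largest, n(n-1)/2, between the partition into singletons and the partition with a single block.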