Utilizing distributed parallel computation for computer intensive statistical analysis within a heterogeneous computer environment
Project/Area Number |
15500189
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Statistical science
|
Research Institution | Oita University |
Principal Investigator |
OCHI Yoshimichi Oita University, Faculty of Engineering, Professor, 工学部, 教授 (60185618)
|
Co-Investigator(Kenkyū-buntansha) |
OBATA Tsuneshi Oita University, Faculty of Engineering, Assistant Professor, 工学部, 助手 (00244153)
|
Project Period (FY) |
2003 – 2004
|
Project Status |
Completed (Fiscal Year 2004)
|
Budget Amount *help |
¥2,600,000 (Direct Cost: ¥2,600,000)
Fiscal Year 2004: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 2003: ¥1,700,000 (Direct Cost: ¥1,700,000)
|
Keywords | distributed parallel computing / MPI / PVM / task management / computer intensive statistical analysis |
Research Abstract |
We adopted a form of implementations of Beowulf. type PC cluster for the distributed parallel computation environment and studied the actual implementations and pros and cons of the use of such implementation. In order to install such system, we reorganized the network system and computer resource in our laboratories and established the distributed parallel computation environment with PVM and MPI(here we used "lam" as an actual implementation) with those computers. We developed a task management method "Time Prediction Model", which control tasks with their expected run time as a manager-worker model and the method was compared with "Work Pool Methods", in order to control tasks within our heterogeneous computing resource. Based on the results given here, we also considered a fundamental design of an integrated application development environment for the distributed parallel environment, which would support computationally intensive statistical investigation, such as Bootstrap and simulation based analyses. For the algorisms, we first investigated some basic calculation tools, such as matrix multiplications and inversion. Especially for multiplication of larger matrices, several methods, such as column/row wise matrix splitting and Fox algorithm with checker board arrangements are investigated regarding the memory and CPU resource configuration. Furthermore, programs for the bootstrapping and tree based methods were reviewed and their plausibility for the parallel processing, granularity issues and coherency for the task management methods were investigated. Based on these considerations, we installed a distributed parallel processing simulation platform within the heterogeneous computer resources and confirmed its efficiency for use of computer intensive analysis for real data analysis and their performance evaluation, via analyses of multinomial categorical data with over-dispersion problem.
|
Report
(3 results)
Research Products
(12 results)