Project/Area Number |
07680406
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | Kyushu Institute of Technology |
Principal Investigator |
SHINOHARA Takeshi Kyushu Institute of Technology, Department of Atrificial Intelligence, Professor, 情報工学部, 教授 (60154225)
|
Project Period (FY) |
1995 – 1996
|
Project Status |
Completed (Fiscal Year 1996)
|
Budget Amount *help |
¥2,100,000 (Direct Cost: ¥2,100,000)
Fiscal Year 1996: ¥800,000 (Direct Cost: ¥800,000)
Fiscal Year 1995: ¥1,300,000 (Direct Cost: ¥1,300,000)
|
Keywords | Inductive Inference / Machine Learning / Learning from Positive Examples / Pattern Languages / Elementary Formal Systems / Efficiency of Learning |
Research Abstract |
A pattern is a string consisting of constant symbols and variables. The language of a pattern is the set of constant strings obtained by substituting nonempty constant strings for variables in the pattern. For any fixed k, the class of unions of at most k pattern languages is already shown to be inferable from positive data. We apply a learning algorithm for pattern languages to discover a motif from amino-acid sequences. From only positive examples with the help of an alphsbet indexing, the algorithm succesfully finds sets of patterns, that can be considered as motifs. The class of all the unions of arbitrarily finitely many pattern languages in not inferable, because any constant string defines a singleton set consisting of itself, and the class of unions contains all the finite languages. A proper pattern is a pattern that contains at least one variable. The language of a proper pattern is infinite. In this paper, we consider the class of unions when patterns are restricted to be priper and show that the class is not inferable from positive data. When patterns are restricted not to contain more than l consecutive occurrences of constant symbols for some l, the class of unions is shown to be inferable from positive data.
|