2023 Fiscal Year Research-status Report

Machine Learning for Structure-Rich Data-Scarce Domains

Research Project

Project/Area Number 22K12150
Research Institution Kyoto University

Principal Investigator

NGUYEN Canh・Hao  Kyoto University, Institute for Chemical Research, Senior Lecturer (90626889)

Project Period (FY) 2022-04-01 – 2025-03-31
Keywords Graph neural networks / Convex clustering
Outline of Annual Research Achievements

This year, we worked on data representations that remain faithful to the original features while also exhibiting cluster structure. We investigated convex clustering as a way to obtain such a representation by solving a convex program, which can be solved efficiently and to global optimality.

The key idea is to assume that the data follow a cluster structure, which we recover by convex clustering. One advantage of convex clustering is that it is formulated as a convex program, so a global optimum is guaranteed. Another is that it is a relaxation of both k-means and agglomerative clustering, and so potentially offers advantages of both algorithms.
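
For reference, convex clustering is commonly written in the literature as the following program. The report does not state which variant was analyzed, so the symbols here are assumptions for illustration: x_i are the data points, u_i the centroid assigned to point i, w_ij nonnegative pair weights, and λ a regularization parameter.

```latex
\min_{u_1,\dots,u_n \in \mathbb{R}^d}\;
\frac{1}{2}\sum_{i=1}^{n} \lVert x_i - u_i \rVert_2^2
\;+\; \lambda \sum_{i<j} w_{ij}\, \lVert u_i - u_j \rVert_2
```

Points whose centroids coincide at the optimum form one cluster. With λ = 0 every point is its own cluster, and for sufficiently large λ (with positive weights) all centroids fuse into one. The first term is the k-means-style fit to the data, while increasing λ traces a sequence of fusions that loosely resembles agglomerative merging.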

Our main work is an analytical characterization of the clusters obtained by convex clustering, and of its pros and cons compared with the other two algorithms. We found that convex clustering can only learn convex clusters. This is similar to k-means and different from agglomerative clustering. We also found that the clusters can be bounded in balls, which makes them round-shaped, and that there are gaps between them. These properties show that convex clustering finds rather specific types of clusters and is rather inflexible compared with the other algorithms.
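
To make the behavior described above concrete, here is a minimal sketch that solves a small convex clustering problem numerically. It is not the method of analysis used in the project: the uniform weights over all pairs, the function name `convex_clustering`, the parameter `lam`, and the solver (projected gradient ascent on the dual problem) are all assumptions chosen for illustration.

```python
import math
from itertools import combinations


def convex_clustering(points, lam, n_iter=500):
    """Approximately minimise
        0.5 * sum_i ||x_i - u_i||^2 + lam * sum_{i<j} ||u_i - u_j||_2
    (uniform weights on all pairs; an illustrative assumption)
    by projected gradient ascent on the dual problem."""
    n, d = len(points), len(points[0])
    edges = list(combinations(range(n), 2))
    dual = {e: [0.0] * d for e in edges}  # one dual vector per pair (i, j)
    step = 1.0 / n  # safe step: the complete-graph Laplacian has top eigenvalue n
    u = [list(p) for p in points]
    for _ in range(n_iter):
        # Primal update: u_i = x_i + sum_{j>i} dual_ij - sum_{j<i} dual_ji
        u = [list(p) for p in points]
        for i, j in edges:
            for k in range(d):
                u[i][k] += dual[(i, j)][k]
                u[j][k] -= dual[(i, j)][k]
        # Dual update: gradient step, then projection onto the ball of radius lam
        for i, j in edges:
            new = [dual[(i, j)][k] - step * (u[i][k] - u[j][k]) for k in range(d)]
            norm = math.sqrt(sum(v * v for v in new))
            scale = min(1.0, lam / norm) if norm > 0 else 1.0
            dual[(i, j)] = [scale * v for v in new]
    return u


def count_clusters(centroids, digits=4):
    """Count distinct centroids after rounding; equal centroids mean one cluster."""
    return len({tuple(round(v, digits) for v in c) for c in centroids})


if __name__ == "__main__":
    # Two well-separated groups on a line (toy data, not from the project).
    pts = [[0.0], [0.1], [0.2], [10.0], [10.1], [10.2]]
    for lam in (0.0, 0.5, 100.0):
        u = convex_clustering(pts, lam)
        print(lam, count_clusters(u), [round(c[0], 3) for c in u])
```

On this toy data the intermediate setting fuses each group into a single centroid while keeping the two groups apart, which is the kind of compact, well-separated cluster described above. A group that is not convex, such as a ring around another group, could not be recovered as one cluster this way.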

Current Status of Research Progress

3: Progress in research has been slightly delayed.

Reason

We are working on a particular problem whose difficulty lies in understanding the formulation of convex clustering, which has not been well studied before.

Strategy for Future Research Activity

We plan to continue working on finding suitable representations of data that combine the original features with additional information such as graphs, and that are guaranteed to extract more information than currently used methods.

Causes of Carryover

We did not proceed with travel and the purchase of articles as set out in the research plan.

Published: 2024-12-25  
