研究課題/領域番号 |
23K16921
|
研究機関 | 東京大学 |
研究代表者 |
沈 奕超 東京大学, 大学院情報理工学系研究科, 助教 (40969119)
|
研究期間 (年度) |
2023-04-01 – 2028-03-31
|
キーワード | computer graphics / typography / computer vision / icon colorization / virtual try-on |
研究実績の概要 |
In the past one year, I have been working on various projects related to this funding, including designing high-usability icons, icon colorization, procedural shape generation, and foundation model for typography. These projects lead to four journal papers and one conference short paper. The result outcomes exhibits distinct contributions that we focus on providing novel computational models and foundation models for graphic design elements, such as fonts and icons.
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
1: 当初の計画以上に進展している
理由
In the past year, I developed a foundation model for typography which is fundamentally change how multilingual font applications can be achieved. This model can be used to do language-driven font retrieval and vector glyph optimization. Meanwhile, I also developed another foundation model for connecting text descriptions with icons. This foundation model can then be used to design high-usability icons that reduce the effort of icon designers. These two foundation models are very unique compared to other existing foundation models where our models focus on graphic design domains rather than natural scene images.
|
今後の研究の推進方策 |
The development of the foundation model of typography (i.e., FontCLIP) provides a great foundation for me to proceed on related research projects, such as utilize language input to design variable fonts and sentence generation. Meanwhile, with the foundation model for icon, I plan to combine it with a large language model to generate diverse icons that users can use for various graphic design tasks.
|
次年度使用額が生じた理由 |
In the past year, since the computational resources, including the machines and GPUs are still suffice for the project need, so I did not spend that much for purchasing new devices. In this year, since I plan to work improving the foundation models that facilitate language driven content generation, so I plan to purchase new computational devices to utilize the research fundings.
|