Language-driven visual content customization using construction sequences.

Research Project

Project/Area Number	23K16921
Research Category	Grant-in-Aid for Early-Career Scientists
Allocation Type	Multi-year Fund
Review Section	Basic Section 61020:Human interface and interaction-related
Research Institution	The University of Tokyo
Principal Investigator	沈奕超東京大学, 大学院情報理工学系研究科, 助教 (40969119)
Project Period (FY)	2023-04-01 – 2028-03-31
Project Status	Granted (Fiscal Year 2023)
Budget Amount *help	¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000) Fiscal Year 2027: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2026: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2025: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000) Fiscal Year 2024: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000) Fiscal Year 2023: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000)
Keywords	computer graphics / typography / computer vision / icon colorization / virtual try-on / machine learning / 3D shape reconstruction
Outline of Research at the Start	This project will focus on inferring the 3D shape construction sequence and connecting them to natural language descriptions. I will collect construction sequences, design algorithms to generate construction sequences, and establish latent space between construction sequences and natural languages.
Outline of Annual Research Achievements	In the past one year, I have been working on various projects related to this funding, including designing high-usability icons, icon colorization, procedural shape generation, and foundation model for typography. These projects lead to four journal papers and one conference short paper. The result outcomes exhibits distinct contributions that we focus on providing novel computational models and foundation models for graphic design elements, such as fonts and icons.
Current Status of Research Progress	Current Status of Research Progress 1: Research has progressed more than it was originally planned. Reason In the past year, I developed a foundation model for typography which is fundamentally change how multilingual font applications can be achieved. This model can be used to do language-driven font retrieval and vector glyph optimization. Meanwhile, I also developed another foundation model for connecting text descriptions with icons. This foundation model can then be used to design high-usability icons that reduce the effort of icon designers. These two foundation models are very unique compared to other existing foundation models where our models focus on graphic design domains rather than natural scene images.
Strategy for Future Research Activity	The development of the foundation model of typography (i.e., FontCLIP) provides a great foundation for me to proceed on related research projects, such as utilize language input to design variable fonts and sentence generation. Meanwhile, with the foundation model for icon, I plan to combine it with a large language model to generate diverse icons that users can use for various graphic design tasks.

Report

(1 results)

2023 Research-status Report

Research Products
(12 results)

All 2024 2023 Other

All Int'l Joint Research (3 results) Journal Article (4 results) (of which Int'l Joint Research: 4 results, Peer Reviewed: 4 results, Open Access: 3 results) Presentation (1 results) (of which Int'l Joint Research: 1 results) Remarks (4 results)

[Int'l Joint Research] National Taiwan University/National Chung Cheng University/National Yang Ming Chiao Tung University(その他の国・地域)
- Related Report
  2023 Research-status Report
[Int'l Joint Research] Carleton University(カナダ)
- Related Report
  2023 Research-status Report
[Int'l Joint Research] Reichman University(イスラエル)
- Related Report
  2023 Research-status Report
[Journal Article] FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications2024
- Author(s)
  Yuki Tatsukawa、 I-Chao Shen、 Anran Qi、Yuki Koyama、Takeo Igarashi、Ariel Shamir
- Journal Title
  
  Computer Graphics Forum
  
  Volume: -
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] EvIcon: Designing High‐Usability Icon with Human‐in‐the‐loop Exploration and IconCLIP2023
- Author(s)
  Shen I‐Chao、Cherng Fu‐Yin、Igarashi Takeo、Lin Wen‐Chieh、Chen Bing‐Yu
- Journal Title
  
  Computer Graphics Forum
  
  Volume: 42 Issue: 6
- DOI
  10.1111/cgf.14924
- Related Report
  2023 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Data‐guided Authoring of Procedural Models of Shapes2023
- Author(s)
  Hossain Ishtiaque、Shen I‐Chao、Igarashi Takeo、van Kaick Oliver
- Journal Title
  
  Computer Graphics Forum
  
  Volume: 42 Issue: 7
- DOI
  10.1111/cgf.14935
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Palette‐Based and Harmony‐Guided Colorization for Vector Icons2023
- Author(s)
  Lin Miao、Shen I‐Chao、Chin Hsiao‐Yuan、Chen Ruo‐Xi、Chen Bing‐Yu
- Journal Title
  
  Computer Graphics Forum
  
  Volume: 42 Issue: 7
- DOI
  10.1111/cgf.14950
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] 3D Reconstruction from Sketch with Hidden Lines by Two-Branch Diffusion Model2024
- Author(s)
  Yuta Fukushima
- Organizer
  Eurographics
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Remarks] FontCLIP
- URL
  https://yukistavailable.github.io/fontclip.github.io/
- Related Report
  2023 Research-status Report
[Remarks] Icon colorization
- URL
  https://jdily.github.io/proj_site/icon_colorization.html
- Related Report
  2023 Research-status Report
[Remarks] Procedural Models Authoring
- URL
  https://jdily.github.io/proj_site/data_proc_model.html
- Related Report
  2023 Research-status Report
[Remarks] EvIcon
- URL
  https://jdily.github.io/proj_site/evicon_proj.html
- Related Report
  2023 Research-status Report

Language-driven visual content customization using construction sequences.

Principal Investigator

沈 奕超 東京大学, 大学院情報理工学系研究科, 助教 (40969119)

¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000)

Current Status of Research Progress

Reason

Report

Research Products

[Int'l Joint Research] National Taiwan University/National Chung Cheng University/National Yang Ming Chiao Tung University(その他の国・地域)

Related Report

[Int'l Joint Research] Carleton University(カナダ)

Related Report

[Int'l Joint Research] Reichman University(イスラエル)

Related Report

[Journal Article] FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications2024

Author(s)

Journal Title

Related Report

[Journal Article] EvIcon: Designing High‐Usability Icon with Human‐in‐the‐loop Exploration and IconCLIP2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Data‐guided Authoring of Procedural Models of Shapes2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Palette‐Based and Harmony‐Guided Colorization for Vector Icons2023

Author(s)

Journal Title

DOI

Related Report

[Presentation] 3D Reconstruction from Sketch with Hidden Lines by Two-Branch Diffusion Model2024

Author(s)

Organizer

Related Report

[Remarks] FontCLIP

URL

Related Report

[Remarks] Icon colorization

URL

Related Report

[Remarks] Procedural Models Authoring

URL

Related Report

[Remarks] EvIcon

URL

Related Report

沈奕超東京大学, 大学院情報理工学系研究科, 助教 (40969119)