2023 Fiscal Year Research-status Report

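Construction of a High-Efficiency Coding and High-Recognition-Rate Video System for Ultra-High-Definition Single-Viewpoint Multi-Lens Video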

Research Project

Project/Area Number	22K17913
Research Institution	The University of Tokushima
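Principal Investigator	Katayama Takafumi (片山貴文), The University of Tokushima, Graduate School of Technology, Industrial and Social Sciences (Division of Science and Technology), Assistant Professor (70848522)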
Project Period (FY)	2022-04-01 – 2025-03-31
Keywords	Video coding / Machine learning / Single-viewpoint multi-lens video coding / Versatile Video Coding
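Outline of Annual Research Achievements	Camera sensors that can capture single-viewpoint multi-lens video are now available for small IoT devices. However, the compression and recognition techniques applied to such video still contain considerable redundancy, and more efficient methods are needed. This research clarifies the spatial correlation between wide-angle and telephoto camera video by applying the Scalable Video Coding (SVC) standard and machine learning, and aims to make previously proposed compression and recognition techniques more efficient. Completing this research is expected to enable new image processing systems targeting small cameras. The objective is to build an image processing system that achieves both high coding efficiency and a high recognition rate for single-viewpoint multi-lens video by using scalable coding and machine learning. By combining a scalable high-efficiency coding scheme with image recognition methods based on a shared Convolutional Neural Network (CNN) or Transformer, the system is intended to achieve an optimal trade-off between computational cost and processing performance. This fiscal year, the research centered on verification and analysis of single-viewpoint multi-lens video. In this phase, we proposed a coding method for ultra-high-resolution single-viewpoint multi-lens video and examined how to integrate it with image recognition algorithms. Coding performance was verified by applying the method to Versatile Video Coding (VVenC), and ways of integrating it with image recognition algorithms were examined. The research results are twofold: (1) we proposed the main machine learning algorithms, and (2) we implemented machine learning in the video coding process. These results were compiled into research papers and presented at multiple international conferences.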
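Current Status of Research Progress	3: Progress in research has been slightly delayed. Reason: In FY2023 (Reiwa 5), we planned to verify coding performance by applying the SVC and MVC standards to VVC and to clarify the spatial correlation between WS and TS, but the implementation has required more effort than originally planned. Algorithms for high-speed parallel computation for VVC have recently been published, and a processing method based on them now needs to be proposed. We plan to accelerate the work in the first half of FY2024 (Reiwa 6) so that the originally scheduled steps are completed.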
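Strategy for Future Research Activity	In FY2024 (Reiwa 6), implementing the scalable high-efficiency coding scheme remains a major open task, so we will focus on it and begin development of the overall system. In addition, because a machine learning method specialized for parallel processing with multi-lens cameras needs to be proposed, the two lines of work will proceed in parallel.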
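Causes of Carryover	Reason: This fiscal year was spent on unit verification and evaluation of the system, which required less effort than expected, so there were no expenditures for personnel costs or honoraria. Usage plan: In FY2024 (Reiwa 6), the necessary personnel costs will be spent so that verification and evaluation of the overall system can proceed efficiently. Reason: The laptop PC to be used for the research will be delivered in April, so the payment has not yet been completed. Usage plan: Payment for the laptop PC is scheduled to be completed in April.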

Research Products
(26 results)

All Journal Article (14 results) (of which Peer Reviewed: 14 results) Presentation (12 results) (of which Int'l Joint Research: 12 results)

[Journal Article] Underwater Object Detection through Analysis and Data Augmentation of Underwater Datasets (2023)
- Author(s)
  Imada Atsuki, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Journal Title
  
  Lecture Notes in Networks and Systems, Springer
  
  Volume: - Pages: -
- Peer Reviewed
[Journal Article] Looking Closer to the Transferability Between Natural and Medical Images in Deep Learning (2023)
- Author(s)
  Rufaida Syahidah Izza, Putra Tryan Aditya, Leu Jenq-Shiou, Song Tian, Katayama Takafumi
- Journal Title
  
  IEEE Access
  
  Volume: 11 Pages: 79838-79850
- DOI
  10.1109/ACCESS.2023.3299819
- Peer Reviewed
[Journal Article] A Novel GAN-Based Intra Prediction Mode for HEVC (2023)
- Author(s)
  Takafumi Katayama, Tian Song and Takashi Shimamoto
- Journal Title
  
  The International Symposium on Communications and Information Technologies (ISCIT) 2023
  
  Volume: - Pages: 88-93
- Peer Reviewed
[Journal Article] Refined Datasets and Saliency Map Analysis for Underwater Object Detection (2023)
- Author(s)
  Takafumi Katayama, Tian Song and Takashi Shimamoto
- Journal Title
  
  OCEANS2023 Gulf Coast 2023
  
  Volume: - Pages: -
- Peer Reviewed
[Journal Article] High Efficiency Image Correction for Object Detection Improvement in Low Power Underwater Drone (2023)
- Author(s)
  Kazuto Shindo, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Journal Title
  
  OCEANS2023 Gulf Coast 2023
  
  Volume: - Pages: -
- Peer Reviewed
[Journal Article] A High Precision Counting Framework for Cerithidea moerchii towards Low Power Implementation (2023)
- Author(s)
  Zhang Hang, Katayama Takafumi, Song Tian, Shimamoto Takashi, Ota Naotomo
- Journal Title
  
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
  
  Volume: - Pages: -
- DOI
  10.1109/ITC-CSCC58803.2023.10212819
- Peer Reviewed
[Journal Article] Semantic Segmentation of River Video for Efficient River Surveillance System (2023)
- Author(s)
  Inoue Haruki, Katayama Takafumi, Song Tian, Shimamoto Takashi
- Journal Title
  
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
  
  Volume: - Pages: -
- DOI
  10.1109/ITC-CSCC58803.2023.10212938
- Peer Reviewed
[Journal Article] Video Semantic Segmentation for Intersection by Domain Adaptation (2023)
- Author(s)
  Suzuki Shota, Katayama Takafumi, Song Tian, Shimamoto Takashi
- Journal Title
  
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
  
  Volume: - Pages: -
- DOI
  10.1109/ITC-CSCC58803.2023.10212842
- Peer Reviewed
[Journal Article] YOLO-Based Bitrate Control Algorithm for VVC (2023)
- Author(s)
  Goto Kaito, Katayama Takafumi, Song Tian, Shimamoto Takashi
- Journal Title
  
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
  
  Volume: - Pages: -
- DOI
  10.1109/ITC-CSCC58803.2023.10212711
- Peer Reviewed
[Journal Article] Color Correction Method Using Monocular Depth Estimation Model for Underwater Images (2023)
- Author(s)
  Tamaki Hirotaka, Katayama Takafumi, Song Tian, Shimamoto Takashi
- Journal Title
  
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
  
  Volume: - Pages: -
- DOI
  10.1109/ITC-CSCC58803.2023.10212542
- Peer Reviewed
[Journal Article] A Novel Intra Prediction Mode using Transformer-based GAN for VVenC (2023)
- Author(s)
  Takafumi Katayama, Tian Song and Takashi Shimamoto
- Journal Title
  
  Advances in Signal Processing and Artificial Intelligence (ASPAI) 2023
  
  Volume: - Pages: -
- Peer Reviewed
[Journal Article] Dataset Generation and De-raining Algorithms for Video System of Drone (2023)
- Author(s)
  Tian Song, Sawada Takuya, Takafumi Katayama, Takashi Shimamoto and Leu Jenq-Shiou
- Journal Title
  
  The 9th International Forum on Advanced Technologies 2023 (IFAT2023)
  
  Volume: - Pages: -
- Peer Reviewed
[Journal Article] An Adaptive Selection Algorithm of Screen Content Coding Tools for Educational Video System (2023)
- Author(s)
  Tanaka Shuichiro, Song Tian, Katayama Takafumi, Shimamoto Takashi
- Journal Title
  
  2023 IEEE International Conference on Consumer Electronics (ICCE)
  
  Volume: - Pages: -
- DOI
  10.1109/ICCE56470.2023.10043502
- Peer Reviewed
[Journal Article] Semi-Supervised Learning Based De-Raining Method for UAV (2023)
- Author(s)
  Sawada Takuya, Katayama Takafumi, Song Tian, Shimamoto Takashi
- Journal Title
  
  2023 IEEE International Conference on Consumer Electronics (ICCE)
  
  Volume: - Pages: -
- DOI
  10.1109/ICCE56470.2023.10043414
- Peer Reviewed
[Presentation] A Novel GAN-Based Intra Prediction Mode for HEVC (2023)
- Author(s)
  Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  The International Symposium on Communications and Information Technologies (ISCIT) 2023
- Int'l Joint Research
[Presentation] Refined Datasets and Saliency Map Analysis for Underwater Object Detection (2023)
- Author(s)
  Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  OCEANS2023 Gulf Coast
- Int'l Joint Research
[Presentation] High Efficiency Image Correction for Object Detection Improvement in Low Power Underwater Drone (2023)
- Author(s)
  Kazuto Shindo, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  OCEANS2023 Gulf Coast
- Int'l Joint Research
[Presentation] A High Precision Counting Framework for Cerithidea moerchii towards Low Power Implementation (2023)
- Author(s)
  Zhang Hang, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
- Int'l Joint Research
[Presentation] Semantic Segmentation of River Video for Efficient River Surveillance System (2023)
- Author(s)
  Inoue Haruki, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
- Int'l Joint Research
[Presentation] Video Semantic Segmentation for Intersection by Domain Adaptation (2023)
- Author(s)
  Suzuki Shota, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
- Int'l Joint Research
[Presentation] YOLO-based Bitrate Control Algorithm for VVC (2023)
- Author(s)
  Goto Kaito, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
- Int'l Joint Research
[Presentation] Color Correction Method using Monocular Depth Estimation Model for Underwater Images (2023)
- Author(s)
  Tamaki Hirotaka, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of International Technical Conference on Circuits/Systems, Computers and Communications(ITC-CSCC2023)
- Int'l Joint Research
[Presentation] A Novel Intra Prediction Mode using Transformer-based GAN for VVenC (2023)
- Author(s)
  Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Advances in Signal Processing and Artificial Intelligence (ASPAI) 2023
- Int'l Joint Research
[Presentation] Dataset Generation and De-raining Algorithms for Video System of Drone (2023)
- Author(s)
  Tian Song, Sawada Takuya, Takafumi Katayama, Takashi Shimamoto and Leu Jenq-Shiou
- Organizer
  The 9th International Forum on Advanced Technologies 2023 (IFAT2023)
- Int'l Joint Research
[Presentation] An Adaptive Selection Algorithm of Screen Content Coding Tools for Educational Video System (2023)
- Author(s)
  Tanaka Shuichiro, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of IEEE International Conference on Consumer Electronics(ICCE)
- Int'l Joint Research
[Presentation] Semi-Supervised Learning Based De-Raining Method for UAV (2023)
- Author(s)
  Sawada Takuya, Takafumi Katayama, Tian Song and Takashi Shimamoto
- Organizer
  Proceedings of IEEE International Conference on Consumer Electronics(ICCE)
- Int'l Joint Research
