• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Method and Implementation of Code Clone Detection for a Huge Set of Program Source Code

Research Project

Project/Area Number 24680002
Research Category

Grant-in-Aid for Young Scientists (A)

Allocation TypePartial Multi-year Fund
Research Field Software
Research InstitutionOsaka University

Principal Investigator

Higo Yoshiki  大阪大学, 情報科学研究科, 准教授 (70452414)

Project Period (FY) 2012-04-01 – 2016-03-31
Project Status Completed (Fiscal Year 2015)
Budget Amount *help
¥9,750,000 (Direct Cost: ¥7,500,000、Indirect Cost: ¥2,250,000)
Fiscal Year 2014: ¥2,080,000 (Direct Cost: ¥1,600,000、Indirect Cost: ¥480,000)
Fiscal Year 2013: ¥2,990,000 (Direct Cost: ¥2,300,000、Indirect Cost: ¥690,000)
Fiscal Year 2012: ¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000)
Keywordsコードクローン / ソースコード分析 / ソースコード解析 / リポジトリマイニング / 大規模データセット
Outline of Final Research Achievements

We have proposed a technique to detect code clones (hereafter, clones) from a large set of software projects. Our technique detects method-level clones while existing techniques detect file-level clones. We confirmed that our technique can finish detecting clones from 300 million lines of code. We also have proposed a technique to trace clones along development history of source code. Our technique can trace clones even if its location in a system was changed while existing techniques cannot. We confirmed that our technique takes about 3 hours to trace clones along 5,000 revisions of source code history.

Report

(5 results)
  • 2015 Annual Research Report   Final Research Report ( PDF )
  • 2014 Annual Research Report
  • 2013 Annual Research Report
  • 2012 Annual Research Report
  • Research Products

    (36 results)

All 2016 2015 2014 2013 2012 Other

All Journal Article (13 results) (of which Peer Reviewed: 11 results,  Acknowledgement Compliant: 8 results,  Open Access: 1 results) Presentation (22 results) (of which Int'l Joint Research: 7 results) Remarks (1 results)

  • [Journal Article] 書き忘れたコードに対するコード補完について2016

    • Author(s)
      石原知也, 肥後芳樹, 楠本真二
    • Journal Title

      電子情報通信学会論文誌D

      Volume: J99-D Pages: 415-427

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] 複数メソッドに解離したコードクローンの検出手法2016

    • Author(s)
      高良多朗, 堀田圭佑, 肥後芳樹, 井垣宏, 楠本真二
    • Journal Title

      電子情報通信学会論文誌D

      Volume: 未定

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] コードクローンとロジカルカップリングを用いた文字列検索ツールの出力順序の並び替え2016

    • Author(s)
      佐飛祐介, 村上寛明, 肥後芳樹, 楠本真二
    • Journal Title

      コンピュータソフトウェア

      Volume: 未定

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Classification Model for Code Clones Based on Machine Learning2016

    • Author(s)
      Jiachen Yang,Keisuke Hotta,Yoshiki Higo,Hiroshi Igaki, Shinji Kusumoto
    • Journal Title

      Empirical Software Engineering

      Volume: 未定 Issue: 4 Pages: 1095-1125

    • DOI

      10.1007/s10664-014-9316-x

    • Related Report
      2014 Annual Research Report
    • Peer Reviewed / Open Access / Acknowledgement Compliant
  • [Journal Article] 機械学習を利用した危険なコードクローンの予測手法2015

    • Author(s)
      今里文香, 堀田圭佑, 肥後芳樹, 楠本真二
    • Journal Title

      電子情報通信学会論文誌D

      Volume: J98-D Pages: 847-850

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] 識別子名を用いたコミットのクラスタリング手法2015

    • Author(s)
      山内健二, 楊嘉晨, 堀田圭佑, 肥後芳樹, 楠本真二
    • Journal Title

      電子情報通信学会論文誌D

      Volume: J98-D Pages: 1060-1062

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] 粗粒度なコードクローン検出手法の精度に関する調査2015

    • Author(s)
      堀田圭佑, 楊嘉晨, 肥後芳樹, 楠本真二
    • Journal Title

      情報処理学会論文誌

      Volume: 56

    • NAID

      110009877374

    • Related Report
      2014 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] ギャップの位置情報を追加した正解クローンの作成2014

    • Author(s)
      村上寛明, 肥後芳樹, 楠本真二
    • Journal Title

      電子情報通信学会論文誌D

      Volume: J97-D

    • Related Report
      2014 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] CRDを用いたコードクローンの生存期間と修正回数に関する調査2014

    • Author(s)
      堀田圭佑, 肥後芳樹, 楠本真二
    • Journal Title

      情報処理学会論文誌

      Volume: 55

    • NAID

      110009665004

    • Related Report
      2013 Annual Research Report
  • [Journal Article] Smith-Watermanアルゴリズムを利用したギャップを含むコードクローン検出2014

    • Author(s)
      村上寛明, 堀田圭佑, 肥後芳樹, 井垣宏, 楠本真二
    • Journal Title

      情報処理学会論文誌

      Volume: 55

    • NAID

      110009665007

    • Related Report
      2013 Annual Research Report
  • [Journal Article] 機械学習を用いた類似度に基づく有用なコードクローンの自動特定手法 '2013

    • Author(s)
      楊嘉晨, 堀田圭佑, 肥後芳樹, 井垣宏, 楠本真二
    • Journal Title

      情報処理学会論文誌

      Volume: 54(2) Pages: 807-810

    • NAID

      110009537079

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] ソースコード中の繰り返し部分に着目したコードクローン検出ツールの実装と評価厂2013

    • Author(s)
      村上寛明, 堀田圭佑, 肥後芳樹, 井垣宏, 楠本真二
    • Journal Title

      情報処理学会論文誌

      Volume: 54(2) Pages: 845-856

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Journal Article] リポジトリマイニング可能なコードクローン版管理システムの提案2013

    • Author(s)
      畑秀明, 肥後芳樹, 楠本真二
    • Journal Title

      情報処理学会論文誌

      Volume: 54(2) Pages: 894-902

    • NAID

      110009537086

    • Related Report
      2012 Annual Research Report
    • Peer Reviewed
  • [Presentation] Toward Improving Ability to Repair Bugs Automatically‐A Patch Candidate Location Mechanism Using Code Similarity‐2016

    • Author(s)
      Haruki Yokoyama
    • Organizer
      the 31st ACM Symposium on Applied Computing (SAC2016)
    • Place of Presentation
      Pisa, Italy
    • Year and Date
      2016-04-04
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] “A Capable Crossover Technique on Automatic Program Repair2016

    • Author(s)
      Ryotaro Kou
    • Organizer
      the 7th IEEE International Workshop on Empirical Software Engineering in Practice (IWESEP2016)
    • Place of Presentation
      Suita,Osaka, Japan
    • Year and Date
      2016-03-13
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Kento Shimonaka, Soichi Sumi, Yoshiki Higo and Shinji Kusumoto2016

    • Author(s)
      Kento Shimonaka
    • Organizer
      the 7th IEEE International Workshop on Empirical Software Engineering in Practice (IWESEP2016)
    • Place of Presentation
      Suita,Osaka, Japan
    • Year and Date
      2016-03-13
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Toward Improving Graftability on Automated Program Repair2015

    • Author(s)
      Soichi Sumi
    • Organizer
      the 31st IEEE International Conference on Software Maintenance and Evolution (ICSME2015)
    • Place of Presentation
      Bremen, Germany
    • Year and Date
      2015-09-29
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Towards Purity-Guided Refactoring in Java2015

    • Author(s)
      Jiachan Yang
    • Organizer
      the 31st IEEE International Conference on Software Maintenance and Evolution (ICSME2015)
    • Place of Presentation
      Bremen, Germany
    • Year and Date
      2015-09-29
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Reordering Results of Keyword-based Code Search for Supporting Simultaneous Code Changes2015

    • Author(s)
      Yusuke Sabi
    • Organizer
      the 23rd IEEE International Conference on Program Comprehension (ICPC2015)
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2015-05-18
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Toward Reusing Code Changes2015

    • Author(s)
      Yoshiki Higo
    • Organizer
      the 12th Working Conference on Mining Software Repositories (MSR2015)
    • Place of Presentation
      Florence, Italy
    • Year and Date
      2015-05-16
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] On the Level of Code Suggestion for Reuse2015

    • Author(s)
      Akio Ohtani
    • Organizer
      the 9th International Workshop on Software Clones
    • Place of Presentation
      Montreal, Canada
    • Year and Date
      2015-03-06
    • Related Report
      2014 Annual Research Report
  • [Presentation] Clonepacker: a Tool for Clone Set Visualization2015

    • Author(s)
      Hiroaki Murakami
    • Organizer
      the 22nd IEEE International Conference on Software Analysis, Evolution, and Reengineering
    • Place of Presentation
      Montreal, Canada
    • Year and Date
      2015-03-02 – 2015-03-06
    • Related Report
      2014 Annual Research Report
  • [Presentation] How Often Are Necessary Code Missing? — A Controlled Experiment —2015

    • Author(s)
      Tomoya Ishihara
    • Organizer
      the 14th International Conference on Software Reuse
    • Place of Presentation
      Miami, Florida, USA
    • Year and Date
      2015-01-04 – 2015-01-06
    • Related Report
      2014 Annual Research Report
  • [Presentation] Predicting Risky Clones Based on Machine Learning2014

    • Author(s)
      Ayaka Imazato
    • Organizer
      the 15th International Conference of Product Focused Software Development and Process Improvement
    • Place of Presentation
      Helsinki, Finland
    • Year and Date
      2014-12-10 – 2014-12-12
    • Related Report
      2014 Annual Research Report
  • [Presentation] How Should We Measure Functional Sameness from Program Source Code? — An Exploratory Study on Java Methods —2014

    • Author(s)
      Yoshiki Higo
    • Organizer
      the 22nd International Symposium on the Foundations of Software Engineering
    • Place of Presentation
      Hong Kong
    • Year and Date
      2014-11-16 – 2014-11-22
    • Related Report
      2014 Annual Research Report
  • [Presentation] Clustering Commits for Understanding the Intents of Implementation2014

    • Author(s)
      Kenji Yamauchi
    • Organizer
      the 30th International Conference on Software Maintenance and Evolution
    • Place of Presentation
      British Columbia, Canada
    • Year and Date
      2014-09-28 – 2014-10-03
    • Related Report
      2014 Annual Research Report
  • [Presentation] MPAnalyzer: A Tool for Finding Unintended Inconsistencies in Program Source Code2014

    • Author(s)
      Yoshiki Higo
    • Organizer
      the 29th IEEE/ACM International Conference on Automated Software Engineering
    • Place of Presentation
      Vasteras, Sweden
    • Year and Date
      2014-09-15 – 2014-09-19
    • Related Report
      2014 Annual Research Report
  • [Presentation] Hey! Are you Committing Tangled Changes?2014

    • Author(s)
      Hiroyuki Kirinuki
    • Organizer
      the 22nd International Conference of Program Comprehension
    • Place of Presentation
      Hyderabad, India
    • Year and Date
      2014-06-02 – 2014-06-03
    • Related Report
      2014 Annual Research Report
  • [Presentation] A Dataset of Clone References with Gaps2014

    • Author(s)
      Hiroaki Murakami
    • Organizer
      the 11th Working Conference on Mining Software Repositories
    • Place of Presentation
      Hyderabad, India
    • Year and Date
      2014-05-31 – 2014-06-01
    • Related Report
      2014 Annual Research Report
  • [Presentation] A Dataset of Clone References with Gaps2014

    • Author(s)
      Hiroaki Murakami, Yoshiki Higo, Shinji Kusumoto
    • Organizer
      11th Working Conference on Mining Software Repositories
    • Place of Presentation
      Hyderabad, India
    • Related Report
      2013 Annual Research Report
  • [Presentation] How Accurate Is Coarse-grained Clone Detection?: Comparison with Fine-grained Detectors2014

    • Author(s)
      Keisuke Hotta, Yoshiki Higo, and Shinji Kusumoto
    • Organizer
      the 8th International Workshop of Software Clones
    • Place of Presentation
      Antwerp, Belgium
    • Related Report
      2013 Annual Research Report
  • [Presentation] Reusing Reused Code2013

    • Author(s)
      Tomoya Ishihara, Yoshiki Higo, and Shinji Kusumoto
    • Organizer
      the 20th Working Conference on Reverse Engineering
    • Place of Presentation
      Koblenz, Germany
    • Related Report
      2013 Annual Research Report
  • [Presentation] Enhancement of CRD-based Clone Tracking2013

    • Author(s)
      Yoshiki Higo, Keisuke Hotta, and Shinji Kusumoto
    • Organizer
      the 13th International Workshop on Principles of Software Evolution
    • Place of Presentation
      Saint Petersburg, Russia
    • Related Report
      2013 Annual Research Report
  • [Presentation] Gapped Code Clone Detection with Lightweight Source Code Analysis”2013

    • Author(s)
      Hiroaki Murakami, Keisuke Hotta, Yoshiki Higo, Hiroshi Igaki, and Shinji Kusumoto
    • Organizer
      the 21st International Conference on Program Comprehension
    • Place of Presentation
      San Francisco, California
    • Related Report
      2013 Annual Research Report
  • [Presentation] Folding Repeated Instructions for Improving Token-Based Code Clone Detection2012

    • Author(s)
      Hiroaki Murakami
    • Organizer
      12th International Working Conierence ol Source Code Analysis ald Manipulation
    • Place of Presentation
      イタリア
    • Year and Date
      2012-09-23
    • Related Report
      2012 Annual Research Report
  • [Remarks] CloneGear

    • URL

      https://github.com/YoshikiHigo/CloneGear

    • Related Report
      2015 Annual Research Report

URL: 

Published: 2012-04-24   Modified: 2019-07-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi