Which Change Sets in Git Repositories Are Related?

Jasmin Ramadani, Stefan Wagner

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Software repositories contain valuable information about the history of software changes. Using data mining, researchers have identified file changes that happened together frequently to present hints for necessary changes to developers. However, not all file change sets are related. This can affect the recommendations about coupled file changes negatively by delivering irrelevant couplings to the developers. The commit time and branching characteristics of Git have not been investigated together in previous heuristics for grouping related change sets. We exploit the mappings between commit messages and issue ids for judging the relatedness of change sets. We propose a heuristic for Git and investigate the influence of two factors, the time between the commits and their branching on the relatedness of change sets using the repositories of five open-source systems using logistic regression. According to our findings, the combination of these two factors influences the relatedness of change sets. Individually measured, only the time significantly influences the relatedness, the branching itself does not. Our results support previous heuristic that also in Git repositories the commit time is important for grouping related change sets.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE International Conference on Software Quality, Reliability and Security, QRS 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages406-413
Number of pages8
ISBN (Electronic)9781509041275
DOIs
StatePublished - 12 Oct 2016
Externally publishedYes
Event2nd IEEE International Conference on Software Quality, Reliability and Security, QRS 2016 - Vienna, Austria
Duration: 1 Aug 20163 Aug 2016

Publication series

NameProceedings - 2016 IEEE International Conference on Software Quality, Reliability and Security, QRS 2016

Conference

Conference2nd IEEE International Conference on Software Quality, Reliability and Security, QRS 2016
Country/TerritoryAustria
CityVienna
Period1/08/163/08/16

Keywords

  • branch
  • change sets
  • commit
  • data mining
  • git
  • repositories

Fingerprint

Dive into the research topics of 'Which Change Sets in Git Repositories Are Related?'. Together they form a unique fingerprint.

Cite this