Simpler Software Analytics: When? When Not?


ABSTRACT

FU, WEI. Simpler Software Analytics: When? When not?. (Under the direction of Dr. Timothy Menzies.)

Software engineering researchers and industrial practitioners routinely make extensive use of software analytics to discover insights about their projects: how long it will take to develop their code, or where bugs are most likely to occur. Significant research effort has been put into improving the quality of software analytics models. There has been continuous development of new feature selection, feature discovery and even new modeling techniques for software analytics. These exciting innovations have the potential to dramatically improve the quality of software analytics tools. The problem is that most of these methods are complex: not easy to explain, hard to reproduce, and requiring large amounts of computational resources.

In this dissertation, we investigate the potential of simpler software analytics. The thesis of this dissertation is that software analytics should be simpler; software analytics can be simpler; however, oversimplification can be harmful. To defend this thesis, we have conducted several studies that explore simple tools on different tasks, such as software defect prediction and software text mining.

First of all, we discuss why software analytics should be simpler. From the perspectives of effectiveness, economy, explainability and reproducibility, we argue that simpler software analytics has more potential to impact the software engineering industry. To show that software analytics can be simpler, we conduct a study that simplifies hyper-parameter tuning techniques for software defect prediction tasks. According to our literature review, hyper-parameter tuning is ignored in most software defect prediction work. Even though some researchers have proposed methods to tune learners' hyper-parameters, those methods usually take at least hours to complete, which does not motivate researchers to tune hyper-parameters in every single task. In this dissertation, we propose applying a search-based software engineering method, the differential evolution algorithm, to tune defect predictors. In our experiments, we usually get better performance than predictors with default parameters. Meanwhile, tuning with differential evolution terminates within seconds or minutes on defect prediction tasks, which requires far fewer resources than the most frequently used hyper-parameter tuning techniques proposed by other researchers.


Secondly, to further show that software analytics can be simpler, we study a text mining task: predicting the relatedness between different questions posted on Stack Overflow. First of all, it is hard to select a case study like the one we studied to explore simplicity versus complexity, because most papers using complex methods (e.g., deep learning) do not share either their source code or their data, which could introduce bias if we explored the same topic with different datasets or re-implemented the proposed method. In this study, even though we got the same testing data from the authors we compared with, we were still missing the training data. Therefore, to fully reproduce the experiment, we spent months collecting data, pre-processing data and reproducing the prior study. Based on the experimental results, we find that tuning a simple learner with the differential evolution algorithm outperforms the deep learning method for this task, while our proposed method is 84 times faster. This further confirms our claim that software analytics can be simpler and that complex software analytics methods can be improved upon and outperformed by simpler methods. Since most existing software analytics research tends to be complex and not fully open-sourced, showing that any single software analytics task can be simplified is not simple at all.

Thirdly, we show that simplification of software analytics can be done in a harmful way. Specifically, over-simplification without considering local information can generate a less effective model. By reproducing a recent FSE'16 study about just-in-time effort-aware defect prediction, we find that the unsupervised predictors proposed in the original paper do not yield consistent predictions across all the proposed unsupervised learners. To improve their method, we investigated the use of local data to prune weak predictors and select only the best one as the final predictor. Experimental results show that our proposed simple method, OneWay, performs better than the unsupervised learners as well as more complex standard supervised learners. Through this study, we defend our claim that software analytics can be simpler but cannot be oversimplified.


© Copyright 2018 by Wei Fu


Simpler Software Analytics: When? When not?

by Wei Fu

A dissertation submitted to the Graduate Faculty of North Carolina State University

in partial fulfillment of the requirements for the Degree of

Doctor of Philosophy

Computer Science

Raleigh, North Carolina

2018

APPROVED BY:

Dr. Min Chi Dr. Christopher Parnin

Dr. Ranga Vatsavai Dr. Timothy Menzies


DEDICATION


BIOGRAPHY


ACKNOWLEDGEMENTS

A five-year PhD study is a special journey in my life. I could not have finished it without the support and encouragement of the many nice people I met over this time. Therefore, I would like to thank the following people.

I am grateful to my advisor, Dr. Tim Menzies, for the time and effort he took in introducing me to software engineering research, and for all his enthusiasm, patience, advice, and funding. That means a lot to me. Our story informally started the moment I followed him on Facebook in the summer of 2014. Back then, I was down and about to quit my PhD program, while he had just started his career at NC State. I was his first NC State graduate student, and one who happened to have no computer science background; he always patiently explained every single concept or idea that I could not understand, sometimes even repeating English words until I got them. Over the past four years, he raised me up, so that I can stand on mountains; he raised me up to more than I can be. This thesis would not have been possible without his help on every single step of my PhD study.

I am grateful to my PhD committee members, Dr. Min Chi, Dr. Chris Parnin, and Dr. Raju Vatsavai, for all the time spent, all the help and advice on my research and career.

I am grateful to the members of Dr. Tim Menzies' RAISE lab: Vivek Nair, Rahul Krishna, George Mathew, Jianfeng Chen, Zhe Yu, Amritanshu Agrawal, Di Chen, Tianpei Xia, Huy Tu, Suvodeep Majumder, and our distinguished visiting scholar, Dr. Junjie Wang. Thanks for all the critiques of my work, the project collaboration, and even the paper proofreading. Those tears, beers, woes, and joys that we shared together are great moments that will never be forgotten. I am grateful to the RAISE lab founding members: Vivek Nair, Rahul Krishna and George Mathew, who taught me not only new computer skills but also old Indian culture. I will miss those days and nights that I stood around and listened to random research discussions in room 3231, even though most of them turned out to not work at all. Special thanks to Vivek Nair, who has been serving as my "vice-advisor" over the past 4 years whenever the actual one was not around. I wish I could have put him on my committee, even though our first co-authored work went nowhere.

I am grateful to the friends I made at NC State. This list can only be partial: Qiang (Jack) Zhang, Junjiamin Mu, Chen (Liana) Lin, Feifei Wang, Shengpei Zhang, Qiong Tao, Kelei Gong, Jiaming Li, Pei Deng, Xing Pan, Peipei Wang, Hui Guan, Liang Dong, Xiaozhou Fang, Hong Xiong, Xin Xu, Akhilesh Tanneeru, Adam Gillfillan. Thanks for your invaluable support, advice, and inspiring ideas. They were really important during my whole PhD study, especially in my tough times.


we still keep in touch, and the tremendous support from this group of friends keeps me moving forward. Special thanks to Dr. Feifei Gao from Tsinghua University; I could not even have started my PhD study without his advice, encouragement and recommendation.


TABLE OF CONTENTS

LIST OF TABLES . . . ix

LIST OF FIGURES. . . xi

Chapter 1 Introduction. . . 1

1.1 Background . . . 1

1.1.1 Software Analytics . . . 1

1.1.2 Defect Prediction . . . 2

1.2 Contributions . . . 5

I Software Analytics Should Be Simpler 8

Chapter 2 The Quest for Simplicity in Software Analytics . . . 9

II Software Analytics Can Be Simpler 12

Chapter 3 Tuning for Software Analytics . . . 13

3.1 Introduction . . . 14

3.2 Preliminaries . . . 15

3.2.1 Tuning: Important and Ignored . . . 15

3.2.2 You Can’t Always Get What You Want . . . 16

3.2.3 Notes on Data Miners . . . 18

3.2.4 Learners and Their Tunings . . . 18

3.2.5 Tuning Algorithms . . . 21

3.3 Experimental Design . . . 23

3.3.1 Data Sets . . . 23

3.3.2 Optimization Goals . . . 24

3.4 Experimental Results . . . 24

3.4.1 RQ1: Does Tuning Improve Performance? . . . 24

3.4.2 RQ2: Does Tuning Change a Learner’s Ranking ? . . . 25

3.4.3 RQ3: Does Tuning Select Different Project Factors? . . . 28

3.4.4 RQ4: Is Tuning Easy? . . . 29

3.4.5 RQ5: Is Tuning Impractically Slow? . . . 30

3.4.6 RQ6: Should we use “off-the-shelf” Tunings? . . . 32

3.5 Reliability and Validity . . . 33

3.6 Conclusions . . . 33

Chapter 4 Differential Evolution V.S. Grid Search . . . 35

4.1 Introduction . . . 36

4.2 Background . . . 38


4.2.2 Parameter Tuning . . . 39

4.3 Method . . . 41

4.3.1 Algorithms . . . 41

4.3.2 Data Miners . . . 42

4.3.3 Tuning Parameters . . . 43

4.3.4 Data . . . 43

4.3.5 Optimization Goals . . . 44

4.3.6 20 Random Runs: . . . 45

4.3.7 Statistical Tests . . . 45

4.4 Results . . . 46

4.5 Discussion . . . 50

4.5.1 Why does DE perform better than grid search? . . . 50

4.5.2 When (not) to use DE? . . . 51

4.6 Threats to Validity . . . 54

4.7 Conclusion . . . 55

Chapter 5 Easy Over Hard. . . 56

5.1 Introduction . . . 57

5.2 Background and Related Work . . . 58

5.2.1 Why Explore Faster Software Analytics? . . . 58

5.2.2 What is Deep Learning? . . . 59

5.2.3 Deep Learning in SE . . . 60

5.2.4 Parameter Tuning in SE . . . 63

5.3 Method . . . 64

5.3.1 Research Problem . . . 64

5.3.2 Learners and Their Parameters . . . 65

5.3.3 Learning Word Embedding . . . 65

5.3.4 Tuning Algorithm . . . 67

5.4 Experimental Setup . . . 68

5.4.1 Research Questions . . . 68

5.4.2 Dataset and Experimental Design . . . 68

5.4.3 Evaluation Metrics . . . 70

5.5 Results . . . 71

5.6 Discussion . . . 77

5.6.1 Why DE+SVM works? . . . 77

5.6.2 Implication . . . 78

5.6.3 Threats to Validity . . . 78

5.7 Conclusion . . . 79

III However, Oversimplification Can Be Harmful 80

Chapter 6 Unsupervised Learners . . . 81

6.1 Introduction . . . 82


6.3 Background and Related Work . . . 84

6.3.1 Defect Prediction . . . 84

6.3.2 Just-In-Time Defect Prediction . . . 85

6.4 Method . . . 86

6.4.1 Unsupervised Predictors . . . 86

6.4.2 Supervised Predictors . . . 88

6.4.3 OneWay Learner . . . 88

6.5 Experimental Settings . . . 89

6.5.1 Research Questions . . . 89

6.5.2 Data Sets . . . 90

6.5.3 Experimental Design . . . 91

6.5.4 Evaluation Measures . . . 92

6.6 Empirical Results . . . 93

6.7 Threats to Validity . . . 102

6.8 Conclusion and Future Work . . . 102

IV Future Work and Conclusion 104

Chapter 7 Future Work . . . 105

7.1 Introduction . . . 106

7.2 Background . . . 108

7.2.1 Different Defect Prediction Approaches . . . 108

7.2.2 Evaluation Criteria . . . 109

7.2.3 Sources of ε Uncertainty . . . 111

7.3 ε-Domination . . . 113

7.4 Experimental Setup . . . 115

7.4.1 Research Questions . . . 116

7.4.2 Datasets . . . 117

7.5 Results . . . 117

7.5.1 RQ1: Do established learners sample results space better than a few DARTs? . 117

7.5.2 RQ2: Do goal-savvy learners sample results space better than a few DARTs? . . 119

7.5.3 RQ3: Do data-savvy learners sample results space better than a few DARTs? . 121

7.6 Threats to Validity . . . 123

7.7 Discussion and Future Work . . . 124

Chapter 8 Conclusion. . . .127


LIST OF TABLES

Table 1.1 OO Measures used in our defect data sets. . . 4

Table 3.1 List of parameters tuned by this study. . . 19

Table 3.2 Data used in this experiment. E.g., the top left data set has 20 defective classes out of 125 total. See §3.3.1 for an explanation of the training, tuning, and testing sets. . . 23

Table 3.3 Precision results (best results shown in bold). . . 26

Table 3.4 F-measure results (best results shown in bold). . . 26

Table 3.5 Features selected by tuned WHERE with different goals: bold features are those found useful by the tuned WHERE. Also, features shown in plain text are those found useful by the untuned WHERE. . . 28

Table 3.6 Kolmogorov-Smirnov Tests for distributions of Figure 3.3 . . . 30

Table 3.7 Number of evaluations for tuned learners, optimizing for precision and F-Measure. . . 31

Table 3.8 Runtime for tuned and default learners (in sec), optimizing for precision and F-Measure. . . 31

Table 4.1 List of parameters tuned by this study. . . 39

Table 4.2 Data used in this case study. . . 43

Table 5.1 Classes of knowledge unit pairs. . . 64

Table 5.2 List of parameters tuned by this study. . . 64

Table 5.3 Confusion matrix. . . 70

Table 5.4 Comparison of our baseline method with XU's. The best scores are marked in bold. . . 72

Table 5.5 Comparison of the tuned SVM with XU's CNN method. The best scores are marked in bold. . . 74

Table 5.6 Comparison of experimental environment . . . 76

Table 6.1 Change metrics used in our data sets. . . 87

Table 6.2 Statistics of the studied data sets . . . 90

Table 6.3 Comparison in Popt: Yang's method (A) vs. our implementation (B) . . . 94

Table 6.4 Comparison in Recall: Yang's method (A) vs. our implementation (B) . . . 94

Table 6.5 Best unsupervised predictor (A) vs. OneWay (B). Colored cells indicate the effect size: green for large; yellow for medium; gray for small. . . 97

Table 6.6 Best supervised predictor (A) vs. OneWay (B). Colored cells indicate the effect size: green for large; yellow for medium; gray for small. . . 100

Table 7.1 32 defect predictors clustered by their performance rank by Ghotra et al. (using a Scott-Knott statistical test) [Gho15]. . . 108

Table 7.2 Statistics of the studied data sets. . . 118

Table 7.3 DART v.s. state-of-the-art defect predictors from Ghotra et al. [Gho15] for dis2heaven and Popt. Gray cells mark the best performances on each project (so DART is most often best). . . 118


Table 7.5 DART v.s. tuning Random Forests for dis2heaven and Popt. Gray cells mark the best performances on each project. Note that even when DART does not perform best, it usually performs very close to the best. . . 122

Table 7.6 DART v.s. data-savvy learners in dis2heaven and Popt. Gray cells mark best


LIST OF FIGURES

Figure 3.1 Deltas in performance seen in Table 3.3 (left) and Table 3.4 (right) between tuned and untuned learners. Tuning improves performance when the deltas are above zero. . . 25

Figure 3.2 Comparison between Logistic Regression and Random Forest before and after tuning. . . 27

Figure 3.3 Deltas in performance between np=10 and the recommended np's. The recommended np is better when deltas are above zero. np = 90, 50 and 60 are the recommended population sizes for WHERE, CART and Random Forest by Storn. . . 29

Figure 3.4 Four representative tuning values in WHERE with precision and F-measure as the tuning goal, respectively. . . 32

Figure 4.1 Literature review about parameter tuning on 52 top cited defect prediction papers . . . 40

Figure 4.2 Tuning to improve F. Median results from 20 repeats. . . 45

Figure 4.3 Tuning to improve Precision. Median results from 20 repeats. . . 46

Figure 4.4 Tuning to improve AUC. Median results from 20 repeats. . . 47

Figure 4.5 Tuning to improve AUC20. Median results from 20 repeats. . . 47

Figure 4.6 Time (in seconds) required to run 20 repeats of parameter tunings for learners to generate Figure 4.2 to Figure 4.5. . . 48

Figure 4.7 Runtime relative to just running the learners using their default tunings. . . 49

Figure 4.8 Intrinsic dimensionality of our data. . . 52

Figure 4.9 Illustration for data D=dimensionality . . . 52

Figure 5.1 Procedure TUNER: strives to find "good" tunings which maximize the objective score of the model on training and tuning data. TUNER is based on Storn's differential evolution optimizer [SP97]. . . 67

Figure 5.2 The overall workflow of building knowledge units predictor with tuned SVM. . . . 69

Figure 5.3 Score delta between our SVM and XU's SVM in [Xu16] in terms of precision, recall and F1-score. Positive values mean our SVM is better than XU's SVM in terms of different measures; otherwise, XU's SVM is better. . . 72

Figure 5.4 Score delta between tuned SVM and the CNN method [Xu16] in terms of precision, recall and F1-score. Positive values mean tuned SVM is better than CNN in terms of different measures; otherwise, CNN is better. . . 74

Figure 5.5 Score delta between the tuned SVM and XU's baseline SVM in terms of precision, recall and F1-score. Positive values mean the tuned SVM is better than XU's SVM in terms of different measures; otherwise, XU's SVM is better. . . 75

Figure 5.6 Score delta between tuned SVM and our untuned SVM in terms of precision, recall and F1-score. Positive values mean the tuned SVM is better than our untuned SVM in terms of different measures; otherwise, our SVM is better. . . 75

Figure 6.1 Example of an effort-based cumulative lift chart [Yan16b]. . . 92


Figure 6.3 Performance comparisons between the proposed OneWay learner and unsupervised predictors over six projects (from top to bottom: Bugzilla, Platform, Mozilla, JDT, Columba, PostgreSQL). . . 98

Figure 6.4 Performance comparisons between the proposed OneWay learner and supervised predictors over six projects (from top to bottom: Bugzilla, Platform, Mozilla, JDT, Columba, PostgreSQL). . . 100

Figure 7.1 Illustration of the Popt metric. . . 109

Figure 7.2 Illustration of the dis2heaven metric. . . 110

Figure 7.3 Results space when recall is greater than false alarms (see blue curve). . . 111

Figure 7.4 Grids in results space. . . 113

Figure 7.5 ε can vary across results space. 100 experiments with LUCENE results using 90% of the data for training and 10% for testing. The x-axis sorts the code base, first according to predicted defective or not, then second on lines of code. The blue line shows the distribution of the 100 results across this space. For example, 70% of the results predict defects for up to 20% of the code (see the blue curve). The y-axis of this figure shows the mean recall (in red); the standard deviation σ of the recall (in yellow); and ε (in green), defined as per standard t-tests, which say that at the 95% confidence level two distributions differ when they are ε = 2 × 1.96 × σ apart. . . 114

Figure 7.6 DART: an ensemble algorithm to sample the results space M times. . . 116

Figure 7.7 A simple model for software defect prediction . . . 116

Figure 7.8 TUNER is an evolutionary optimization algorithm based on Storn's differential evolution algorithm [SP97; FM17b]. . . 121


CHAPTER 1

INTRODUCTION

1.1 Background

1.1.1 Software Analytics

In the 21st century, it is impossible to manually browse all available software project data. The SeaCraft repository of SE data has grown to 200+ projects [Sea], and this is just one of over a dozen open-source repositories that are readily available to researchers [Rod12]. Faced with this data overload, researchers in the empirical software engineering field and software practitioners use analytics, such as data mining tools, to generate models that give a better understanding of their software projects. According to Menzies et al. [MZ13a], software analytics is defined as: "software analytics is analytics on software data for managers and software engineers with the aim of empowering software development individuals and teams to gain and share insight from their data to make better decisions." Software analytics has been applied to many applications in Software Engineering (SE). For example, it has been used to estimate how long it would take to integrate new code into an existing project [Cze11a], where defects are most likely to occur [Ost04; Men07c], whether two questions asked by software developers are related [Xu16], or how long it will take to develop a project [Tur11; Koc12a]. Large organizations, like Microsoft, routinely practice data-driven policy development where organizational policies are learned from an extensive analysis of large data sets [BZ14; The15a].


However, the growing complexity of these methods makes it harder for users to reproduce models, understand models, or operationalize models. For example, there is some evidence that deep learning methods are very effective for building software analytics models based on high dimensional data [Xu16; Wan16; Gu16]. However, most deep learning methods have very complex structures, and a tiny difference in configuration may lead to a huge difference in the results; for example, different weight initialization schemes have a great impact on the speed of network convergence. Therefore, it is hard for a third party to reproduce such models if the original research paper did not share them. Furthermore, most deep learning methods require weeks of GPU/CPU time to train a model on a moderately sized data set [Xu16]; much of that GPU/CPU time could be saved if there were a faster method for the same analytics. The total financial cost of training deep learning models can therefore be prohibitive, particularly for long running tasks. In addition, the most obvious drawback of deep learning methods is that they do not readily support explainability; they have even been criticized as "data mining alchemy" [Syn17]. At a recent workshop on "Actionable Analytics" at ASE'15, business users were very vocal in their complaints about analytics [HM15], saying that analytics rarely produces models that business users can understand or operationalize.

In software analytics, if a model is to be used to persuade software engineers to change what they are doing, it needs to be comprehensible so humans can debate the merits of its conclusions. Several researchers demand that software analytics models need to be expressed in a simple way that is easy for software practitioners to interpret [Men14a; Lip16; Dam18]. According to Kim et al. [Kim16], software analytics aims to obtain actionable insights from software artifacts that help practitioners accomplish tasks related to software development, systems, and users. Other researchers [TC16] argue that, for software vendors, managers, developers and users, such comprehensible insights are the core deliverable of software analytics. Sawyer et al. comment that actionable insight is the key driver for businesses to invest in data analytics initiatives [Saw13]. Accordingly, much research focuses on the generation of simple models, or on making blackbox models more explainable, so that human engineers can understand and appropriately trust the decisions made by software analytics models [FM17b; AN16].

Based on the drawbacks discussed above, we think that we need simple, actionable, and explainable software analytics models. Therefore, for software researchers and practitioners, it is time to simplify software analytics from the perspectives of effectiveness, economy, explainability and reproducibility. In this dissertation, I will show what I have learned about simplicity in software analytics.

1.1.2 Defect Prediction


Human programmers are clever, but flawed. Coding adds functionality, but also defects. Hence, software sometimes crashes (perhaps at the most awkward or dangerous moment) or delivers the wrong functionality. For a very long list of software-related errors, see Peter Neumann’s “Risk Digest” at catless.ncl.ac.uk/Risks.

Since programming inherently introduces defects into programs, it is important to test them before they are used. Testing is expensive: software assessment budgets are finite, while the effort required grows exponentially with the desired assessment effectiveness. For example, for black-box testing methods, a linear increase in the confidence C of finding defects can take exponentially more effort:

• A randomly selected input to a program will find a fault with probability p.

• After N random black-box tests, the chance of the inputs not revealing any fault is (1−p)^N.

• Hence, the chance C of seeing the fault is 1−(1−p)^N, which can be rearranged to N(C, p) = log(1−C)/log(1−p).

• For example, N(0.90, 10^−3) = 2301 but N(0.98, 10^−3) = 3910; i.e. nearly double the number of tests.
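To make the arithmetic above concrete, the small Python script below (a sketch added for illustration, not part of the original study) reproduces the numbers quoted in the last bullet.

import math

def tests_needed(confidence, p):
    # N(C, p) = log(1 - C) / log(1 - p): number of random black-box tests
    # needed to reach confidence C of seeing a fault that each test
    # triggers with probability p
    return math.log(1 - confidence) / math.log(1 - p)

p = 1e-3
for c in (0.90, 0.98):
    print(f"N({c}, {p}) = {tests_needed(c, p):.0f}")
# prints roughly 2301 and 3910: a linear gain in confidence
# costs an exponential increase in testing effort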

Exponential costs quickly exhaust finite resources, so standard practice is to apply the best available methods on code sections that seem most critical. But any method that focuses on parts of the code can blind us to defects in other areas. Some lightweight sampling policy should be used to explore the rest of the system. This sampling policy will always be incomplete. Nevertheless, it is the only option when resources prevent a complete assessment of everything.

One such lightweight sampling policy is defect predictors learned from static code attributes. Given software described in the attributes of Table 1.1, data miners can learn where the probability of software defects is highest. The rest of this section argues that such defect predictors are easy to use, widely used, and useful.
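As a minimal sketch of such a lightweight sampling policy (the file name and column names below are hypothetical placeholders, not the data sets used later in this dissertation), a defect predictor can be learned from a table of static code attributes in a few lines of Scikit-learn:

import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

data = pd.read_csv("defects.csv")        # one row per class/module, with Table 1.1 metrics
X = data.drop(columns=["bug"])           # static code attributes (wmc, rfc, loc, ...)
y = data["bug"] > 0                      # defective or not

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)
learner = RandomForestClassifier(n_estimators=100, random_state=1)
learner.fit(X_train, y_train)
print(classification_report(y_test, learner.predict(X_test)))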

Easy to use: Static code attributes can be automatically collected, even for very large systems [NB05a]. Other methods, like manual code reviews, are far slower and far more labor-intensive. For example, depending on the review method, 8 to 20 LOC/minute can be inspected, and this effort repeats for all members of the review team, which can be as large as four or six people [Men02].

Widely used: Researchers and industrial practitioners use static attributes to guide software quality predictions. Defect prediction models have been reported at Google [Lew13]. Verification and validation (V&V) textbooks [Rak01] advise using static code complexity attributes to decide which modules are worth manual inspections.


Table 1.1 OO measures used in our defect data sets.

Metric | Name | Description
amc | average method complexity | Number of Java byte codes.
avg_cc | average McCabe | Average McCabe's cyclomatic complexity seen in a class.
ca | afferent couplings | How many other classes use the specific class.
cam | cohesion amongst classes | #different method parameter types divided by (#different method parameter types in a class) * (#methods).
cbm | coupling between methods | Total number of new/redefined methods to which all the inherited methods are coupled.
cbo | coupling between objects | Increased when the methods of one class access services of another.
ce | efferent couplings | How many other classes are used by the specific class.
dam | data access | Ratio of private (protected) attributes to total attributes.
dit | depth of inheritance tree | Defined as the maximum length from the node to the root of the tree.
ic | inheritance coupling | Number of parent classes to which a given class is coupled (includes counts of methods and variables inherited).
lcom | lack of cohesion in methods | Number of pairs of methods that do not share a reference to an instance variable.
lcom3 | another lack of cohesion measure | m and a count the methods and attributes in a class; µ(a_j) is the number of methods accessing attribute a_j; lcom3 = ((1/a) Σ_j µ(a_j) − m) / (1 − m).
loc | lines of code | Total lines of code in this file or package.
max_cc | maximum McCabe | Maximum McCabe's cyclomatic complexity seen in a class.
mfa | functional abstraction | Number of methods inherited by a class divided by the number of methods accessible by member methods of the class.
moa | aggregation | Count of the number of data declarations (class fields) whose types are user-defined classes.
noc | number of children | Number of direct descendants (subclasses) for each class.
npm | number of public methods | Counts all the methods in a class that are declared as public.
rfc | response for a class | Number of methods invoked in response to a message to the object.
wmc | weighted methods per class | A class with more member functions than its peers is considered to be more complex and therefore more error prone.


Useful: The performance of defect predictors in finding bugs is markedly higher than that of other currently-used industrial methods such as manual code reviews. For example, a panel at IEEE Metrics 2002 [Shu02] concluded that manual software reviews can find ≈60% of defects. In other work, Raffo documents the typical defect detection capability of industrial review methods: around 50% for full Fagan inspections [Fag76] down to 21% for less-structured inspections.

Not only do static code defect predictors perform well compared to manual methods, they are also competitive with certain automatic methods. In a recent study at ICSE'14, Rahman et al. [Rah14] compared (a) static code analysis tools FindBugs, Jlint, and Pmd with (b) static code defect predictors (which they called "statistical defect prediction") built using logistic regression. They found no significant differences in the cost-effectiveness of these approaches. Given this equivalence, it is significant to note that static code defect prediction can be quickly adapted to new languages by building lightweight parsers that extract information like that of Table 1.1. The same is not true for static code analyzers – these need extensive modification before they can be used on new languages.

1.2 Contributions

Over my five-year PhD study, I have tried to demonstrate how to reach this goal by using simple techniques to make software analytics easier. Based on those studies, I formulated the following thesis:

Software analytics should be simpler;

Software analytics can be simpler; However, oversimplification can be harmful.

In the above, by "simpler" we mean preferring simple methods over complex methods. Simple methods are fast, require fewer computational resources, are reproducible, and are easy to explain and understand; complex methods are slow, require days to weeks of CPU time to execute, are not reproducible, and are not easy to explain and understand.

This dissertation advances knowledge by several contributions. To defend that software analytics can be simpler, I offer evidence through two case studies:

Tuning for Software Analytics: The performance of software analytics can be improved from various perspectives and in different ways. One direction to reach this goal is to tune the hyper-parameters of existing software analytics methods. In Chapter 3, we show that by applying one simple search-based software engineering method, differential evolution (DE), to tune software quality prediction methods, the detection precision of the resulting models is improved on different datasets (in one extreme case, from 0% to 60%).


Differential Evolution vs. Grid Search: Although hyper-parameter tuning can improve software analytics, it is still a quite under-explored problem. After the success with differential evolution for tuning software analytics, we did a literature review of parameter tuning methods for software quality prediction. We found that 94% of the papers we surveyed did not conduct parameter tuning, and 4% mentioned that grid search was applied to tune parameters. Since it is easy to understand and implement and, to some extent, also has good performance, grid search is available in most popular data mining and machine learning tools, like the caret package [Kuh14] in R and the GridSearchCV module in Scikit-learn [Ped11]. In Chapter 4, we evaluate which method can find better parameters in terms of effectiveness (performance score) and economy (runtime cost). Experimental results show that the seemingly complete approach of grid search does no better, and sometimes worse, than the differential evolution method. Since the best results from grid search largely depend on the grid points set by the user, grid search is prone to skipping optimal configurations and wasting time exploring useless regions of the space. With the DE method, on the other hand, users do not need expert knowledge to set grid points; the method itself explores the search space automatically. We do not claim that DE is the best parameter tuner but, at the least, it is a good candidate when considering the trade-off between performance and runtime. Through this case study, we show that simple hyper-parameter tuning methods, like differential evolution, perform no worse than the resource-consuming grid search method for hyper-parameter tuning.
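To illustrate why grid search depends so heavily on user-chosen grid points, here is a hedged sketch (synthetic data as a stand-in for a defect data set; not the exact setup of Chapter 4) using Scikit-learn's GridSearchCV to tune CART:

from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import GridSearchCV

# stand-in for a defect data set (defective vs. non-defective modules)
X, y = make_classification(n_samples=500, n_features=20, weights=[0.8], random_state=1)

# the user must pick these grid points up front; configurations that fall
# between (or outside) them are never evaluated
param_grid = {
    "max_depth": [1, 5, 10, 25, 50],
    "min_samples_split": [2, 5, 10, 20],
    "min_samples_leaf": [1, 5, 10, 20],
}
grid = GridSearchCV(DecisionTreeClassifier(random_state=1),
                    param_grid, scoring="precision", cv=5)
grid.fit(X, y)   # 5 * 4 * 4 = 80 candidates, each cross-validated
print(grid.best_params_, round(grid.best_score_, 3))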

To defend the claim that simpler methods can reach parity with complex methods, with much lower execution costs and fewer barriers to understanding, I offer evidence through the following case study on text mining tasks:

Easy Over Hard: In Chapter 5, we extend a prior result from ASE'16 by Xu et al. [Xu16]. In their work, they described a deep learning method that explores large programmer discussion forums, then uncovers related, but separate, entries. In this study, we spent weeks collecting, cleaning, and transforming data from Stack Overflow. We strictly followed Xu et al.'s description to split the data and train the Word2vec model. To show that simple techniques can work as well as or better than complex ones, we apply DE to tune the SVM model (which served as the baseline method in Xu's paper). We show here that applying the very simple DE optimizer to fine-tune the SVM achieves similar (and sometimes better) results. The DE approach terminated in 10 minutes, while the deep learning system took 14 hours to execute; i.e., it is 84 times faster than the deep learning method.
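The following sketch shows the general shape of DE-based SVM tuning; it uses scipy's built-in differential evolution and synthetic features as stand-ins (the actual study tuned an SVM on word-embedding features built from the Stack Overflow corpus, with its own DE implementation):

from scipy.optimize import differential_evolution
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

# stand-in features; the real study used Word2vec-based features
X, y = make_classification(n_samples=400, n_features=50, random_state=1)

def loss(params):
    log_c, log_gamma = params
    svm = SVC(C=10 ** log_c, gamma=10 ** log_gamma)
    # DE minimizes, so return the negated cross-validated F1 score
    return -cross_val_score(svm, X, y, cv=3, scoring="f1").mean()

result = differential_evolution(loss, bounds=[(-2, 2), (-4, 0)],
                                maxiter=10, popsize=10, seed=1)
print("best (log10 C, log10 gamma):", result.x, " F1:", -result.fun)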

To defend the claim that oversimplification can be harmful, I offer evidence through the following case study on software quality prediction tasks:


Unsupervised Learners: Most defect predictors are supervised learners that learn models from project data labeled with, say, "defective" or "not-defective" for software quality prediction tasks. Most researchers use these supervised models since, it is argued, they can exploit more knowledge of the projects. At FSE'16, Yang et al. reported startling results in which unsupervised defect predictors outperformed supervised predictors for effort-aware just-in-time defect prediction. If confirmed, these results would lead to a dramatic simplification of a seemingly complex task (data mining) that is widely explored in the software engineering literature. In Chapter 6, we repeat and refute Yang et al.'s results: (1) there is much variability in the efficacy of the Yang et al. unsupervised predictors, so even with their approach, some supervised data is required to prune weaker predictors away; (2) their findings were grouped across N projects, and when we repeat their analysis on a project-by-project basis, supervised predictors are seen to work better. Therefore, when taking actions to simplify software analytics, we need to use caution to avoid pitfalls and make models more applicable.
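The core idea of OneWay can be sketched in a few lines: score every single-metric (unsupervised) predictor on the local training data and keep only the best one. The sketch below is a deliberate simplification added for illustration; it scores rankings with AUC, whereas the study itself uses effort-aware measures such as Popt and effort-aware recall:

import numpy as np
from sklearn.metrics import roc_auc_score

def one_way(train_X, train_y, test_X, metric_names):
    # pick the single change metric whose ranking best separates defective
    # from clean changes on the LOCAL training data, then use only that
    # metric to rank the test data
    scores = {}
    for j, name in enumerate(metric_names):
        ranking = 1.0 / (train_X[:, j] + 1e-6)   # Yang-style: rank changes by 1/metric
        scores[name] = roc_auc_score(train_y, ranking)
    best = max(scores, key=scores.get)
    j = metric_names.index(best)
    return best, 1.0 / (test_X[:, j] + 1e-6)     # higher = inspect earlier

# toy usage with random numbers standing in for real change metrics
rng = np.random.default_rng(1)
names = ["ns", "nd", "nf", "entropy", "la", "ld", "lt"]
Xtr, Xte = rng.random((200, 7)), rng.random((50, 7))
ytr = rng.integers(0, 2, 200)
print(one_way(Xtr, ytr, Xte, names))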

Simplifying software analytics is not an easy task in itself. Sometimes we find that a simple method works; most of the time, it does not. Therefore, it is necessary to have a criterion or indicator that tells us when simple methods could work. We have tried to explore this question and, in the following study, we show some preliminary results in this direction:


Part I

Software Analytics Should Be Simpler


CHAPTER 2

THE QUEST FOR SIMPLICITY IN SOFTWARE ANALYTICS

In this chapter, we argue that software analytics should be simpler. We study simplicity since it is very useful to replace N methods with M ≪ N methods, especially when the results from the many are no better than the few. Take a software quality prediction task as an example. Each year, a bewildering array of new methods for software quality prediction are reported (some of which rely on intimidatingly complex mathematical methods), such as deep belief net learning [Wan16], spectral-based clustering [Zha16], and n-gram language models [Ray16]. Ghotra et al. list dozens of different data mining algorithms that might be used for defect predictors [Gho15]. Fu and Menzies argue that these algorithms might require extensive tuning [Fu16a]. There are many ways to implement that tuning, some of which are very slow [Tan16]. And if that were not enough, other computationally expensive methods might also be required to handle issues like (say) class imbalance [AM18].


to cost" [Fis12]. Fisher et al. [Fis12] document the issues seen by 16 industrial data scientists, one of whom remarks:

"Fast iteration is key, but incompatible with the way jobs are submitted and processed in the cloud. It is frustrating to wait for hours, only to realize you need a slight tweak to your feature set".

Methods for improving the quality of modern software analytics have made this issue even more serious. There has been continuous development of new feature selection [HH03] and feature discovery [Jia13] techniques for software analytics, with the most recent ones focused on deep learning methods. These are all exciting innovations with the potential to dramatically improve the quality of our software analytics tools. Yet they are all CPU/GPU-intensive methods. For instance:

• Learning control settings for learners can take days to weeks to years of CPU time [Fu16b; Tan16; Wan13b].

• Lam et al. needed weeks of CPU time to combine deep learning and text mining to localize buggy files from bug reports [Lam15].

• Gu et al. spent 240 hours of GPU time to train a deep learning based method to generate API usage sequences for a given natural language query [Gu16].

Power consumption and heat dissipation issues block further exponential increases to CPU clock frequencies [Kum03]. Cloud computing environments are extensively monetized, so the total financial cost of training models can be prohibitive, particularly for long running tasks. For example, it would take 15 years of CPU time to learn the tuning parameters of the software clone detectors proposed in [Wan13b]. Much of that CPU time can be saved if there is a faster way.

Given recent advances in cloud computing, it is possible to find the best method for a particular data set via a "shoot out" between different methods. For example, Lessmann et al. [Les08] and Ghotra et al. [Gho15] explored 22 and 32 different learning algorithms (respectively) for software quality defect prediction. Such studies may require days to weeks of CPU time to complete [FM17b]. But are such complex and time-consuming studies necessary?

• If there exists some way to dramatically simplify software quality predictors, then those cloud-based resources would be better used for other tasks.

• Also, Lessmann and Ghotra et al. [Les08; Gho15] report that many defect prediction methods have equivalent performance.


Another reason to study simplification is that such studies can reveal the underlying nature of seemingly complex problems. In terms of core science, we argue that the better we understand something, the better we can match tools to SE. Tools which are poorly matched to task are usually complex and/or slow to execute.

Seeking simpler and/or faster solutions is not just theoretically interesting. It is also an approach currently in vogue in contemporary software engineering. Calero and Pattini [CP15] comment that "redesign for greater simplicity" also motivates much contemporary industrial work. In their survey of modern SE companies, they find that many current organizational redesigns are motivated (at least in part) by arguments based on "sustainability" (i.e., using fewer resources to achieve results). According to Calero and Pattini, sustainability is now a new source of innovation. Managers use sustainability-based redesigns to explore cost-cutting opportunities. In fact, they say, sustainability is now viewed by many companies as a mechanism for gaining a competitive advantage over their competitors. Hence, a manager might ask a programmer to assess simpler methods as a way to generate more interesting products.


Part II

Software Analytics Can Be Simpler


CHAPTER 3

TUNING FOR SOFTWARE ANALYTICS

This chapter first appeared in Information and Software Technology 76 (2016): 135-146, titled "Tuning for software analytics: is it really necessary?". In this chapter, using this study, we want to show that software analytics can be easier.


3.1 Introduction

One of the "black arts" of data mining is setting the tuning parameters that control the choices within a data miner. Prior to this work, our intuition was that tuning would change the behavior of a data miner, to some degree. Nevertheless, we rarely tuned our defect predictors since we reasoned that a data miner's default tunings have been well explored by the developers of those algorithms (in which case tuning would not lead to large performance improvements). Also, we suspected that tuning would take so long and be so CPU intensive that the benefits gained would not be worth the effort.

The results of this study show that the above points are false since, at least for defect prediction from code attributes: (1) tuning defect predictors is remarkably simple; and (2) it can dramatically improve performance. These results were found by exploring six research questions:

• RQ1: Does tuning improve the performance scores of a predictor? We will show below examples of truly dramatic improvement: usually by 5 to 20% and often by much more (in one extreme case, precision improved from 0% to 60%).

• RQ2: Does tuning change conclusions on what learners are better than others? Recent SE papers [Les08; Hal12] claim that some learners are better than others. Some of those conclusions are completely changed by tuning.

• RQ3: Does tuning change conclusions about what factors are most important in software engineering? Numerous recent SE papers (e.g. [Bel13; RD13; MS03; Mos08; Zim07; Her13]) use data miners to conclude that this is more important than that for reducing software project defects. Given the tuning results of this study, we show that such conclusions need to be revisited.

• RQ4: Is tuning easy? We show that one of the simpler multi-objective optimizers (differential evolution [SP97]) works very well for tuning defect predictors.

• RQ5: Is tuning impractically slow? We achieved dramatic improvements in the performance scores of our data miners in less than 100 evaluations (!); i.e., very quickly.

• RQ6: Should data miners be used "off-the-shelf" with their default tunings? For defect prediction from static code measures, our answer is an emphatic "no" (and the implication for other kinds of analytics is now an open and urgent question).


for conducting such tunings; (3) Tuning needs to be repeated whenever data or goals are changed. Fortunately, the cost of finding good tunings is not excessive since, at least for static code defect predictors, tuning is easy and fast.

3.2 Preliminaries

3.2.1 Tuning: Important and Ignored

This section argues that tuning is an under-explored aspect of software analytics – particularly in the apparently well-explored field of defect prediction.

In other fields, the impact of tuning is well understood [BB12]. Yet issues of tuning are rarely or poorly addressed in the defect prediction literature. When we tune a data miner, what we are really doing is changing how a learner applies its heuristics. This means tuned data miners use different heuristics, which means they ignore different possible models, which means they return different models; i.e. how we learn changes what we learn.

Are the impacts of tuning addressed in the defect prediction literature? To answer that question, in Jan 2016 we searched scholar.google.com for the conjunction of "data mining" and "software engineering" and "defect prediction" (more details can be found at https://goo.gl/Inl9nF). After sorting by citation count and discarding the non-SE papers (and those without a pdf link), we read over this sample of 50 highly-cited SE defect prediction papers. What we found in that sample was that few authors acknowledged the impact of tunings (exceptions: [Gao11; Les08]). Overall, 80% of papers in our sample did not adjust the "off-the-shelf" configuration of the data miner (e.g. [Men07b; Mos08; EE08]). Of the remaining papers:

• Some papers in our sample explored data super-sampling [PD07] or data sub-sampling techniques, via automatic methods (e.g. [Gao11; Men07b; PD07; Kim11]) or via some domain principles (e.g. [Mos08; Nag08; Has09]). As an example of the latter, Nagappan et al. [Nag08] checked if metrics related to organizational structure were relatively more powerful for predicting software defects. However, it should be noted that these studies varied the input data but not the "off-the-shelf" settings of the data miner.


navigate the large space of possible tunings.

Over our entire sample, there was only one paper that conducted a somewhat extensive tuning study. Lessmann et al. [Les08] tuned parameters for some of their algorithms using a grid search; i.e. divide all C configuration options into N values, then try all N^C combinations. This is a slow approach – we have explored grid search for defect prediction and found it takes days to terminate [Men07b]. Not only that, we found that grid search can miss important optimizations [Bak07]. Every grid has "gaps" between each grid division, which means that a supposedly rigorous grid search can still miss important configurations [BB12]. Bergstra and Bengio [BB12] comment that for most data sets only a few of the tuning parameters really matter – which means that much of the runtime associated with grid search is actually wasted. Worse still, Bergstra and Bengio comment that the important tunings are different for different data sets – a phenomenon that makes grid search a poor choice for configuring data mining algorithms for new data sets.
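A quick back-of-the-envelope calculation (added here for illustration, not from the original paper) shows how fast N^C grows, and why the fewer-than-100 evaluations reported below for DE is such a contrast:

from itertools import product

values_per_option = 10   # N
options = 5              # C
grid = list(product(range(values_per_option), repeat=options))
print(len(grid))         # 10**5 = 100000 configurations to evaluate
print(len(grid) // 100)  # ~1000x more evaluations than the ~100 DE needs here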

Since the Lessmann et al. paper, much progress has been made in configuration algorithms, and we can now report that finding useful tunings is very easy. This result is both novel and unexpected. The standard expectation for grid search (and other evolutionary algorithms) is that optimization requires thousands, if not millions, of evaluations. However, in a result that we found startling, differential evolution (described below) can find useful settings for learners generating defect predictors in less than 100 evaluations (i.e. very quickly). Hence, the "problem" (that tuning changes the conclusions) is really an exciting opportunity. At least for defect prediction, learners are very amenable to tuning. Hence, they are also very amenable to significant performance improvements. Given the low number of evaluations required, we assert that tuning should be standard practice for anyone building defect predictors.

3.2.2 You Can’t Always Get What You Want

Having made the case that tuning needs to be explored more, and before we get into the technical details of this paper, this section discusses some general matters about setting goals during tuning experiments.

This study characterizes tuning as an optimization problem (how to change the settings on the learner in order to best improve the output). With such optimizations, it is not always possible to optimize for all goals at the same time. For example, the following text does not show results for tuning on recall or false alarms since optimizing only for those goals can lead to some undesirable side effects:


• Recall is the percentage of the targeted examples (e.g. the defective modules) that are found by the learner. When we tune for recall, we can achieve near 100% recall by reporting nearly everything as belonging to the target class (so the false alarm rate rises towards 100%).

• False alarms is the percentage of the other examples that are reported (by the learner) to be part of the targeted examples. When we tune for false alarms, we can achieve near zero percent false alarm rates by effectively turning off the detector (so the recall falls to nearly zero).
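For reference, these measures, together with the precision and "F" measures used below, can all be read off a predictor's confusion matrix; the helper below is a sketch added for illustration, not code from the study:

def measures(tp, fn, fp, tn):
    recall = tp / (tp + fn)          # pd: fraction of defective modules found
    false_alarm = fp / (fp + tn)     # pf: clean modules wrongly flagged
    precision = tp / (tp + fp)       # fraction of flagged modules truly defective
    f_measure = 2 * precision * recall / (precision + recall)
    return recall, false_alarm, precision, f_measure

# a detector that flags every module scores recall = 1.0 but false_alarm = 1.0 too
print(measures(tp=20, fn=0, fp=105, tn=0))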

Accordingly, this study explores performance measures that comment on all target classes: see the precision and "F" measures discussed below under Optimization Goals. That said, we are sometimes asked what good is a learner if it optimizes for (say) precision at the expense of (say) recall.

Our reply is that software engineering is a very diverse enterprise and that different kinds of development need to optimize for different goals (which may not necessarily be “optimize for recall”):

• Anda, Sjoberg and Mockus are concerned with reproducibility and so assess their models using the "coefficient of variation" (CV = stddev/mean) [And09].

• Arisholm & Briand [AB06], Ostrand & Weyuker [Ost04] and Rahman et al. [Rah12] are concerned with reducing the workload associated with someone else reading a learned model, then applying it. Hence, they assess their models using reward; i.e. the fewest lines of code containing the most bugs.

• Yin et al. are concerned about incorrect bug fixes; i.e. those that require subsequent work in order to complete the bug fix. These bugs occur when (say) developers try to fix parts of the code where they have very little experience [Yin11]. Hence, they assess a learned model using a measure that selects for the largest number of bugs in regions that the most programmers have worked with before.

• For safety critical applications, high false alarm rates are acceptable if the cost of overlooking critical issues outweighs the inconvenience of inspecting a few more modules.

• When rushing a product to market, there is a business case to avoid the extra rework associated with false alarms. In that business context, managers might be willing to lower the recall somewhat in order to minimize the false alarms.

• When Dr. Menzies worked with contractors at NASA’s software independent verification and validation facility, he found new contractors only reported issues that were most certainly important defects; i.e. they minimized false alarms even if that damaged their precision (since, they felt, it was better to be silent than wrong). Later on, once those contractors had acquired a reputation of being insightful members of the team, they improved their precision scores (even if it means some more false alarms).


Rather, what we can show are examples where changing optimization goals can also change the conclusions made from that learner on that data. More generally, we caution that it is important not to overstate empirical results from analytics. Those results need to be expressed along with the context within which they are relevant (and by "context", we mean the optimization goal).

3.2.3 Notes on Data Miners

There are several ways to make defect predictors, e.g. using CART [Bre84], Random Forest [Bre01], WHERE [Men13] and LR (logistic regression). For this study, we use the CART, Random Forest and LR versions from Scikit-learn [Ped11] and WHERE, which is available from github.com/ai-se/where. We use these algorithms for the following reasons.

CART and Random Forest were mentioned in a recent IEEE TSE paper by Lessmann et al. [Les08] that compared 22 learners for defect prediction. That study ranked CART worst and Random Forest best. In a demonstration of the impact of tuning, this study shows that we can refute the conclusions of Lessmann et al. in the sense that, after tuning, CART performs just as well as Random Forest.

LR was mentioned by Hall et al. [Hal12] as usually being as good as or better than more complex learners (e.g. Random Forest). In a finding that endorses the Hall et al. result, we show that untuned LR performs better than untuned Random Forest (at least, for the data sets studied here). However, we will show that tuning raises doubts about the optimality of the Hall et al. recommendation.

Finally, this paper uses WHERE since, as shown below, it offers an interesting case study on the benefits of tuning.

3.2.4 Learners and Their Tunings

Our learners use the tuning parameters of Table 3.1. The default parameters for CART and Random Forest are set by the Scikit-learn authors, and the default parameters for the WHERE-based learner are set via our own expert judgment. When we say a learner is used "off-the-shelf", we mean that it uses the defaults shown in Table 3.1.

As to the value of those defaults, it could be argued that these defaults are not the best parameters for practical defect prediction. That said, prior to this study, two things were true:

• Many data scientists in SE use the standard defaults in their data miners, without tuning (e.g.[Men07b; Mos08; Her13; Zim07]).

• The effort involved to adjust those tunings seemed so onerous, that many researchers in this field were content to take our prior advice of “do not tune... it is just too hard”[Men14b].


Table 3.1 List of parameters tuned by this study.

Parameter | Default | Tuning Range | Description

WHERE-based Learner:
threshold | 0.5 | [0.01, 1] | The value to determine defective or not.
infoPrune | 0.33 | [0.01, 1] | The percentage of features to consider for the best split to build its final decision tree.
min_sample_split | 4 | [1, 10] | The minimum number of samples required to split an internal node of its final decision tree.
min_Size | 0.5 | [0.01, 1] | Finds min_samples_leaf in the initial clustering tree using n_samples^min_Size.
wriggle | 0.2 | [0.01, 1] | The threshold to determine which branch in the initial clustering tree is to be pruned.
depthMin | 2 | [1, 6] | The minimum depth of the initial clustering tree below which there is no pruning of the clustering tree.
depthMax | 10 | [1, 20] | The maximum depth of the initial clustering tree.
wherePrune | False | T/F | Whether or not to prune the initial clustering tree.
treePrune | True | T/F | Whether or not to prune the final decision tree.

CART:
threshold | 0.5 | [0, 1] | The value to determine defective or not.
max_feature | None | [0.01, 1] | The number of features to consider when looking for the best split.
min_sample_split | 2 | [2, 20] | The minimum number of samples required to split an internal node.
min_samples_leaf | 1 | [1, 20] | The minimum number of samples required to be at a leaf node.
max_depth | None | [1, 50] | The maximum depth of the tree.

Random Forests:
threshold | 0.5 | [0.01, 1] | The value to determine defective or not.
max_feature | None | [0.01, 1] | The number of features to consider when looking for the best split.
max_leaf_nodes | None | [1, 50] | Grow trees with max_leaf_nodes in best-first fashion.
min_sample_split | 2 | [2, 20] | The minimum number of samples required to split an internal node.
min_samples_leaf | 1 | [1, 20] | The minimum number of samples required to be at a leaf node.
n_estimators | 100 | [50, 150] | The number of trees in the forest.

Logistic Regression: This study uses untuned LR in order to check a conclusion of [Hal12].

larger tuning ranges might not result in greater improvements. However, for the goals of this study (to show that some tunings do matter), exploring just the ranges shown in Table 3.1 will suffice.

As to the details of these learners, LR is a parametric modeling approach. Given f = β0 + Σ_i β_i x_i, where x_i is some measurement in a data set and β_i is learned via regression, LR converts that into a function 0 ≤ g ≤ 1 using g = 1/(1 + e^(−f)). This function reports how much we believe in a particular class.
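Written out as code (a sketch added for illustration; in practice the β coefficients are fit by a regression package such as Scikit-learn's LogisticRegression), the LR prediction function is:

import numpy as np

def logistic_predict(beta0, beta, x):
    # g = 1 / (1 + e^(-f)) with f = beta0 + sum_i beta_i * x_i
    f = beta0 + np.dot(beta, x)
    return 1.0 / (1.0 + np.exp(-f))

# toy example: two code metrics with hand-picked coefficients
print(logistic_predict(-1.0, np.array([0.02, 0.5]), np.array([120.0, 3.0])))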


inspect = Yes if d_i ≥ T; No if d_i < T,    (3.1)

where d_i is the number of observed issues and T is some threshold defined by an engineering judgment; we use T = 1.

The splitting process is controlled by numerous tuning parameters. If the data contains more than min_sample_split samples, then a split is attempted. On the other hand, if a split contains no more than min_samples_leaf samples, then the recursion stops. CART and Random Forest use a user-supplied constant for this parameter, while the WHERE-based learner first computes this parameter m = min_samples_leaf from the size of the data set via m = size^min_Size to build an initial clustering tree. Note that WHERE builds two trees: the initial clustering tree (to find similar sets of data) and then a final decision tree (to learn rules that predict for each similar cluster). A frequently asked question is why WHERE builds two trees – would not a single tree suffice? The answer is, as shown below, that tuned WHERE's twin-tree approach generates very precise predictors. As to the rest of WHERE's parameters, the parameter min_sample_split controls the construction of the final decision tree (so, for the WHERE-based learner, min_Size and min_sample_split are the parameters to be tuned).

These learners use different techniques to explore the splits: CART finds the attributes whose ranges contain rows with the least variance in the number of defects. If an attribute range r_i is found in n_i rows, each with a defect count variance of v_i, then CART seeks the attributes whose ranges minimize Σ_i (√v_i × n_i) / Σ_i n_i; Random Forest divides data like CART, then builds F > 1 trees, each time using some random subset of the attributes; when building the initial cluster tree, WHERE projects the data onto a dimension it synthesizes from the raw data using a process analogous to principal component analysis [Jol02]. WHERE divides at the median point of that projection. On recursion, this generates the initial clustering tree, the leaves of which are clusters of very similar examples. After that, when building the final decision tree, WHERE pretends its clusters are "classes", then asks the InfoGain algorithm of the Fayyad-Irani discretizer [FI93] to rank the attributes, where infoPrune is used. WHERE's final decision tree generator then ignores everything except the top infoPrune percent of the sorted attributes.
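The CART split score described above can be written out explicitly; the function below is a sketch for illustration, not Scikit-learn's internal implementation:

import numpy as np

def split_score(groups):
    # expected sqrt-variance of defect counts after a candidate split:
    # sum_i sqrt(v_i) * n_i / sum_i n_i, where group i has n_i rows with
    # defect-count variance v_i; CART prefers splits that minimize this
    sizes = np.array([len(g) for g in groups], dtype=float)
    variances = np.array([np.var(g) for g in groups])
    return np.sum(np.sqrt(variances) * sizes) / np.sum(sizes)

defects = [0, 0, 0, 1, 4, 5, 6, 7]
print(split_score([defects[:4], defects[4:]]))     # informative split: purer groups, lower score
print(split_score([defects[::2], defects[1::2]]))  # uninformative split: higher score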

Finally, if a decision sub-tree and its parent have the same majority cluster (the one that occurs most frequently), then, if tree_prune is enabled, we prune that decision sub-tree.

3.2.5 Tuning Algorithms

How should researchers select which optimizers to apply to tuning data miners? Cohen [Coh95] advises comparing new methods against the simplest possible alternative. Similarly, Holte [Hol93] recommends using very simple learners as a kind of "scout" for a preliminary analysis of a data set (to check whether that data really requires a more complex analysis). Accordingly, to find our "scout", we used engineering judgment to sort candidate algorithms from simplest to most complex. For example, here is a list of optimizers used widely in research: simulated annealing [FM02; Men07a]; various genetic algorithms [Gol79] augmented by techniques such as differential evolution [SP97], tabu search and scatter search [GM86; Bea06a; Mol07; Neb08]; particle swarm optimization [Pan08]; and numerous decomposition approaches that use heuristics to decompose the total space into small problems, then apply a response surface method [Kra15b; Zul13]. Of these, the simplest are simulated annealing (SA) and differential evolution (DE), each of which can be coded in less than a page of some high-level scripting language. Our reading of the current literature is that there are more advocates for differential evolution than for SA. For example, Vesterstrom and Thomsen [VT04] found DE to be competitive with particle swarm optimization and other genetic algorithms.

DEs have been applied before for parameter tuning (e.g., see [Omr05; Chi12]), but this is the first time they have been applied to optimize defect prediction from static code attributes. The pseudocode for differential evolution is shown in Algorithm 1. In the following description, line markers such as L4 refer to lines in that pseudocode.

DE evolves a NewGeneration of candidates from a current Population. Our DE loses one "life" when the new population is no better than the current one (terminating when "life" is zero) (L4). Each candidate solution in the Population is a pair of (Tunings, Scores). Tunings are selected from Table 3.1 and Scores come from training a learner using those parameters and applying it to test data (L23-L27).

The premise of DE is that the best way to mutate the existing tunings is to Extrapolate (L28) between current solutions. Three solutions $a$, $b$, $c$ are selected at random. For each tuning parameter $i$, at some probability $cr$, we replace the old tuning $x_i$ with $y_i$. For booleans, we use $y_i = \neg x_i$ (see line 36). For numerics, $y_i = a_i + f \times (b_i - c_i)$, where $f$ is a parameter controlling cross-over. The trim function (L38) limits the new value to the legal range min..max of that parameter.

The main loop of DE (L6) runs over the Population, replacing old items with new Candidates (if the new candidate is better). This means that, as the loop progresses, the Population is full of increasingly more valuable solutions. This, in turn, also improves the candidates, which are Extrapolated from the Population.


Algorithm 1 Pseudocode for DE with Early Termination

Input: np = 10, f = 0.75, cr = 0.3, life = 5, Goal ∈ {pd, f, ...}
Output: Sbest

 1: function DE(np, f, cr, life, Goal)
 2:   Population ← InitializePopulation(np)
 3:   Sbest ← GetBestSolution(Population)
 4:   while life > 0 do
 5:     NewGeneration ← ∅
 6:     for i = 0 → np − 1 do
 7:       Si ← Extrapolate(Population[i], Population, cr, f)
 8:       if Score(Si) > Score(Population[i]) then
 9:         NewGeneration.append(Si)
10:       else
11:         NewGeneration.append(Population[i])
12:       end if
13:     end for
14:     Population ← NewGeneration
15:     if ¬Improve(Population) then
16:       life −= 1
17:     end if
18:     Sbest ← GetBestSolution(Population)
19:   end while
20:   return Sbest
21: end function

22: function Score(Candidate)
23:   # set tuned parameters according to Candidate
24:   model ← TrainLearner()
25:   result ← TestLearner(model)
26:   return Goal(result)
27: end function

28: function Extrapolate(old, pop, cr, f)
29:   a, b, c ← threeOthers(pop, old)
30:   newf ← ∅
31:   for i = 0 → np − 1 do
32:     if cr < random() then
33:       newf.append(old[i])
34:     else
35:       if typeof(old[i]) == bool then
36:         newf.append(not old[i])
37:       else
38:         newf.append(trim(i, (a[i] + f ∗ (b[i] − c[i]))))
39:       end if
40:     end if
41:   end for
42:   return newf
43: end function
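For concreteness, the sketch below is a minimal Python rendering of Algorithm 1 (an illustration under simplifying assumptions, not the dissertation's actual tuner). It tunes three CART parameters from Table 3.1 using scikit-learn's DecisionTreeClassifier, with precision on a tuning set as the Goal; the helper names (de_tune, new_candidate, RANGES, and so on) are hypothetical and introduced only for this sketch.

```python
import random
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import precision_score

# Tuning ranges for a subset of Table 3.1 (scikit-learn parameter names).
RANGES = {
    "max_depth":         (1, 50),
    "min_samples_split": (2, 20),
    "min_samples_leaf":  (1, 20),
}

def new_candidate():
    """A random Tunings vector drawn from RANGES."""
    return {k: random.randint(lo, hi) for k, (lo, hi) in RANGES.items()}

def score(cand, train, tune):
    """Algorithm 1's Score(): train with the candidate tunings, score on the tuning set."""
    (Xtr, ytr), (Xtu, ytu) = train, tune
    model = DecisionTreeClassifier(**cand).fit(Xtr, ytr)
    return precision_score(ytu, model.predict(Xtu), zero_division=0)

def trim(key, value):
    """Clamp a mutated value back into its legal range."""
    lo, hi = RANGES[key]
    return int(max(lo, min(hi, value)))

def extrapolate(old, pop, cr, f):
    """Mutate 'old' by extrapolating between three other candidates a, b, c."""
    a, b, c = random.sample([p for p in pop if p is not old], 3)
    return {k: trim(k, a[k] + f * (b[k] - c[k])) if random.random() < cr else old[k]
            for k in RANGES}

def de_tune(train, tune, np_=10, f=0.75, cr=0.3, life=5):
    population = [new_candidate() for _ in range(np_)]
    scores = [score(c, train, tune) for c in population]
    best = max(scores)
    while life > 0:
        improved = False
        for i in range(np_):
            cand = extrapolate(population[i], population, cr, f)
            s = score(cand, train, tune)
            if s > scores[i]:                     # keep the better of old vs. new
                population[i], scores[i] = cand, s
            if s > best:
                best, improved = s, True
        if not improved:                          # no progress: lose one "life"
            life -= 1
    return population[scores.index(max(scores))]  # Sbest
```

For brevity this sketch updates the population in place rather than building a separate NewGeneration, and it counts down "life" whenever a full pass fails to improve the best score; Algorithm 1 keeps the two generations distinct.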


Table 3.2 Data used in this experiment. E.g., the top left data set has 20 defective classes out of 125 total. See §3.3.1 for an explanation of the training, tuning, and testing sets.

Dataset    antV0    antV1    antV2    camelV0   camelV1   ivy      jeditV0   jeditV1   jeditV2
training   20/125   40/178   32/293   13/339    216/608   63/111   90/272    75/306    79/312
tuning     40/178   32/293   92/351   216/608   145/872   16/241   75/306    79/312    48/367
testing    32/293   92/351   166/745  145/872   188/965   40/352   79/312    48/367    11/492

Dataset    log4j    lucene   poiV0    poiV1     synapse   velocity  xercesV0  xercesV1
training   34/135   91/195   141/237  37/314    16/157    147/196   77/162    71/440
tuning     37/109   144/247  37/314   248/385   60/222    142/214   71/440    69/453
testing    189/205  203/340  248/385  281/442   86/256    78/229    69/453    437/588

3.3 Experimental Design

3.3.1 Data Sets

Our defect data comes from the PROMISE repository (http://openscience.us/repo/defect) and pertains to open source Java systems defined in terms of Table 1.1: ant, camel, ivy, jedit, log4j, lucene, poi, synapse, velocity and xerces.

An important principle in data mining is not to test on the data used in training. There are many ways to design an experiment that satisfies this principle. Some of those methods have limitations; e.g., leave-one-out is too slow for large data sets, and cross-validation mixes up older and newer data (such that data from the future may be used to build models that are tested on data from the past).

To avoid these problems, we used an incremental learning approach. The following experiment ensures that the training data was created at some time before the test data. For this experiment, we use data sets with at least three consecutive releases (where release i+1 was built after release i). When tuning a learner, the first release was used on line 24 of Algorithm 1 to build a model using the tunings found in some Candidate; the second release was used on line 25 of Algorithm 1 to test the candidate model found on line 24; finally, the third release was used to gather the performance statistics reported below from the best model found by DE.

To be fair to the untuned learner, the first and second releases used in the tuning experiments are combined into the training data used to build its model. Then the performance of this untuned learner is evaluated on the same third release as in the tuning experiment.
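A sketch of this release-based protocol (illustrative only; `releases` is assumed to be a chronologically ordered list of (X, y) pairs for one project):

```python
def release_splits(releases):
    """Yield (train, tune, test) triples built from consecutive releases i, i+1, i+2."""
    for i in range(len(releases) - 2):
        yield releases[i], releases[i + 1], releases[i + 2]

# Tuned learner:   fit on train, score DE candidates on tune, report results on test.
# Untuned learner: fit on train + tune combined, report on the same test release.
```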

Some data sets have more than three releases and, for those data, we could run more than one experiment. For example, ant has five versions in PROMISE, so we ran three experiments called V0, V1, V2:

• AntV0: first, second, third = versions 1, 2, 3

• AntV1: first, second, third = versions 2, 3, 4

• AntV2: first, second, third = versions 3, 4, 5

These data sets are displayed in Table 3.2.

3.3.2 Optimization Goals

Recall from Algorithm 1 that we call differential evolution once for each optimization goal. This section lists those optimization goals. Let {A, B, C, D} denote the true negatives, false negatives, false positives, and true positives (respectively) found by a binary detector. Certain standard measures can be computed from A, B, C, D, as shown below. Note that for pf, the better scores are smaller, while for all other scores, the better scores are larger.

$$
\begin{aligned}
pd &= recall = D/(B + D)\\
pf &= C/(A + C)\\
prec &= precision = D/(D + C)\\
F &= 2 \cdot pd \cdot prec\,/\,(pd + prec)
\end{aligned}
$$

The rest of this study explores tuning for prec and F. As discussed in §3.2.2, our point is not that these are the best or most important optimization goals. Indeed, the list of "most important" goals is domain-specific (see §3.2.2), and we only explore these two to illustrate how conclusions can change dramatically when moving from one goal to another.
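These measures follow directly from the four confusion-matrix counts; the small illustrative helper below (with hypothetical example counts) computes all four:

```python
def measures(A, B, C, D):
    """A, B, C, D = true negatives, false negatives, false positives, true positives."""
    pd   = D / (B + D)                   # recall
    pf   = C / (A + C)                   # false alarm (smaller is better)
    prec = D / (D + C)                   # precision
    F    = 2 * pd * prec / (pd + prec)   # harmonic mean of pd and prec
    return {"pd": pd, "pf": pf, "prec": prec, "F": F}

print(measures(A=50, B=10, C=5, D=35))
```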

3.4 Experimental Results

In the following, we explore the effects of tuning WHERE, Random Forest, and CART. LR will be used, untuned, in order to check one of the recommendations made by Hall et al. [Hal12].

3.4.1 RQ1: Does Tuning Improve Performance?

Figure 3.1 says that the answer to RQ1 is "yes": tuning has a positive effect on performance scores. This figure sorts the deltas in precision and F-measure between tuned and untuned learners. Our reading of this figure is that, overall, tuning rarely makes performance worse and often makes it much better.

Table 3.3 and Table 3.4 show the specific values seen before and after tuning with precision and "F" as the optimization goals (the corresponding "F" and precision values for Table 3.3 and Table 3.4, respectively, are not provided due to space limitations). For each data set, the maximum precision or "F" value is shown in bold. As might have been predicted by Lessmann et al. [Les08], untuned CART is indeed the worst learner (only one of its untuned results is best and bold). And, in 12

Figure 3.1 Deltas in performance seen in Table 3.3 (left) and Table 3.4 (right) between tuned and untuned learners. Tuning improves performance when the deltas are above zero. [Two panels, "precision" (left) and "F" (right), plot the deltas for WHERE, CART, and Random Forest over the 17 data sets, sorted; the y-axis spans -50 to 100.]

That said, tuning can improve those poorly performing detectors. In some cases, the median changes may be small (e.g., the "F" results for WHERE and Random Forests), but even in those cases, there are enough large changes to motivate the use of tuning. For example:

• For "F", there are two improvements of over 25% for both WHERE and Random Forests. Also, in poiV0, all untuned learners report an "F" under 50%, and tuning changes those scores by 25%. Finally, note the xercesV1 result for the WHERE learner: here, tuning changes "F" from 32% to 70%.

• Regarding precision, for antV0 and antV1, untuned WHERE reports a precision of 0, but tuned WHERE scores 35 and 60 (a similar pattern can be seen in "F").

3.4.2 RQ2: Does Tuning Change a Learner's Ranking?

Researchers often use performance criteria to assert that one learner is better than another [Les08; Men07b; Hal12]. For example:

1. Lessmann et al. [Les08] conclude that Random Forest is statistically better than CART.


Table 3.3 Precision results (best results shown in bold).

             WHERE             CART              Random Forest
Data set     default   Tuned   default   Tuned   default   Tuned
antV0          0        35       15       60       21       44
antV1          0        60       54       56       67       50
antV2         45        55       42       52       56       67
camelV0       20        30       30       50       28       79
camelV1       27        28       38       28       34       27
ivy           25        21       21       26       23       20
jeditV0       34        37       56       78       52       60
jeditV1       30        42       32       64       32       37
jeditV2        4        22        6       17        4        6
log4j         96        91       95       98       95      100
lucene        61        75       67       70       63       77
poiV0         70        70       65       71       67       69
poiV1         74        76       72       90       78      100
synapse       61        50       50      100       60       60
velocity      34        44       39       44       40       42
xercesV0      14        17       17       14       28       14
xercesV1      86        54       72      100       78       27

Table 3.4 F-measure results (best results shown in bold).

             WHERE             CART              Random Forest
Data set     default   Tuned   default   Tuned   default   Tuned
antV0          0        20       20       40       28       38
antV1          0        38       37       49       38       49
antV2         47        50       45       49       57       56
camelV0       31        28       39       28       40       30
camelV1       34        34       38       32       42       33
ivy           39        34       28       40       35       33
jeditV0       45        47       56       57       63       59
jeditV1       43        44       44       47       46       48
jeditV2        8        11       10       10        8        9
log4j         47        50       53       37       60       47
lucene        73        73       65       72       70       76
poiV0         50        74       31       64       45       77
poiV1         75        78       68       69       77       78
synapse       49        56       43       60       52       53
velocity      51        53       53       51       56       51
xercesV0      19        22       19       26       34       21
xercesV1      32        70       34       35       42       71

such as Random Forest. To explain that comment, we note that by three measures, Random Forest is more complicated than LR:

(a) LR builds one model while Random Forest builds many models.

(b) LR is just a model construction tool while Random Forest needs both a tool to construct its forest and a second tool to infer some conclusion from all the members of that forest.

(c) The LR model can be printed in a few lines while the multiple models learned by Random Forest would take up multiple pages of output.
