JExtract: An Eclipse Plug-in for Recommending Automated Extract Method Refactorings
Abstract
Although Extract Method is a key refactoring for improving program comprehension, refactoring tools for such purpose are often underused. To address this shortcoming, we present JExtract, a recommendation system based on structural similarity that identifies Extract Method refactoring opportunities that are directly automated by IDE-based refactoring tools. Our evaluation suggests that JExtract is more effective (w.r.t. recall and precision) to identify contiguous misplaced code in methods than JDeodorant, a state-of-the-art tool.
Tool demonstration video. http://youtu.be/6htJOzXwRNA
1 Introduction
Refactoring has increased in importance as a technique for improving the design of existing code [2], e.g., to increase cohesion, decrease coupling, foster maintainability, etc. Particularly, Extract Method is a key refactoring for improving program comprehension. Besides promoting reuse and reducing code duplication, it contributes to readability and comprehensibility, by encouraging the extraction of self-documenting methods [2].
Nevertheless, recent empirical research indicate that, while Extract Method is one of the most common refactorings, automated tools supporting this refactoring are most of the times underused [5, 4]. For example, Negara et al. found that Extract Method is the third most frequent refactoring, but the number of developers who apply the refactoring manually is higher than the number of those who do it automatically [5]. Moreover, current tools focus only on automating refactoring application, but developers expend considerable effort on the manual identification of refactoring opportunities.
To address this shortcoming, this paper presents , a tool that implements a novel approach for recommending automated Extract Method refactorings. The tool was designed as a plug-in for the Eclipse IDE that automatically identifies, ranks, and applies the refactoring when requested. Thereupon, may aid developers to find refactoring opportunities and contribute to a widespread adoption of refactoring practices. The underlying technique is inspired by the separation of concerns design guideline. More specifically, we assume that the structural dependencies established by Extract Method candidates should be very different from the ones established by the remaining statements in the original method.
2 The JExtract tool
is a tool that analyzes the source code of methods and recommends Extract Method refactoring opportunities, as illustrated in Figure 1. First, the tool generates all Extract Method possibilities for each method. Second, these possibilities are ranked according to a scoring function based on the similarity between sets of dependencies established in the code.
This main section of the paper is organized as follows. Subsection 2.1 provides an overview of our approach for identifying Extract Method refactoring opportunities. Subsection 2.2 describes the design and implementation of the tool. Finally, Subsection 2.3 presents the results of our evaluation in open-source systems. A detailed description of the recommendation technique behind JExtract is present in a recent full technical paper [9].
2.1 Proposed Approach
The approach is divided in three phases: Generation of Candidates, Scoring, and Ranking.
2.1.1 Generation of candidates
This phase is responsible for identifying all possible Extract Method refactoring opportunities. First, we split the methods into blocks, which consist of sequential statements that follow a linear control flow. As an example, Figure 2 presents method of class , extracted from ArgoUML. We can notice that each statement is labeled using the pattern, where and denote the block and the statement, respectively. For example, is the third statement of the second block, which declares a variable .

Input: A method
Output: List with Extract Method candidates
Fundamentally, the three nested loops in Algorithm 1 (lines 2, 4, and 5) enforce that the list of selected statements attend the following preconditions:
-
Only continuous statements inside a block are selected. In Figure 2, for example, it is not possible to select a candidate with and without including .
-
The selected statements are part of a single block of statements. In Figure 2, for example, it is not possible to generate a candidate with both and since they belong to distinct blocks.
-
When a statement is selected, the respective children statements are also included. In Figure 2, for example, when statement is selected, its children statements to are also included.
Last but not least, we do not ensure that every iteration of the loop yields an Extract Method candidate because: (i) a candidate recommendation must respect a size threshold defined by parameter Minimum Extracted Statements. The value is preset to (changeable), which means that an Extract Method candidate has to have at least three statements; and (ii) a candidate recommendation must respect the preconditions defined by the Extract Method refactoring engine.
2.1.2 Scoring
This phase is responsible for scoring the possible Extract Method refactoring opportunities generated in the previous phase, using a technique inspired by a Move Method recommendation heuristic [7]. Assume as the selection of statements of an Extract Method candidate and the remaining statements in the original method . The proposed heuristic aims to minimize the structural similarity between and .
Structural Dependencies: The set of dependencies established by a selection of statements with variables, types, and packages is denoted by , , and , respectively. These sets are constructed as described next.
-
Variables: If a statement from a selection of statements declares, assigns, or reads a variable , then is added to . In a similar way, reads from and writes to formal parameters and fields are considered.
-
Types: If a statement from a selection of statements uses a type (class or interface) , then is added to .
-
Packages: For each type included in , as described in the previous item, the package where is implemented and all its parent packages are also included in .
For instance, assume as the highlighted code in Figure 2 (i.e., an Extract Method candidate) and the remaining statements in the original method .
On one hand,
.
On the other hand, the set .
In this case, the intersection between these two sets contains only . Moreover, the computation of and is isolated from the remaining code. Therefore, one can claim that is cohesive and decoupled from , i.e., a good separation of concerns is achieved.
Scoring Function: To compute the dissimilarity between and , we rely on the distance between the dependency sets and using the Kulczynski similarity coefficient [10, 7]:
where , , and .
Thus, let be the selection of statements of an Extract Method candidate for method . Let also be the remaining statements in . The score of is defined as:
The scoring function is centered on the observation that a good Extract Method candidate should encapsulate the use of variables, types, and packages. In other words, we should maximize the distance between the dependency sets and .
2.1.3 Ranking
This phase is responsible for ranking and filtering the Extract Method candidates based on the score computed in the previous phase. Basically, we sort the candidates and filter them according to the following parameters: (i) Maximum Recommendations per Method. The value is preset to 3 (changeable), which means that the tool triggers up to three recommendations for each method; and (ii) Minimum Score Value, which has to be configured when the user desires to setup a minimum dissimilarity threshold.
2.2 Internal Architecture and Interface
We implemented as a plug-in on top of the Eclipse platform.
Therefore, we rely mainly on native Eclipse APIs, such as Java Development Tools (JDT)
and Language Toolkit (LTK).
The current implementation follows an architecture with five main modules:
-
Code Analyzer: This module provides the following services to other modules: (a) it builds the structure of block and statements (refer to Subsection 2.1.1); (b) it extracts the structural dependencies (refer to Subsection 2.1.2); and (c) it checks if an Extract Method candidate satisfies the underlying Eclipse Extract Method refactoring preconditions. In fact, this module contains most communication between and Eclipse APIs (e.g., and ).
-
Candidate Generator: This module generates all Extract Method candidates based on Algorithm 1 and hence depends on service (a) of module Code Analyzer.
-
Scorer: This module calculates the dissimilarity of the Extract Method candidates generated by module Candidate Generator (refer to Subsection 2.1.2) and hence depends on service (b) of module Code Analyzer.
-
Ranker: This module ranks and filters the Extract Method candidates generated by module Candidate Generator and scored by module Scorer. It depends on service (c) of module Code Analyzer to filter candidates not satisfying preconditions.
-
UI: This module consists of the front-end of the tool, which relies on the Eclipse UI API () to implement two menu extensions, six actions, and one main view. Moreover, it depends on module UI from LTK () to delegate the refactoring appliance to the underlying Eclipse Extract Method refactoring tool.
Such architecture permits the extension of our tool. For example, the Scorer module may be replaced by one that employs a new heuristic based on semantic and structural information. As another example, the Candidate Generator module may be extended to support the identification of non-contiguous code fragments.

Figure 3 presents ’s UI, displaying method previously presented in Figure 2. When a developer triggers to identify Extract Method refactoring opportunities for this method, it opens the Extract Method Recommendations view to report the potential recommendations. In this case, the best candidate consists of the extraction of statements to whose dissimilarity score is 0.7148.
2.3 Evaluation
We conducted two different but complementary empirical studies.
Study #1: In our previous paper [9], we evaluated the recommendations provided by our tool on three systems to assess precision and recall. We extended this study to consider minor modifications to the ranking method and to compare the results with , a state-of-the-art tool that identifies Extract Method opportunities [11]. For each system , we apply random Inline Method refactoring operations to obtain a modified version . We assume that good Extract Method opportunities are the ones that revert the modifications (i.e., restoring from ).
JExtract | JDeodorant | ||||||||
---|---|---|---|---|---|---|---|---|---|
Top-1 | Top-2 | Top-3 | |||||||
System | # | Recall | Prec. | Recall | Prec. | Recall | Prec. | Recall | Prec. |
JHotDraw 5.2 | 56 | 19 (34%) | 34% | 26 (46%) | 24% | 32 (57%) | 20% | 2 (4%) | 5% |
JUnit 3.8 | 25 | 13 (52%) | 52% | 16 (64%) | 33% | 18 (72%) | 25% | 0 (0%) | 0% |
MyWebMarket | 14 | 12 (86%) | 86% | 14 (100%) | 50% | 14 (100%) | 33% | 2 (14%) | 33% |
Total | 95 | 44 (46%) | 46% | 56 (59%) | 30% | 64 (67%) | 23% | 4 (4%) | 6% |
Table 1 reports recall and precision values achieved using with three different configurations (Top-k Recommendations per Method). While a high parameter value favors recall (e.g., Top-3), a low one favors precision (e.g., Top-1). Table 1 also presents results achieved using with its default settings. As the main finding, outperforms regardless of the configuration used.
Study #2: We replicate the previous study in other ten popular open-source Java systems to assess how the precision and recall rates would vary. Nevertheless, we do not compare our results with since we were not able to reliably provide the source code of all required libraries, as demanded by JDeodorant.
JExtract | |||||||
---|---|---|---|---|---|---|---|
Top-1 | Top-2 | Top-3 | |||||
System | # | Recall | Prec. | Recall | Prec. | Recall | Prec. |
Ant 1.8.2 | 964 | 235 (24.4%) | 24.4% | 363 (37.7%) | 19.1% | 460 (47.7%) | 16.3% |
ArgoUML 0.34 | 439 | 98 (22.3%) | 22.3% | 160 (36.4%) | 18.3% | 186 (42.4%) | 14.4% |
Checkstyle 5.6 | 533 | 227 (42.6%) | 42.6% | 338 (63.4%) | 31.9% | 389 (73.0%) | 24.7% |
FindBugs 1.3.9 | 714 | 179 (25.1%) | 25.1% | 278 (38.9%) | 19.7% | 350 (49.0%) | 16.7% |
FreeMind 0.9.0 | 348 | 85 (24.4%) | 24.4% | 134 (38.5%) | 19.4% | 181 (52.0%) | 17.8% |
JFreeChart 1.0.13 | 1,090 | 204 (18.7%) | 18.7% | 396 (36.3%) | 18.2% | 536 (49.2%) | 16.5% |
JUnit 4.10 | 35 | 11 (31.4%) | 32.4% | 17 (48.6%) | 26.6% | 22 (62.9%) | 23.7% |
Quartz 1.8.3 | 239 | 99 (41.4%) | 41.4% | 125 (52.3%) | 26.5% | 142 (59.4%) | 20.4% |
SQuirreL SQL 3.1.2 | 39 | 15 (38.5%) | 38.5% | 18 (46.2%) | 23.7% | 20 (51.3%) | 18.2% |
Tomcat 7.0.2 | 1,076 | 214 (19.9%) | 19.9% | 325 (30.2%) | 15.2% | 409 (38.0%) | 12.8% |
Total | 5,477 | 1,367 (25.0%) | 25.0% | 2,154 (39.3%) | 19.8% | 2,695 (49.2%) | 16.7% |
Table 2 reports the recall and precision values achieved using the same settings from the previous study. On one hand, the overall recall value ranges from 25% to 49.2%. On the other hand, the overall precision value ranges from 25% to 16.7%. We argue these values are acceptable for two reasons: (i) we only consider as correct a recommendation that matches exactly the one at the oracle; thus, a slight difference of including (or excluding) a statement is enough to be considered a miss; and (ii) the modified methods may have preexisting Extract Method opportunities, besides the ones we introduced, that will be considered wrong by our oracle.
3 Related Tools
Recent empirical research shows that automated refactoring tools, especially those supporting Extract Method refactorings, are most of the times underused [5, 4]. In view of such circumstances, recent studies on identification of refactoring opportunities are seeking to address this shortcoming. In this paper, we implemented our approach in a way that it can be straightforwardly incorporated to the current development process through a tool that identifies, ranks, and automate Extract Method refactoring opportunities [9].
is the refactoring recommendation system our approach is inspired by [7, 6]. The tool identifies Move Method refactoring opportunities based on the similarity between dependency sets [7]. More specifically, it computes the similarity of the set of dependencies established by a given method with (i) the methods of its own class and (ii) the methods in other classes of the system (). Whereas JMove recommends moving a method to a more similar class , our current approach recommends extracting a fragment from a given method into a new method when there is a high dissimilarity between and the remainder statements in .
is the state-of-the-art system to identify and apply common refactoring operations in Java systems, including Extract Method [11]. In contrast to our approach that relies on the similarity between dependency sets, relies on the concept of program slicing to select related statements that can be extracted into a new method. Our approach, on the other hand, is not based on specific code patterns (such as a computation slice). It is also more conservative to preserve program behavior (although it is currently restricted to non-contiguous fragments of code), and it relies on a scoring function to rank and filter recommendations.
4 Final Remarks
implements a novel approach for recommending automated Extract Method refactorings. The tool was designed as a plug-in for the Eclipse IDE that automatically identifies, ranks, and applies the refactoring. Thereupon, the tool may contribute to increase the popularity of IDE-based refactoring tools, which are normally considered underused by most recent empirical studies on refactoring. Moreover, our evaluation indicates that is more effective (w.r.t. recall and precision) to identify contiguous misplaced code in methods than , a state-of-the-art tool.
As ongoing work, we are extending to be able to do statement reordering to uncover better Extract Method opportunities, as long as the modification preserves the behavior of the original code. Moreover, we intend to evaluate our tool with human experts to mitigate the threat that the synthesized datasets did not capture the full spectrum of Extract Method instances faced by developers. Last, we also intend to support other kinds of refactoring (e.g., Move Method).
Acknowledgments: Our research is supported by CAPES, FAPEMIG, and CNPq.
References
- [1] G. Bavota, R. Oliveto, M. Gethers, D. Poshyvanyk, and A. D. Lucia. Methodbook: Recommending move method refactorings via relational topic models. IEEE Transactions on Software Engineering, pages 1–26, 2014.
- [2] M. Fowler. Refactoring: Improving the design of existing code. Addison-Wesley, 1999.
- [3] R. Marinescu. Detection strategies: Metrics-based rules for detecting design flaws. In 20th International Conference on Software Maintenance (ICSM), pages 350–359, 2004.
- [4] E. R. Murphy-Hill, C. Parnin, and A. P. Black. How we refactor, and how we know it. IEEE Transactions on Software Engineering, 38(1):5–18, 2012.
- [5] S. Negara, N. Chen, M. Vakilian, R. E. Johnson, and D. Dig. A comparative study of manual and automated refactorings. In 27th European Conference on Object-Oriented Programming (ECOOP), pages 552–576, 2013.
- [6] V. Sales, R. Terra, L. F. Miranda, and M. T. Valente. JMove: Seus métodos em classes apropriadas. In IV Brazilian Conference on Software: Theory and Practice (CBSoft), Tools Session, pages 1–6, 2013.
- [7] V. Sales, R. Terra, L. F. Miranda, and M. T. Valente. Recommending move method refactorings using dependency sets. In 20th Working Conference on Reverse Engineering (WCRE), pages 232–241, 2013.
- [8] O. Seng, J. Stammel, and D. Burkhart. Search-based determination of refactorings for improving the class structure of object-oriented systems. In 8th Conference on Genetic and Evolutionary Computation (GECCO), pages 1909–1916, 2006.
- [9] D. Silva, R. Terra, and M. T. Valente. Recommending automated Extract Method refactorings. In 22nd International Conference on Program Comprehension (ICPC), pages 146–156, 2014.
- [10] R. Terra, J. Brunet, L. F. Miranda, M. T. Valente, D. Serey, D. Castilho, and R. S. Bigonha. Measuring the structural similarity between source code entities. In 25th Conference on Software Engineering and Knowledge Engineering (SEKE), pages 753–758, 2013.
- [11] N. Tsantalis and A. Chatzigeorgiou. Identification of extract method refactoring opportunities for the decomposition of methods. Journal of Systems and Software, 84(10):1757–1782, 2011.