An Approach for Restructuring Text Content

Lerina Aversano, Gerardo Canfora, Giuseppe De Ruvo, and Maria Tortorella

University of Sannio, Italy

Track: New Ideas and Emerging Results
Session: Alternative Modeling
Software engineers have successfully used Natural Language Processing for refactoring source code. Conversely, in this paper we investigate the possibility of applying software refactoring techniques to textual content. Just as a procedural program is composed of functions that call each other, a document can be modeled as content fragments connected to each other through links. Inspired by software engineering refactoring strategies, we propose an approach for refactoring wiki content. The approach has been applied to the EMF category of Eclipsepedia with encouraging results.
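
To make the program/document analogy concrete, the following is a minimal sketch of one way to model wiki content as a directed graph, with content fragments as nodes and links as edges, in the same way a call graph relates functions. It is an illustration only and not the approach evaluated in the paper: the page names, the functions `build_link_graph` and `fan_in_out`, and the choice of fan-in and fan-out as structural measures are all assumptions introduced here.

```python
def build_link_graph(pages):
    """Map each fragment to the set of fragments it links to.

    `pages` is a dict {fragment_name: [linked_fragment_names]}.
    Self-links and links to fragments that do not exist are dropped.
    """
    return {
        name: {t for t in targets if t in pages and t != name}
        for name, targets in pages.items()
    }


def fan_in_out(graph):
    """Return (fan_in, fan_out) dicts, analogous to callers/callees counts."""
    fan_out = {name: len(targets) for name, targets in graph.items()}
    fan_in = {name: 0 for name in graph}
    for targets in graph.values():
        for target in targets:
            fan_in[target] += 1
    return fan_in, fan_out


# Hypothetical wiki fragments; the names are illustrative only.
pages = {
    "Overview": ["Tutorial", "FAQ"],
    "Tutorial": ["FAQ", "Overview"],
    "FAQ": ["Overview", "Missing"],  # "Missing" is a dangling link
    "Orphan": [],
}

graph = build_link_graph(pages)
fan_in, fan_out = fan_in_out(graph)

# Fragments that nothing links to and that link to nothing are candidates
# for review, much like unreachable functions in a program.
isolated = sorted(n for n in graph if fan_in[n] == 0 and fan_out[n] == 0)
print(isolated)
```

Under this model, structural observations familiar from code (isolated units, heavily referenced units, dangling references) have direct counterparts in the link graph, which is what makes refactoring-style restructuring of the content conceivable.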