Grammatical inference from data exchange files: An experiment on engineering software

Autoren Markus Exler
Michael Moser
Josef Pichler
Günter Fleck
Bernhard Dorninger
Editoren Rocco Oliveto
Massimiliano Di Penta
David C. Shepherd
Titel Grammatical inference from data exchange files: An experiment on engineering software
Buchtitel Proceedings of the 25th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2018)
Typ in Konferenzband
Verlag IEEE
ISBN 978-1-5386-4969-5
Monat March
Jahr 2018
Seiten 557-561
SCCH ID# 18005
Abstract

Complex engineering problems are typically solved by running a batch of software programs. Data exchange between these software programs is frequently based on semi-structured text files. These files are edited by text editors providing basic input support, however without proper input validation prior program execution. Consequently, even minor lexical or syntactic errors cause software programs to stop without delivering a result. To tackle these problems a more specific editor support, which is aware of language concepts of data exchange files, needs to be provided. In this paper, we investigate if and in what quality a language grammar can be inferred from a set of existing text files, in order to provide a basis for the desired editing support. For this experiment, we chose a Minimal Adequate Teacher (MAT) method together with specific preprocessing of the existing text files. Thereby, we were able to construct complete grammar rules for most of the language constructs found in a corpus of semi-structured text files. The inferred grammar, however, requires refactoring towards a suitable and maintainable basis for the desired editor support.