Semantics-based Reverse Engineering of Object-Oriented Data Models

28th International Conference on Software Engineering |

Published by IBM Research

We present an algorithm for reverse engineering object-oriented (OO) data models from programs written in weakly-typed languages like Cobol. These models, similar to UML class diagrams, can facilitate a variety of program maintenance and migration activities. Our algorithm is based on a semantic analysis of the program’s code, and we provide a bisimulation-based formalization of what it means for an OO data model to be correct for a program.