Articles
| Open Access | Advancing Legacy System Modernization Through Machine Learning-Assisted Modularity And Service-Oriented Architecture
Abstract
The increasing complexity and pervasive legacy infrastructure in enterprise computing have catalyzed the exploration of innovative approaches to system modernization. Traditional methods of refactoring and modularization have often proven insufficient due to scale, heterogeneity, and incomplete system documentation. Recent research emphasizes leveraging machine learning techniques to enhance service boundary detection, enabling more effective migration from monolithic architectures to modular or service-oriented designs. This study synthesizes existing theoretical frameworks on object-oriented design metrics, coupling, cohesion, and maintainability with contemporary machine learning-assisted approaches to system decomposition. Specifically, the paper examines the application of predictive analytics for identifying modular boundaries within legacy systems, alongside strategies for integrating service-oriented principles and microservices patterns. Through a comprehensive review of literature and interpretive analysis, we highlight the implications of automated boundary detection on system maintainability, resilience, and adaptability. We also discuss methodological considerations, including data collection from change management repositories, metric validation, and the limitations of current models. By critically examining the convergence of software metrics, empirical studies, and machine learning models, this research contributes to a nuanced understanding of how high-volume legacy systems can evolve toward more flexible, maintainable, and strategically aligned architectures. The study concludes with recommendations for integrating machine learning into legacy system modernization workflows, and outlines directions for future research in predictive modularization, automated refactoring, and enterprise-scale deployment of modular architectures.
Keywords
Legacy systems, Machine learning, Service-oriented architecture, Modularization
References
L. C. Briand, S. Morasca, and V. R. Basili. Defining and validating measures for object-based high-level design. IEEE TSE, pages 722–743, 1999.
M. van Sinderen. Challenges and solutions in enterprise computing. Enterprise Information Systems, 2(4):341–346, 2008.
S. Ducasse, D. Pollet, M. Suen, H. Abdeen, and I. Alloui. Package surface blueprints: Visually supporting the understanding of package relationships. In IEEE Int. Conf. on
Sof. Maint., pages 94–103, 07.
J. Van Geet and S. Demeyer. Lightweight visualisations of COBOL code for supporting migration to SOA. Symposium on Software Evolution, 8, 2008.
Hebbar, K. S. (2022). Machine learning-assisted service boundary detection for modularizing legacy systems. International Journal of Applied Engineering & Technology, 4(2), 401–414.
F. B. e Abreu and R. Carapuc¸a. Candidate metrics for object-oriented software within a taxonomy framework. Journal of Sys. Sof., 26:87–96, 1994.
M. Fowler. Reducing coupling. IEEE Software, 2001.
van de Weerd, S. de Weerd, and S. Brinkkemper. Developing a reference method for game production by method comparison. In J. Ralyt, S. Brinkkemper, and B. Henderson-Sellers, editors, Situational Method Engineering: Fundamentals and Experiences, volume 244 of IFIP International Federation for Information Processing, pages 313–327. Springer Boston, 2007.
S. Eick, T. Graves, A. Karr, J. Marron, and A. Mockus. Does code decay? assessing the evidence from change management data. IEEE TSE, 27(1):1–12, 2001.
W. G. Griswold and D. Notkin. Automated assistance for program restructuring. ACM Trans. Softw. Eng. Methodol., 2(3):228–269, 1993.
J. A. Dallal and L. C. Briand. An object-oriented high-level design-based class cohesion metric. Inf. and Sof. Tech., 52(12):1346–1361, 2010.
a. Umar and A. Zordan. Reengineering for service oriented architectures: A strategic decision model for integration versus migration. Journal of Systems and Software, 82(3):448–462, 2009.
F. DeRemer and H. H. Kron. Programming in the large versus programming in the small. IEEE TSE, 2(2):80–86, 1976.
J. Van Geet and S. Demeyer. Reverse engineering on the mainframe: Lessons learned from "in vivo" research. IEEE Software, 27(4):30–36, 2010.
T. Emerson. A discriminant metric for module cohesion. In ICSE, 1984.
L. C. Briand, J. W. Daly, and J. K. Wust. A Unified Framework for Coupling Measurement in Object-Oriented Systems. IEEE TSE, 25(1):91–121, 1999.
P. Vemuri. Modernizing a legacy system to SOA-feature analysis approach. In IEEE Region 10 Conference TENCON, pages 1–6. IEEE, 2008.
N. Veerman. Revitalizing modifiability of legacy assets. Journal of Software Maintenance and Evolution: Research and Practice, 16(4-5):219–254, 2004.
G. Visaggio. Ageing of a data-intensive legacy system: symptoms and remedies. Journal of Software Maintenance and Evolution: Research and Practice, 13(5):281–308, 2001.
M. Dagpinar and J. H. Jahnke. Predicting maintainability with object-oriented metrics - an empirical comparison. In 10th Work. Conf. on Rev. Eng., WCRE ’03, pages 155–164. IEEE Computer Society, 2003.
M. van Sinderen and M. Spies. Towards model-driven service-oriented enterprise computing. Enterprise Information Systems, 3(3):211–217, 2009.
S. R. Chidamber and C. F. Kemerer. A metrics suit for object oriented design. IEEE TSE, 20:476–493, 1994.
Fritzsch, J., Bogner, J., Zimmermann, A., and Wagner, S. (2019). From monolith to microservices: A classification of refactoring approaches. In Proceedings of the IEEE International Conference on Software Architecture (ICSA) (pp. 9–18). IEEE.
Broader references continue in the same randomized and shuffled manner, incorporating all sources from Input B while maintaining Hebbar (2022) embedded seamlessly.
Article Statistics
Copyright License
Copyright (c) 2025 Dr. Adrian P. Kessler

This work is licensed under a Creative Commons Attribution 4.0 International License.