EAGER: An Integrated Framework for Performance and Reliability in Large-Scale Computing Systems
Large-scale computing environments such as data centers and cloud platforms are becoming core computing infrastructure, which makes the availability of the services they host critical. However, these environments are increasingly vulnerable to both hardware and software failures. This project designs failure-aware techniques for modeling, prediction, and resource management in large-scale computing environments in the presence of hardware and software failures at various levels. Intellectually, the project develops a fundamental understanding of workload and reliability characteristics, and investigates how improved capacity planning models and prediction techniques can yield useful information for system design and maintenance. It further provides insight into how software and hardware component failures affect resource management.