Publication
DSN-W 2017
Conference paper
Providing Resiliency to Orchestration and Automation Engines in Hybrid Cloud
Abstract
Hybrid cloud environments have seen a rapid rise in recent years. An essential part of a hybrid cloud is its ability to orchestrate the allocation, provisioning, and management of different compute resources spanning multiple cloud systems, and to drive these operations in an automated way. The Orchestration and Automation Engines (OAEs) of a hybrid cloud must themselves be highly available to ensure high resiliency of the hybrid cloud. In this paper, we present our experience in providing resiliency to the OAEs of a real-world hybrid cloud. The presentation covers the resiliency architecture of the OAEs, solutions that deal with errors ranging from software component crashes to configuration/metadata errors and data corruption, experimental results, and the lessons we learned from this practical experience.