{"id":25966,"date":"2022-09-10T21:52:27","date_gmt":"2022-09-10T21:52:27","guid":{"rendered":"https:\/\/cloud-cod.com\/?p=25966"},"modified":"2022-09-11T06:58:15","modified_gmt":"2022-09-11T06:58:15","slug":"disaster-recovery-strategies-part-2","status":"publish","type":"post","link":"https:\/\/cloud-cod.com\/index.php\/2022\/09\/10\/disaster-recovery-strategies-part-2\/","title":{"rendered":"Disaster Recovery Strategies &#8211; Part #2"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"25966\" class=\"elementor elementor-25966\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-55d1a40 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"55d1a40\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-71476b1\" data-id=\"71476b1\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t\t\t<div class=\"elementor-element elementor-element-ab01ada elementor-widget elementor-widget-image\" data-id=\"ab01ada\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<style>\/*! elementor - v3.17.0 - 01-11-2023 *\/\n.elementor-widget-image{text-align:center}.elementor-widget-image a{display:inline-block}.elementor-widget-image a img[src$=\".svg\"]{width:48px}.elementor-widget-image img{vertical-align:middle;display:inline-block}<\/style>\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"272\" height=\"300\" src=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6_2-272x300.png\" class=\"attachment-medium size-medium wp-image-26008\" alt=\"Disaster Recovery\" srcset=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6_2-272x300.png 272w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6_2-928x1024.png 928w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6_2-768x848.png 768w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6_2.png 1091w\" sizes=\"auto, (max-width: 272px) 100vw, 272px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7e40cf3 elementor-widget elementor-widget-text-editor\" data-id=\"7e40cf3\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<style>\/*! elementor - v3.17.0 - 01-11-2023 *\/\n.elementor-widget-text-editor.elementor-drop-cap-view-stacked .elementor-drop-cap{background-color:#69727d;color:#fff}.elementor-widget-text-editor.elementor-drop-cap-view-framed .elementor-drop-cap{color:#69727d;border:3px solid;background-color:transparent}.elementor-widget-text-editor:not(.elementor-drop-cap-view-default) .elementor-drop-cap{margin-top:8px}.elementor-widget-text-editor:not(.elementor-drop-cap-view-default) .elementor-drop-cap-letter{width:1em;height:1em}.elementor-widget-text-editor .elementor-drop-cap{float:left;text-align:center;line-height:1;font-size:50px}.elementor-widget-text-editor .elementor-drop-cap-letter{display:inline-block}<\/style>\t\t\t\t<p>Disaster Recovery and Business Continuity topic has been presented by me in two parts.<\/p><p>The Part #1 focused on Aviatrix Control-Plane and Data-Plane separation and Aviatrix Controller deployment recommendations. -&gt; <a title=\"Disaster Recovery Strategies\" href=\"https:\/\/cloud-cod.com\/index.php\/2022\/09\/09\/disaster-recovery-strategies-part-1\/\" data-wplink-url-error=\"true\">Disaster Recovery Strategies &#8211; Part 1<\/a><\/p><p>Part #2 (this post) focuses on Disaster Recovery \/ Business Continuity strategies.\u00a0<\/p>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-dd2ef50 elementor-widget elementor-widget-heading\" data-id=\"dd2ef50\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<style>\/*! elementor - v3.17.0 - 01-11-2023 *\/\n.elementor-heading-title{padding:0;margin:0;line-height:1}.elementor-widget-heading .elementor-heading-title[class*=elementor-size-]>a{color:inherit;font-size:inherit;line-height:inherit}.elementor-widget-heading .elementor-heading-title.elementor-size-small{font-size:15px}.elementor-widget-heading .elementor-heading-title.elementor-size-medium{font-size:19px}.elementor-widget-heading .elementor-heading-title.elementor-size-large{font-size:29px}.elementor-widget-heading .elementor-heading-title.elementor-size-xl{font-size:39px}.elementor-widget-heading .elementor-heading-title.elementor-size-xxl{font-size:59px}<\/style><h2 class=\"elementor-heading-title elementor-size-default\">Aviatrix and Disaster Recovery<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-dfd5c2a elementor-widget elementor-widget-text-editor\" data-id=\"dfd5c2a\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<p>How does Aviatrix help the customers with their Disaster Recovery \/ Business Continuity strategies?<\/p><p>As you might already know, Aviatrix lets the customers connect environments (e.g. their Primary and DR locations) in different Clouds and Regions very easily.<\/p><p>The detailed strategies are<span style=\"color: var( --e-global-color-text ); font-family: var( --e-global-typography-text-font-family ), Sans-serif; font-weight: var( --e-global-typography-text-font-weight ); font-size: 0.875rem;\"> presented in 4 scenarios that could be leveraged by the customers. The specific strategy used by the customer is a trade-off between a cost and the restoration time, and it depends of course on multiple factors, incl. How is a DR important to the company? What is the cost in case of<\/span><span style=\"color: var( --e-global-color-text ); font-family: var( --e-global-typography-text-font-family ), Sans-serif; font-weight: var( --e-global-typography-text-font-weight ); font-size: 0.875rem;\"> an outage? Could the company deal with an outage? What are the applications\/databases DR requirements?<\/span><\/p>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0924eef elementor-widget elementor-widget-heading\" data-id=\"0924eef\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Disaster Recovery Strategies<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e71efd8 elementor-widget elementor-widget-text-editor\" data-id=\"e71efd8\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<p>Four Disaster Recovery strategies are discussed by me in this post. Those strategies are presented from a cloud network perspective, especially how Aviatrix gateways could be deployed to achieve the desired DR\/BC scenario.<\/p>\n<p>At the same time, it is important to mention that application\/database requirements and capabilities must be taken into account whenever the DR strategy is discussed and implemented.<\/p>\n<p>Please keep in mind that all the drawings have been done using Azure as an example but the strategies apply to AWS as well.<\/p>\n<p>&#8212; Click on the links below to expand the content &#8212;<\/p>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-5895f5a elementor-widget elementor-widget-toggle\" data-id=\"5895f5a\" data-element_type=\"widget\" data-widget_type=\"toggle.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<style>\/*! elementor - v3.17.0 - 01-11-2023 *\/\n.elementor-toggle{text-align:left}.elementor-toggle .elementor-tab-title{font-weight:700;line-height:1;margin:0;padding:15px;border-bottom:1px solid #d5d8dc;cursor:pointer;outline:none}.elementor-toggle .elementor-tab-title .elementor-toggle-icon{display:inline-block;width:1em}.elementor-toggle .elementor-tab-title .elementor-toggle-icon svg{margin-inline-start:-5px;width:1em;height:1em}.elementor-toggle .elementor-tab-title .elementor-toggle-icon.elementor-toggle-icon-right{float:right;text-align:right}.elementor-toggle .elementor-tab-title .elementor-toggle-icon.elementor-toggle-icon-left{float:left;text-align:left}.elementor-toggle .elementor-tab-title .elementor-toggle-icon .elementor-toggle-icon-closed{display:block}.elementor-toggle .elementor-tab-title .elementor-toggle-icon .elementor-toggle-icon-opened{display:none}.elementor-toggle .elementor-tab-title.elementor-active{border-bottom:none}.elementor-toggle .elementor-tab-title.elementor-active .elementor-toggle-icon-closed{display:none}.elementor-toggle .elementor-tab-title.elementor-active .elementor-toggle-icon-opened{display:block}.elementor-toggle .elementor-tab-content{padding:15px;border-bottom:1px solid #d5d8dc;display:none}@media (max-width:767px){.elementor-toggle .elementor-tab-title{padding:12px}.elementor-toggle .elementor-tab-content{padding:12px 10px}}.e-con-inner>.elementor-widget-toggle,.e-con>.elementor-widget-toggle{width:var(--container-widget-width);--flex-grow:var(--container-widget-flex-grow)}<\/style>\t\t<div class=\"elementor-toggle\">\n\t\t\t\t\t\t\t<div class=\"elementor-toggle-item\">\n\t\t\t\t\t<h4 id=\"elementor-tab-title-9281\" class=\"elementor-tab-title\" data-tab=\"1\" role=\"button\" aria-controls=\"elementor-tab-content-9281\" aria-expanded=\"false\">\n\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon elementor-toggle-icon-left\" aria-hidden=\"true\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-closed\"><i class=\"fas fa-caret-right\"><\/i><\/span>\n\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-opened\"><i class=\"elementor-toggle-icon-opened fas fa-caret-up\"><\/i><\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t<a class=\"elementor-toggle-title\" tabindex=\"0\">JIT Data-Plane deployment in DR Region<\/a>\n\t\t\t\t\t<\/h4>\n\n\t\t\t\t\t<div id=\"elementor-tab-content-9281\" class=\"elementor-tab-content elementor-clearfix\" data-tab=\"1\" role=\"region\" aria-labelledby=\"elementor-tab-title-9281\"><p>The JIT (Just In Time) data-plane deployment approach assumes that there will be no Aviatrix Gateways pre-built at the secondary Region before the primary Region outage. Only the VPCs\/VNETs will be created beforehand.<\/p><p>The Aviatrix Gateways at the secondary Region will be spun up at the time of downtime. To lower the time required by the deployment at the secondary location, the use of Terraform is recommended.<\/p><p>This strategy does not introduce any AWS\/Azure additional costs when it comes to compute resources or licenses. The VPCsVNETs are free of charge. There is also no Aviatrix-related cost.<\/p><p>The customer must be aware of the following trade-offs:<\/p><ul><li>Long Recovery Time, as the whole environment (Aviatrix Gateways and the connections between them) must be built at the secondary Region at the time of the outage<\/li><li>The risk of potential deployment issues. Each issue could extend the Recovery Time even more (which could be mitigated by using Terraform)<\/li><li>The risk that the primary Region outage would affect thousands of other customers\/organizations. The AWS\/Azure compute resources within a particular Region are finite resources. Those other customers might want to restore their workloads in the same Region that is our customer\u2019s secondary Region. In theory, there is a possibility of resource contention.<\/li><\/ul><p>The expected Recovery Time depends on the number and types of the components to be deployed, e.g. number of Gateways, number of peerings, and number of SNAT\/DNAT rules. However, it is safe to say that the time will be more than an hour.<\/p><figure id=\"attachment_25968\" aria-describedby=\"caption-attachment-25968\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-25968 size-large\" title=\"JTI Data-Plane deployment in DR Region\" src=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR-1024x575.png\" alt=\"JTI Data-Plane deployment in DR Region\" width=\"800\" height=\"449\" srcset=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR-1024x575.png 1024w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR-300x168.png 300w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR-768x431.png 768w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR-1536x862.png 1536w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/JIT_DR.png 1676w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><\/a><figcaption id=\"caption-attachment-25968\" class=\"wp-caption-text\">JIT Data-Plane deployment in DR Region<\/figcaption><\/figure><h4>\u00a0<\/h4><h4>Advantages<\/h4><ul><li><p>No additional Azure\/AWS costs (compute resources)<\/p><\/li><li>No Aviatrix cost until the deployment<\/li><li>No need to synchronize the configuration between the primary and the secondary environments<\/li><\/ul><h4>Disadvantages<\/h4><ul><li>Long Recovery Time measured (1hr+), e.g. the Aviatrix Gateways, peerings, SNAT\/DNAT rules must be built during the outage<\/li><li><p>The risk of potential deployment issues. It can be mitigated by using a Terraform<\/p><\/li><li>A finite number of Azure\/AWS compute resources in a Region. The risk of resource contention between many customers\/organizations<\/li><\/ul><\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<div class=\"elementor-toggle-item\">\n\t\t\t\t\t<h4 id=\"elementor-tab-title-9282\" class=\"elementor-tab-title\" data-tab=\"2\" role=\"button\" aria-controls=\"elementor-tab-content-9282\" aria-expanded=\"false\">\n\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon elementor-toggle-icon-left\" aria-hidden=\"true\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-closed\"><i class=\"fas fa-caret-right\"><\/i><\/span>\n\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-opened\"><i class=\"elementor-toggle-icon-opened fas fa-caret-up\"><\/i><\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t<a class=\"elementor-toggle-title\" tabindex=\"0\">Cold Data-Plane in DR Region<\/a>\n\t\t\t\t\t<\/h4>\n\n\t\t\t\t\t<div id=\"elementor-tab-content-9282\" class=\"elementor-tab-content elementor-clearfix\" data-tab=\"2\" role=\"region\" aria-labelledby=\"elementor-tab-title-9282\"><p>The Cold data-plane approach assumes to have the data-plane (Aviatrix Spoke and Transit Gateways) already deployed in both environments\/Regions (the primary \u201cactive\u201d one and the secondary \u201cstandby\u201d one) but the Gateway instances in the secondary environment are shut down to save both the Azure\/AWS compute cost (no compute costs are charged until used) and Aviatrix costs (tunnels are down). It means the customer deploys the same set of Aviatrix Spoke and Transit Gateways in both environments but only the Gateways and Tunnels in the primary environment are UP and functioning. The customer workloads and all the traffic flows are active only in the primary Region.<\/p><figure id=\"attachment_25981\" aria-describedby=\"caption-attachment-25981\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-25981 size-large\" title=\"Cold Data-Plane in DR Region\" src=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region-1024x568.png\" alt=\"Cold Data-Plane in DR Region\" width=\"800\" height=\"444\" srcset=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region-1024x568.png 1024w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region-300x167.png 300w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region-768x426.png 768w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region-1536x853.png 1536w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Cold-Data-Plane-in-DR-Region.png 1661w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><\/a><figcaption id=\"caption-attachment-25981\" class=\"wp-caption-text\">Cold Data-Plane in DR Region<\/figcaption><\/figure><p>Some considerations must be taken into account when it comes to the Cold data-plane scenario:<\/p><ul><li>Pre-requisite: Do not deploy the Aviatrix Controller in the primary environment Region. The failure of this Region will impact the control-plane. Spinning up the Gateways will be dependent on the Controller being reachable. The Controller must be restored first before activating the data plane (Gateways) secondary environment. The shared failure domain between the Controller and primary Region could be catastrophic (as discussed in Part #1 post).<\/li><li>The Risk: Potential primary Region outage would affect thousands of other customers\/organizations. The Azure\/AWS compute resources within a particular Region are finite resources. Imagine that those other customers would like to restore their workloads in the same Region that you have your secondary Aviatrix Gateways deployed but shut down and you want to bring them up.<\/li><\/ul><p>The interconnection between the primary and secondary environments is not required because inter-region peering between Transit Gateways will be down anyway in this case.<\/p><p>The expected recovery time from a cloud network perspective will be less than an hour because the Aviatrix Gateways have been already pre-built in the secondary environment. The only thing to be done here is to Enable them so the recovery time is dependent on the Gateways changing their state from down to up, and the Tunnels being established between them.<\/p><h4>Advantages<\/h4><ul><li>Quickly available (&lt;1hr), a fair degree of certainty as the secondary environment is already pre-built but shut down<\/li><li>No time is required for building the data plane because the data plane is already pre-built. Though the time is required to bring up the Gateways and all the Tunnels between them<\/li><li>No Azure\/AWS cost because the Gateways are deployed but shut down. No compute costs are charged until Gateways are activated<\/li><li>No Aviatrix cost (tunnels down)<\/li><\/ul><h4>Disadvantages<\/h4><ul><li>The risk of activating the Gateways in the secondary location at the same time as other organizations might want to spin up their compute resources. Azure compute resources in a particular Region are finite<\/li><li>Any configuration changes to the primary location must be executed on the secondary location as well (Terraform could be used to make it automated and consistent)<\/li><\/ul><\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<div class=\"elementor-toggle-item\">\n\t\t\t\t\t<h4 id=\"elementor-tab-title-9283\" class=\"elementor-tab-title\" data-tab=\"3\" role=\"button\" aria-controls=\"elementor-tab-content-9283\" aria-expanded=\"false\">\n\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon elementor-toggle-icon-left\" aria-hidden=\"true\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-closed\"><i class=\"fas fa-caret-right\"><\/i><\/span>\n\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-opened\"><i class=\"elementor-toggle-icon-opened fas fa-caret-up\"><\/i><\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t<a class=\"elementor-toggle-title\" tabindex=\"0\">Hybrid Hot\/Cold Data-Plane<\/a>\n\t\t\t\t\t<\/h4>\n\n\t\t\t\t\t<div id=\"elementor-tab-content-9283\" class=\"elementor-tab-content elementor-clearfix\" data-tab=\"3\" role=\"region\" aria-labelledby=\"elementor-tab-title-9283\"><p>The Hybrid hot\/cold data plane approach, as the name implies, is a mix of Hot and Cold strategies (discussed below). The idea is to deploy the whole environment in the secondary Region but keep half of the Aviatrix Gateways in a down state (to reduce both Azure\/AWS and Aviatrix cost).<\/p><p>Please notice that most of the Tunnels between Spoke and Transit Gateways will be down, and only one Tunnel will be UP. The reason is that a Tunnel can be active only if Gateways on both ends are active, which is shown in the diagram below.<\/p><figure id=\"attachment_25972\" aria-describedby=\"caption-attachment-25972\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Tunnels-in-a-Hybrid-Data-Plane-strategy.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-25972 size-medium\" title=\"Tunnels in a Hybrid Data-Plane strategy\" src=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Tunnels-in-a-Hybrid-Data-Plane-strategy-300x271.png\" alt=\"Tunnels in a Hybrid Data-Plane strategy\" width=\"300\" height=\"271\" srcset=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Tunnels-in-a-Hybrid-Data-Plane-strategy-300x271.png 300w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Tunnels-in-a-Hybrid-Data-Plane-strategy-768x693.png 768w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Tunnels-in-a-Hybrid-Data-Plane-strategy.png 877w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-25972\" class=\"wp-caption-text\">Tunnels in a Hybrid Data-Plane strategy<\/figcaption><\/figure><p>In this scenario, the primary Region Transit Gateways and the secondary Region Transit Gateways can be interconnected to make the data plane available for replication during normal operations. However, it depends on the applications\/databases deployed by the customer and the IP address scheme used in the VPCs\/VNETs (whether wthe same IP prefixes must be used in both environments or not).<\/p><p>As always, it is recommended to deploy the Aviatrix Controller in a different Region than primary and secondary environments. More on that in a post called &#8220;Part #1&#8221;.<\/p><figure id=\"attachment_25982\" aria-describedby=\"caption-attachment-25982\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-25982 size-large\" title=\"Hybrid Hot\/Cold data plane in DR Region\" src=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2-1024x567.png\" alt=\"Hybrid Hot\/Cold data plane in DR Region\" width=\"800\" height=\"443\" srcset=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2-1024x567.png 1024w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2-300x166.png 300w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2-768x425.png 768w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2-1536x851.png 1536w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hybrid-Hot-Cold-data-plane-in-DR-Region-2.png 1677w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><\/a><figcaption id=\"caption-attachment-25982\" class=\"wp-caption-text\">Hybrid Hot\/Cold data plane in DR Region<\/figcaption><\/figure><p>When the primary Region fails, the secondary Region is ready to take the traffic because half of the Gateways (and part of the Tunnels) are already functioning. The Recovery Time is the same as in the \u201cHot data plane\u201d approach. It means it is highly dependent on the time required by the applications\/databases\/workloads to be migrated from the primary environment to the secondary environment. From a cloud network perspective, the secondary environment is ready to take the traffic. The only required thing is to bring up the remaining half of the Gateways that are shut down to introduce resiliency and improve the performance.<\/p><p>This approach combines the advantages of Hot and Cold strategies. Though their disadvantages apply as well.<\/p><h4>Advantages<\/h4><ul><li><p>Combines the advantages of Hot and Cold strategies<\/p><\/li><li>No time is required for building the data plane because the data plane is already pre-built and functioning at the secondary location. The half of the Gateways must be brought up<\/li><li>Seamless disaster recovery from a cloud network perspective = The secondary environment is immediately available with one tunnel between active Spoke\/Transit Gateways<\/li><li>Data plane at the secondary location might be used for replication (depending on the applications\/databases used) already during normal operations<\/li><li>Aviatrix license cost because only some tunnels are up<\/li><\/ul><h4>Disadvantages<\/h4><ul><li>The Azure\/AWS cost because half of the Gateways are already deployed and functional at the pre-built secondary location and are consuming the compute resources<\/li><li>Any configuration changes to the primary location must be executed on the secondary location as well (Terraform could be used to make it automated and consistent)<\/li><\/ul><\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<div class=\"elementor-toggle-item\">\n\t\t\t\t\t<h4 id=\"elementor-tab-title-9284\" class=\"elementor-tab-title\" data-tab=\"4\" role=\"button\" aria-controls=\"elementor-tab-content-9284\" aria-expanded=\"false\">\n\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon elementor-toggle-icon-left\" aria-hidden=\"true\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-closed\"><i class=\"fas fa-caret-right\"><\/i><\/span>\n\t\t\t\t\t\t\t\t<span class=\"elementor-toggle-icon-opened\"><i class=\"elementor-toggle-icon-opened fas fa-caret-up\"><\/i><\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t\t\t<a class=\"elementor-toggle-title\" tabindex=\"0\">Hot Data-Plane in DR Region<\/a>\n\t\t\t\t\t<\/h4>\n\n\t\t\t\t\t<div id=\"elementor-tab-content-9284\" class=\"elementor-tab-content elementor-clearfix\" data-tab=\"4\" role=\"region\" aria-labelledby=\"elementor-tab-title-9284\"><p>The Hot data-plane approach assumes to have the data-plane (Aviatrix Spoke and Transit Gateways) already deployed in both environments\/Regions.<\/p><p>As already mentioned a few times, this post focuses on the network perspective to achieve the desired DR strategy. The customer deploys the same set of Aviatrix Spoke and Transit Gateways in Primary and DR environments. All the Gateways and Tunnels in both parts are UP and functioning.\u00a0<\/p><p>Please keep in mind though that there might be two sub-scenarios from the Application\/DataBase perspective. There are a lot of factors (application and database related) that must be taken into account before choosing one or the other sub-scenario, incl. Are the applications ready to function in both environments? What about DNS setup? What about DataBase setup and replication? What about IP addressing? To name just a few. The sub-scenarios are:<\/p><ul><li>the primary Region could be \u201cactive\u201d and the secondary could be \u201cstandby\u201d from the Application\/DataBase perspective. Though the network part is active, from the application perspective only the primary Region is UP. In this scenario, there is also a possibility to interconnect the primary and secondary environments to make the data-plane (network) available for DB replication during normal operations.\u00a0<\/li><li>both Regions are Active from Applications\/DataBase perspective, meaning a true Active\/Active or Multi-Site solution is used. Applications are running in both Regions\/environments.<\/li><\/ul><figure id=\"attachment_25983\" aria-describedby=\"caption-attachment-25983\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-25983 size-large\" title=\"Hot Data-Plane in DR Region\" src=\"http:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region-1024x570.png\" alt=\"Hot Data-Plane in DR Region\" width=\"800\" height=\"445\" srcset=\"https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region-1024x570.png 1024w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region-300x167.png 300w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region-768x428.png 768w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region-1536x855.png 1536w, https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/Hot-Data-Plane-in-DR-Region.png 1672w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><\/a><figcaption id=\"caption-attachment-25983\" class=\"wp-caption-text\">Hot Data-Plane in DR Region<\/figcaption><\/figure><p>It is recommended to deploy the Aviatrix Controller in a different Region than primary and secondary environments. More on that in section: \u201cSeparation of failure domains\u201d.<\/p><p>The expected recovery time is the fastest one among all 4 presented scenarios. Of course, it also depends on the sub-scenario chosen from the Application\/DB perspective (Active\/Standby or Active\/Active). For Active\/Standby: the recovery time is highly dependent on the time required by the applications\/databases\/workloads to be migrated from the primary environment to the secondary environment.<\/p><p>Nevertheless, from a cloud network perspective, the secondary environment is up and running all the time and it does not require any additional configuration.<\/p><h4>Advantages<\/h4><ul><li><p>The most desired solution when it comes to the service uptime and the best recovery time<\/p><\/li><li>No time is required for building the Aviatrix data plane because the data plane is already pre-built and functioning at the secondary location<\/li><li>Seamless disaster recovery from the Aviatrix cloud network perspective = The secondary environment is immediately available<\/li><li>Data plane at the secondary location might be used for replication (depending on the applications\/databases used) already during normal operations OR the secondary location could be fully running (from an application\/database perspective)<\/li><\/ul><h4>Disadvantages<\/h4><ul><li>The highest AWS\/Azure cost because Gateways are already deployed at the pre-built secondary location and are consuming the compute resources<\/li><li>The highest Aviatrix cost because all the tunnels are UP<\/li><li>Any configuration changes to the primary location must be executed on the secondary location as well (Terraform could be used to make it automated and consistent)<\/li><\/ul><\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-b5d2310 elementor-widget elementor-widget-heading\" data-id=\"b5d2310\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Summary<\/h2>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4819fc0 elementor-widget elementor-widget-text-editor\" data-id=\"4819fc0\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<p>The first step of DR\/BC planning is to gather application\/database requirements. As soon as it is known what is feasible from the app\/db perspective, the next step is to think about how the network could be used to achieve the goal. With Aviatrix, the network part of DR is straightforward.<\/p><p>Please find below the comparison of all solutions presented in this post:<\/p><ul><li>JTI Data-Plane deployment:<ul><li>The cheapest solution &#8211; DR network upfront costs can be even nailed down to &#8220;zero&#8221;<\/li><li>The longest network recovery time (1hr+)<\/li><li>The whole environment must be rebuilt in DR location<\/li><\/ul><\/li><li>Cold-Data-Plane<ul><li>No upfront network Azure\/AWS and Aviatrix cost (the same advantage as with the JIT approach).<\/li><li>Network Recovery Time of an hour is achievable<\/li><li>Network configuration in DR location must be synchronized with configuration in Primary location<\/li><\/ul><\/li><li>Hybrid Hot\/Cold Data-Plane<ul><li>Network in DR location is Active (means seamless DR)<\/li><li>It is possible to have traffic flowing to the DR location (e.g. db replication)<\/li><li>There are some Azure\/AWS compute costs because half of Aviatrix Gateways is up<\/li><li>There is some Aviatrix cost because 1 tunnel is up between each Gateway pair<\/li><li>Network configuration in DR location must be synchronized with configuration in Primary location<\/li><\/ul><\/li><li>Hot Data-Plane<ul><li>Network in DR location is fully UP at full capacity (seamless DR)<\/li><li>Compute cost (AWS\/Azure) might be high (compute resources are allocated for the Gateways)<\/li><li>Aviatrix cost (all tunnels are up)<\/li><li>Network configuration in DR location must be synchronized with configuration in Primary location<\/li><\/ul><\/li><\/ul><p>I hope this post was informative.<\/p>\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Disaster Recovery and Business Continuity topic has been presented by me in two parts. The Part #1 focused on Aviatrix Control-Plane and Data-Plane separation and Aviatrix Controller deployment recommendations. -&gt; &#8230;<\/p>\n","protected":false},"author":2,"featured_media":26007,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[14,8,6],"tags":[],"class_list":["post-25966","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-aviatrix","category-disaster-recovery","category-multi-cloud"],"uagb_featured_image_src":{"full":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1.png",3592,3058,false],"thumbnail":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-150x150.png",150,150,true],"medium":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-300x255.png",300,255,true],"medium_large":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-768x654.png",768,654,true],"large":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-1024x872.png",800,681,true],"1536x1536":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-1536x1308.png",1536,1308,true],"2048x2048":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-2048x1744.png",2048,1744,true],"onepress-blog-small":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-300x150.png",300,150,true],"onepress-small":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-480x300.png",480,300,true],"onepress-medium":["https:\/\/cloud-cod.com\/wp-content\/uploads\/2022\/09\/blog_post6-1-640x400.png",640,400,true]},"uagb_author_info":{"display_name":"Jakub","author_link":"https:\/\/cloud-cod.com\/index.php\/author\/jakub\/"},"uagb_comment_info":89,"uagb_excerpt":"Disaster Recovery and Business Continuity topic has been presented by me in two parts. The Part #1 focused on Aviatrix Control-Plane and Data-Plane separation and Aviatrix Controller deployment recommendations. -&gt; ...","_links":{"self":[{"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/posts\/25966","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/comments?post=25966"}],"version-history":[{"count":59,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/posts\/25966\/revisions"}],"predecessor-version":[{"id":26051,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/posts\/25966\/revisions\/26051"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/media\/26007"}],"wp:attachment":[{"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/media?parent=25966"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/categories?post=25966"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cloud-cod.com\/index.php\/wp-json\/wp\/v2\/tags?post=25966"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}