Tag Archives: RecoverPoint

Mission Critical Center: A community for continuous availability

This blog post is about an internal effort we have started within EMC. We have talked about this at EMC World. Based on the initial response, the interest level behind this effort seems to be quite high. Here is some more information about the effort.

The challenge and the concept behind the solution

Over the past few years, customers are increasingly adopting / expecting continuous availability in their data centers. While it may be obvious, it still deserves saying that continuous availability is an end-to-end paradigm starting with the application to multi-pathing to SAN configuration to IP configuration to capabilities like VPLEX Metro and last but not the least physical storage.

We have always recognized that this has an impact on how customers view and purchase solutions. In other words, when a customer thinks about continuous availability, they think about continuous availability for their SAP Environment running on VMware in a SAN with multiple data centers etc. This has major implications for how we think about testing and validating what customers are deploying.

If you think of the normal testing paradigm for any product team, their responsibility is testing the product capabilities, product handling for failure conditions as well as performance, scale and other system testing needs. There is a second envelope of testing that is a superset of all of this – interoperability testing. EMC has built a core capability around interoperability testing with the world class ELab within the EMC family. ELab is responsible for interoperability and protocol testing and certifying products to work with EMC products. This results in generating Support Matrices. Customers and the field treat these support matrices as their bibles for how to configure and deploy products for interoperability. One more envelope around this testing is solution testing. This is now taking the end-to-end pieces that are supported and deploying them and testing them for functionality and performance.

One critical piece is still missing – especially with the focus that customers are putting on continuous availability. With the paradigm rapidly moving to 6 9s and 7 9s availability, it is not sufficient to test the part pieces and trust that interop and solution testing will result in customers reaching those hallowed availability levels. Instead, what is needed is proactive stress and failure testing of these end-to-end deployments. It is also important that we understand the operational paradigm a customer is likely to take in such a deployment.

How are we solving this challenge?

As you can imagine, in a multi-business unit company such as EMC, this is a herculean effort. You need different business units to buy into the concept of solution level failure and stress testing and then align on what is needed to validate and test this capability. Ultimately, our vision as EMC was to deliver to customers a continuous availability experience at the data center level. Talk about setting ambitious goals. But then, our goal was to deliver value to our customers. And setting goals only because they are achievable is not the way to get there.

Similar to when we built ELab, the decision was to invest in a new competency center – Mission Critical Center (MCC).

The mission of the MCC is to build a platform to test and demonstrate greater than 6 9s availability in production for products in the EMC portfolio.

And when we say production, we mean it. For our internal purposes, we treat the MCC exactly as we would treat a customer. They file an SR, escalations to engineering go through exactly the support route that the customer would follow. Upgrades to systems are done similar to how customers would go through it. For all practical purposes, they get exactly the same handling and care that EMC would provide in a customer environment. This teaches us about not only how the product behaves but also about what the impact is of our support processes from a customer perspective. Finally, this helps us also start to look at the problem holistically – i.e. we do not approach debugging the problem from a product perspective but rather from the perspective of the complete solution that the customer deploys.

Mission Critical Center: What is in place and where are we going?

Now that we have talked through the concept, let’s look at what the MCC team has done so far. The MCC team was started as a ground up effort looking for like minded and interested stakeholders across different business units (translation: it has largely been built through a lot of conviction and convincing). The team is essentially built through a shared collaboration between a lot of business units (VMAX, VNX, RecoverPoint and VPLEX). Here is the configuration they have put together.

Mission Critical Center Architecture
Mission Critical Center Architecture

For readers of this blog, you should be very familiar with this topology – it represents the cascaded VPLEX and RecoverPoint topology discussed here and specific topology captured here. The team has built use-cases around stretched Oracle RAC across DC1 and DC2, stretched VMware HA and other applications all running production level workloads across DC1 and DC2 and protected in DC3. Once this mission critical platform was built, their focus was certainly to run I/Os and then start to do accelerated failure testing (i.e. simulate data center type failure scenarios to understand what failures happen across the entire solution set). The goal of this is _NOT_ to test interoperability of VPLEX with VMAX or VPLEX with VNX or to test the performance any one component. The goal is to take real world customer workloads and deploy them across infrastructure the way a customer would and to learn their operational challenges as well as how the infrastructure handles and recovers from failures. So, the MCC team will often fail WAN links, entire arrays, do tech refreshes, introduce a fabric wide zoning change, simulate disaster of a data center, … you get the idea. Needless to say, I am a big fan!

The team has some very concrete plans on how to take this forward. This configuration is now being morphed into the MetroPoint configuration. That way, they can implement this new and exciting capability in much the same way as a customer would and corresponding to that is a whole new set of failure modes to test and simulate. We will continue to add more applications (SQL, SAP HANA, Hadoop), more infrastructure variances (data center moves, network outages, rolling outages and the ilk) and then more of EMC’s product families (DataDomain, Networker, Avamar, ViPR).

Mission Critical Center: The call to action

As the team is building their capabilities, we have a very real need for active guides / participants to build a strong community around the mission critical center. So, here are the concrete asks:

  1. If you are a customer / field person with solutions / design experience and would like to participate in this effort, do reach out to me and I can put you in touch with this effort. You can contribute as often or as little as you like. Your role will be to provide guidance to the team in terms of what they should look for, help understand operational processes on your end and to help us along the journey towards how your data center is evolving to make our products provide the same world class capabilities as they do in your environments today
  2. If there are specific scenarios / applications that you think would be worthy additions to this environment, please reach out to me and we can work to get those on our TODO list for the Mission Critical Center

In the end, this is a community of some very talented engineers within EMC volunteering a big chunk of their time (in addition to doing their day jobs) to enable EMC products to deliver a 6 9s experience in customer data centers. Your help is going to help us get there sooner and make this process more effective. Do consider contributing to this effort!

ViPR 2.0: New use-cases to support VPLEX and RecoverPoint

The GA of ViPR 2.0 was announced in time for EMC World. While there are significant announcements in ViPR 2.0, I will focus on the pieces that benefit VPLEX and RecoverPoint in this new integration.

A quick recap of what was supported prior to the 2.0 release is available here.

Support for Snaps and Clones on arrays behind VPLEX

In the 2.0 release, ViPR now supports full life cycle management of Snaps and Clones on arrays behind VPLEX. This allows customers to get a single pane of glass management function for snaps and clones. This seamless experience makes it easy for customers to take advantage of the performance and scale of these capabilities on underlying arrays and not compromise on the ease of use needed to make this capability work. Here is a demo of this capability.

Setting up a Local Mirror (RAID-1)

Another addition made in the ViPR 2.0 release is the ability to add a local mirror leg to a given virtual volume for the purposes of creating a RAID-1. This allows the volume to be protected across arrays. Here is a demo of what this capability is:

VPLEX and RP Protection

One of the big additions with the ViPR 2.0 release was common management for RecoverPoint within the VPLEX context. This allows RecoverPoint protection for VPLEX volumes to be accomplished through the same user interface. Combined with the end-to-end VPLEX provisioning through ViPR, you can now accomplish complete VPLEX provisioning with RecoverPoint. Please note that ViPR 2.0 does not support the MetroPoint topology. This is targeted for future releases.

Updated Provisioning use-case

Since ViPR 1.0, the provisioning for VPLEX has been updated. Here is a demo of the updated provisioning workflow.

RecoverPoint and VPLEX @ EMC World 2014

It is THAT time of the year. All roads lead to EMC World. In fact, as I write this, I am on my way to completing the second leg of my journey to EMC World. I have spent the last few days in Edmonton, Alberta with the VPLEX team. Some of the ideas we have been discussing have been simply mind-blowing. I cannot wait to get to build on those post EMC World. But let us get to the work at hand first. (Incidentally, Edmonton had snow today (May 4th)!!!!!!!! Yeah – thats their definition of spring I am told. Winter is when it is too cold to snow)

This will be our first year EMC World as a part of the new DPAD organization. Tons of excitement on that front. As we worked through the planning for EMC World, one of the big positives was the level of intersection that VPLEX and Recoverpoint have with other parts of EMC and the level of collaboration between all these teams to enable spreading that message. To all these teams, a BIG THANK YOU! This will result in VPLEX and RecoverPoint products having a really good presence on the show floor.

As always the sessions will be chock full of technical and strategy insights. A key part of the messages will be around the launches that we did for RecoverPoint and VPLEX in early April. However, beyond the new capabilities, there will also be sessions for the practitioners as well as customer insights. In fact, one of the key themes for us this year is customers who are presenting at the various sessions with us.

So without further ado, here is the list of VPLEX and RecoverPoint sessions at EMC World this year.

  • VPLEX: Introducing the VPLEX Virtual Edition: Presented by Cody Garvin who led the product management effort for VPLEX Virtual Edition. Cody will cover the basics of VPLEX/VE, why we built it the way we have, the use-cases we are targeting it for and deployment topologies that it will work with.
  • VPLEX: VPLEX Virtual Edition Architecture and use cases for 2014: Presented by super-CSE Don Kirouac (@dk_defined). Don will cover the gory depths of the VPLEX Virtual Edition to help you understand the architecture as well as what use-cases will work and how they will work out of the gate.
  • VPLEX: New VPLEX Provisioning Options with ViPR and Unisphere: Steve Breault and Peter Lund both product managers on the VPLEX team will present the fabulous integration work that has been done between the ViPR and the VPLEX team. They cover in-depth the pieces discussed in my prior blog post (here). In addition, there are new capabilities being added to ViPR to support VPLEX and RecoverPoint. Additionally, they will also discuss the work we have done with VPLEX Integrated Array Services (VIAS) discussed here.
  • VPLEX: Advanced Configuration and Design – Performance, Design, Failure Modes and More: The very cool Jen Aspesi (@routr_grl) presents this session. This was a big draw last year. Jen is amping it up with new learnings from the year, recommendations for new configurations. I have a feeling that this session will be oversubscribed once again.
  • VPLEX: Continuous Availability For All Business-Critical Applications: Robert Wagner presents this session. Here his focus is on the application layer and how they can be configured / structured to support continuous availability. If you think applications and VPLEX, this session is for you.
  • VPLEX: The Future of Availability powered by New VPLEX Use-cases: I have the privilege of presenting this session. This is an overview session which will talk about all the things that we have announced in 2014. It is a breadth first session which covers all the areas first and then each of the sessions above go into greater depth.
  • Introducing MetroPoint: Combining The Best Of VPLEX Metro And RecoverPoint Capabilities: This is an _awesome_ session presented by Saar Cohen, Chief Architect of RecoverPoint and Idan Kentor, one of our RecoverPoint focused CSE team members. I have seen Saar and Idan present this session in practice sessions. The content is top-notch and they cover the nitty-gritty of MetroPoint exceptionally well. You get the technical depth as well as the practical knowledge to realize your three site HA/DR dreams!
  • RecoverPoint Overview: Top Reasons Why Users Love It: Yossi Saad leads the Business Development team for VPLEX and RecoverPoint. He will cover the very well-known aspects as well as the not so well-known aspects of why RecoverPoint is the compelling solution that it is.
  • RecoverPoint: Accelerated Recovery for Virtual Environments: Yair Cohen presents this session that goes into depths of how RecoverPoint can protect and recover from virtualized environments. This shows all the work that we are doing to integrate with VMware and how those environments can be tuned / modified to make the experience seamless.
  • RecoverPoint: Planning and Deployment Best practices: Zahid Fadli is another of the RecoverPoint focused CSEs on our team. He will go into depths of how RecoverPoint can be configured, sized, architected for different environments. He will also cover best practices for the product in different environments. If you are a disaster recover practitioner, this is a session for you
  • RecoverPoint: Data protection for cost sensitive environments: Boaz Michaely presents this session to help customers and potential customers understand how they can derive even more value from RecoverPoint – this will talk about all the choices we have to drive down cost – data compression, WAN optimization, virtual RecoverPoint appliance.
  • VNX with VPLEX: Making Continous Operations seamless: I am presenting this VNX focused session which will cover all the goodness of VNX with VPLEX. In addition, it also covers the new items that we have introduced for integrations between VPLEX and VNX.

In addition, there are other partner sessions that also touch upon VPLEX and RecoverPoint. Here is the list:

  • EMC ViPR: Explore the New ViPR Control Services.
  • From Backup to Availability: Explore the Data Protection Continuum.
  • PowerPath Advanced Multipathing: What’s New in 2014.
  • VCE Vblock Data Protection and Mobility: Converge to Save Your Job.
  • Buckle Up! A 15-demo & Technical Tour of What’s New & What’s NEXT in Data Protection and Availability.
  • How Can ControlCenter & ProSphere Customers Make the Move to Storage Resource Management Suite.
  • Increasing Intelligence & Efficiency With Data Protection Advisor: Demonstrating Real Proof of Data Protection
  • AppSync 2.0: What’s New in 2014
  • VMAX Performance: Performance Aspects of Remote Replication

To all our internal partners: a BIG THANK YOU for making these possible and helping make RecoverPoint and VPLEX strategic to your ecosystem.

Other activities related to VPLEX and RecoverPoint:

  • Guy Churchward and Stephen Manley’s will cover the DPAD Portfolio in their super session on Tue at 3:00PM
  • Area 53: This is a super secret session about future capabilities being developed in the bunkers at EMC. Rumors are that there will be a healthy dose of some of the products you know and love. This is on Tue at 4:30 PM
  • Birds-of-a-feather – Data Protection and Availability Executive Panel – Redefining Data Protection for a Software-Defined World: Ask the DPAD Executive team
  • Hands On Labs – We have eight hands-on-labs that are focused on VPLEX and RecoverPoint. Do take your time to go through these. This should give you a very tactile operational feel for how the products work. Do not hesitate to give us feedback on what things you would like to see improved. Here is the list of HoLs:

  • HOL 07 – VPLEX: Introduction to VPLEX and VPLEX Integrated Array Services (VIAS)
  • HOL 08 – MetroPoint: Enhanced 3-site protection with VPLEX Metro and RecoverPoint
  • HOL 09 – VPLEX Virtual Edition: Continuous Availability Delivered To Your Geographically Dispersed ESXi Environments
  • HOL 10 – RecoverPoint: Discover Operational Recovery and Disaster Recovery In A Multi-Site Environment
  • HOL 17 – SRM Suite for VPLEX
  • HOL 20 – The Data Protection Continuum: Getting More with EMC Data Protection & Availability
  • HOL 22 – ViPR 2.0 – Introduction to all new 2.0 features, including VPLEX Snaps, Local Mirroring
  • HOL 24 – Infrastructure-As-A-Service Made Easy With VSPEX
  • And there are show floor displays, booth presentations and theatre presentations in the DPAD theatre as well as in partner theatres. Last but not the least, if you are one of the 100+ customers with NDA conversations set up, we are looking forward to meeting you as well.

    We are sending engineers, corporate systems engineers, architects and product managers to EMC World to help you get the best in terms of technical knowledge to help you understand the product, its directions and what capabilities we have added to it over the last year. I cannot wait for the show to kick off!

    Talkin’ about VPLEX and RecoverPoint Part 4

    The past three editions of these have been very popular. Our marketing and CSE team has created some new videos in support of the Q2 launches for VPLEX and RecoverPoint. So here are twelve videos for you to dig into.

    1. Why VPLEX for VMware Environments: Don Kirouac does an excellent job explaining how VPLEX integrates with VMware environments.
    2. Why VPLEX for Oracle RAC: Don Kirouac from the Corporate Systems Engineering team talks about the integration between Oracle RAC and VPLEX Metro to deliver continuous availability
    3. VPLEX with XtremIO: Charlie Kraus from the Product Marketing team explains how VPLEX delivers value to XtremIO environments
    4. ViPR with VPLEX and RecoverPoint: Devon Helms from the Product Marketing team explains how provisioning for VPLEX and RecoverPoint can be made simple with the ViPR Controller.
    5. Why VPLEX for SAP: Jim Whalen from the Solutions Marketing Team explains how VPLEX can help deliver SAP Application Availability.
    6. Why VPLEX for Microsoft Hyper-V Environments: Charlie Kraus talks about how VPLEX integrates with Microsoft Hyper-V environments to deliver mobility and availability
    7. VPLEX with Vblock: Charlie Kraus delves into how VPLEX integrates with and provides value to a Vblock environment.
    8. VSPEX Solutions for VPLEX and RecoverPoint: Karl Connolly from the VSPEX Marketing Team

    9. MetroPoint topology: Paul Danahy and I walk through the benefits and value propositions of the MetroPoint topology
    10. VPLEX Virtual Edition: Paul Danahy and I introduce the VPLEX Virtual Edition solution and why we think this is such a game changer
    11. Simplified Provisioning with VPLEX: Paul Danahy and I talk through how VPLEX Integrated Array Services simplifies provisioning with VPLEX
    12. EMC AppSync for RecoverPoint: Parag Pathak from the AppSync Marketing team and Devon Helms talk about the integration between AppSync and RecoverPoint to deliver application consistent protection

    2014 Launch Post 2: MetroPoint: Extending the Availability and Protection Continuum

    On April 4th, 2014, as part of the Data Protection and Availability Division (DPAD) launch, there were three VPLEX and RecoverPoint items that were launched or GAd:

    • VPLEX Virtual Edition – Availability late Q2
    • MetroPoint Topology – Joint capability of VPLEX and RecoverPoint – Availability Late Q2
    • VPLEX Integrated Array Services – Available now

    This is the second in a series of posts to walk through what was launched / delivered.

    VPLEX and RecoverPoint

    It has been two years since we introduced the RecoverPoint splitter within VPLEX. The awesomeness of VPLEX was joined with the coolness of RecoverPoint. With this combination, we delivered operational and disaster recovery to VPLEX customers to add to the continuous availability that they already had access to. These were extremely complementary use-cases. While there were a lot of skeptics outside of EMC about this combination, we were quietly confident in our belief that customer wanted an extended continuum between disaster recovery and continuous availability. Suffice it to say, that this combination has exceeded our revenue expectations. Since the launch in May 2012, the organizations have come even closer together within a single business unit further solidifying the bonds between the two teams.

    A quick recap of the current integration points between VPLEX and RecoverPoint.

    RecoverPoint delivers continuous data protection enabling local and/or remote protection. This is enabled by a RecoverPoint splitter which resides within the VPLEX platform. RecoverPoint has a similar splitter in the VMAX and VNX platform as well. The RP splitter enables WRITES to be sent to a RecoverPoint Appliance (RPA). From there, you can enable local protection (where the writes are journaled locally) or remote protection (where the writes are journaled remotely) or both. The beauty of RecoverPoint is that it can store every single write to give recovery a DVR like capability. The other benefit of RecoverPoint is that the protection is heterogeneous i.e. it can protect between every combination of VPLEX / VMAX and VNX.

    The combination of VPLEX and RecoverPoint supports the following topologies:

    1. VPLEX Local with RecoverPoint Local Protection
    2. VPLEX Local with RecoverPoint Remote Protection
    3. VPLEX Metro with RecoverPoint Local Protection
    4. VPLEX Metro with RecoverPoint Remote Protection
    5. The slide below shows the currently supported topologies.

      Currently supported VPLEX and RecoverPoint topologies
      Currently supported VPLEX and RecoverPoint topologies

      Customer topologies are all over the map – we see a lot of traction with the VPLEX Local and RecoverPoint Remote Protection (as we expected). However, the second largest topology is the three sided cascaded topology. And that was a surprise. Upon digging further, a lot of customers have business requirements that need them to have out-of-region disaster recovery site. Yet other customers are deploying VPLEX Metro within one site. So, the usage of RecoverPoint in this case is to provide DR to a Metro deployed within the site. This is the cascaded topology.

      As you can imagine, the downside of the cascaded topology is that if the replicating VPLEX Cluster fails or loses connectivity, DR protection is lost. Since the launch of RecoverPoint on VPLEX quite a few customers have been asking us to add the capability to protect both sides of a VPLEX Metro to a common third site using RecoverPoint. Well, that is exactly what we have done.

      MetroPoint: Operational and Disaster Protection across both sides of a VPLEX Metro

      MetroPoint Topology
      MetroPoint Topology

      The MetroPoint solution launched April 4th will GA at the end of Q2. This is a joint capability between RecoverPoint and VPLEX. Starting with RecoverPoint 4.1 and GeoSynchrony 5.4, customers will now be able to add Disaster Recovery and Operational Recovery protection to both sides of a distributed volume. With MetroPoint, we took the time to do this right – although the protection is on both sides of a distributed volume, only one of the sides is replicating data. The data goes to a single copy of a DR leg. In other words, no additional bandwidth or storage is needed to enable MetroPoint as compared to enabling a standard DR scenario.

      To enable this, we have created a new kind of consistency group called MetroPoint consistency groups. This enables replication on both sides of a distributed volume. Another characteristic of the MetroPoint consistency group is that you can load balance which site is the primary replication site. If there is a failure on the primary replication site, the replication will AUTOMATICALLY switch to the surviving site. In other words, there is no loss of DR protection even if you lose the primary replication site.

      To me, one of the more exciting implications is the extension of the VMware HA and VMware SRM use cases to the MetroPoint topology. Here is what this looks like:

      image
      MetroPoint with VMware HA and SRM

      The VPLEX Metro sites are protected with VMware HA and the remote DR site is protected with VMware SRM. This now gives our customers simultaneous HA and DR.

      One comment here: We talk about MetroPoint as a three site deployment and that is true. However, it is worth remembering that there are a number of customers who deploy VPLEX Metro within a data center either to protect multiple floors or multiple SANs or across a campus type environment. In those scenarios, customers can use MetroPoint to protect to a second site. There is a lot of interest in this deployment model.

      More coolness – along the way, we were able to meet one more request that our customers had requested. With the MetroPoint consistency group, we were able to provide operational recovery on both sides of a VPLEX Metro. And this does not need a third site!!

      Operational Recovery on both sides of a VPLEX Metro
      Operational Recovery on both sides of a VPLEX Metro

      To top this all off, MetroPoint is completely heterogeneous. All these goodies work with both EMC as well as non-EMC arrays. So long as the storage array is supported by VPLEX, you are good to go.

      Here is a short video that Paul Danahy and I put together to give you brief overview of MetroPoint:

      With MetroPoint, we have raised the bar on continuous availability and disaster recovery. This has been the result of collaboration between the VPLEX and RecoverPoint engineering team with a lot of input from some of our lead customers. To all those who helped us get here, a very BIG thank you!