Nutanix Sizer automates and simplifies the process of sizing Nutanix infrastructure for specific workloads, but it is still important to understand certain best practices when architecting a Nutanix solution.
RF2 clusters require a minimum of 3 nodes. While there are only two copies of data, a fully healthy cluster requires 3 copies of metadata.
Note
Single-Node and Two-Node clusters are an exception to this minimum.
Two-Node clusters provide the same level of resiliency as a standard 3 node RF2 cluster but leverage an external Witness VM to arbitrate in the event of a node failure. Refer to the Prism Web Console Guide for guidelines and requirements for Two-Node clusters.
Single-Node clusters provide the ability to run a limited number of user VMs with local resiliency. If an SSD fails, the cluster goes into a read-only state. In the read-only state, there are no create, read, update, or delete operations and no cluster changes. If the cluster experiences an unresponsive SSD, data loss is a risk. The Prism Central console offers an override option that allows users to continue writing to single-node clusters after it has entered read-only mode. Otherwise, be sure there is more than 1 SSD available for the system to place metadata. Refer to the Prism Web Console Guide for guidelines and requirements for Single-Node clusters.
RF2 clusters should be sized to account for N+1 availability for BOTH compute AND storage. In the event of a node failure you should have enough resources to run all VMs and re-protect storage.
Refer to the following Nutanix TechTopX video for a detailed explanation of how Nutanix protects data:
Refer to the following Nutanix TechTopX video for a detailed explanation of how Nutanix implements scalable metadata:
Nutanix supports DARE through Self-encrypting SSDs and HDDs, providing a FIPS 140-2 validated hardware-based solution. Use of self-encrypting drives (SEDs) also require the use of a third-party Key Management Service (KMS), such as Vormetric, SafeNet, and IBM SKLM.
Nutanix also supports DARE as a FIP 140-1 software-based solution, using Intel Advanced Encryption Standard instructions. Software-based encryption is available across AHV, ESXi, and Hyper-V on x86 platforms.
Refer to the Information Security Tech Note for complete details on DARE.
Block fault tolerance is the Nutanix cluster’s ability to make redundant copies of any data and place the data on nodes that are not in the same block. Metadata also must be block fault tolerant. A block is a rack-mountable enclosure that contains one to four Nutanix nodes. The power supplies, front control panels (ears), backplane, and fans are shared by all nodes in a block. With block fault tolerance enabled, guest VMs can continue to run with a block failure because redundant copies of guest VM data and metadata exist on other blocks.
Refer to the NX and SX Series Hardware Administration Guide for complete details on Block Fault Tolerance requirements.
Unlike traditional data efficiency technologies like compression and deduplication, Erasure Coding (EC-X) does not depend on the content of that data, but rather whether or not the data is frequently updated/overwritten. This means that workloads that store lots of pre-compressed data (e.g. encoded audio, images, and video) or unique data that can’t be deduplicated (e.g. logging data captured by a workload such as Splunk or ELK Stack) can still benefit from the space savings of Erasure Coding.
For details on minimum node requirements and sizing impact of Erasure Coding, refer to Sizing assumptions for solutions with Erasure Coding (EC-X) and Erasure Coding Overheads.
Prism Central is used to monitor and manage entities across multiple Nutanix clusters. Prism Central is deployed as 1 or 3 VMs and is sized relative to the number of VMs managed across all clusters.
When sizing a cluster that will run Prism Central it is important to account for that overhead within Nutanix Sizer.
Refer to the latest Prism Central Release Notes > Prism Central Scalability for exact VM specification requirements.
The Controller Virtual Machine (CVM) is responsible for all cluster data services on each node. While Sizer accounts for CVM CPU and memory overhead appropriately for specific workloads, these values can be manually specified for Raw Input workloads.
Controller VM memory allocation requirements differ depending on the models and the features that are being used. Refer to the latest Acropolis Advanced Administration Guide > Controller VM Memory Configurations for exact VM specifications.
While product support doesn’t directly impact sizing, ensuring the appropriate level of support to meet customer requirements is still a critical component of the overall solution. Nutanix offers multiple tiers of both software and hardware support, as well as programs for non-returnable disks, and on-site spare parts.
Click here for complete details on available Nutanix Support Programs.