Hyperconverged File Systems PT1 Taxonomy

One way of categorizing Hyperconverged filesystems (or any filesystem really) is by how data is distributed across the nodes, and the method used to track/retrieve that data. The following is based on knowledge of the internals of Nutanix and publicly available information for the other systems.

Metadata

Characteristics

Implemented by

Distributed Distributed data & metadata Nutanix Hash Random data distribution, hash-lookup (object store) VSAN Dedupe Data stored in HA-Pairs, Lookup by fingerprint Simplivity Dedupe Random data distribution, Lookup by fingerprint Springpath/Hyperflex Psuedo Distributed Data stored in HA pairs, Unified namespace via redirection NetApp C-Mode
    Nutanix uses a fully distributed metadata layer that allows the cluster to decide where to place data depending on the location of the VM accessing it. The data can move around to follow the VM. The Nutanix FS uses a lot of ideas from distributed systems research and implementation, rather than taking a classic filesystems approach and applying it to HCI.

Author: gary

Performance hacker @ nutanix.com