Clustered file system

A clustered file system is a file system that is shared by being simultaneously mounted on multiple servers. There are several approaches to clustering, and most of them do not employ a clustered file system. Without one, however, the complexity of the underlying storage environment increases as servers are added.

Shared-disk

A shared-disk file system uses a storage area network (SAN) or RAID to give multiple computers direct disk access at the block level. Translation from the file-level operations that applications use to the block-level operations used by the SAN must take place on the client node, that is, on each computer that mounts the file system. Shared-disk file systems are the most common type of clustered file system. They add a concurrency-control mechanism that provides a consistent and serializable view of the file system, avoiding corruption and unintended data loss even when multiple clients try to access the same files at the same time. They also usually employ some form of fencing mechanism to prevent data corruption when a node fails.[citation needed]
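The interplay of concurrency control and fencing described above can be sketched in a few lines. This is a toy model under stated assumptions, not the mechanism of any particular file system: the class names `FencedLockManager` and `SharedDisk` are hypothetical, and the fencing-token scheme shown is only one possible technique (real deployments may instead rely on storage-level reservations or on powering off the failed node).

```python
import threading


class FencedLockManager:
    """Hypothetical lock manager: one holder per path, plus increasing fencing tokens."""

    def __init__(self):
        self._mutex = threading.Lock()
        self._holders = {}       # path -> (node_id, token)
        self._next_token = 1

    def acquire(self, node_id, path):
        """Return a fencing token if the lock is free, or None if another node holds it."""
        with self._mutex:
            if path in self._holders:
                return None
            token = self._next_token
            self._next_token += 1
            self._holders[path] = (node_id, token)
            return token

    def release(self, node_id, path):
        """Release the lock on path if node_id is the current holder."""
        with self._mutex:
            holder = self._holders.get(path)
            if holder is not None and holder[0] == node_id:
                del self._holders[path]

    def expire(self, node_id):
        """Drop every lock held by a node that is presumed to have failed."""
        with self._mutex:
            stale = [p for p, (n, _) in self._holders.items() if n == node_id]
            for path in stale:
                del self._holders[path]


class SharedDisk:
    """Hypothetical shared store that rejects writes carrying a stale fencing token."""

    def __init__(self):
        self.blocks = {}
        self._highest_token = {}  # path -> highest token accepted so far

    def write(self, path, data, token):
        if token < self._highest_token.get(path, 0):
            return False          # writer was fenced off; ignore its write
        self._highest_token[path] = token
        self.blocks[path] = data
        return True


locks = FencedLockManager()
disk = SharedDisk()
token_a = locks.acquire("node-a", "/data/log")    # node-a obtains the lock
blocked = locks.acquire("node-b", "/data/log")    # None: node-b must wait
locks.expire("node-a")                            # node-a is presumed dead
token_b = locks.acquire("node-b", "/data/log")    # node-b takes over
disk.write("/data/log", b"from node-b", token_b)  # accepted
disk.write("/data/log", b"from node-a", token_a)  # rejected: stale token
```

In this sketch the lock gives a serial order of writers, and the token check stops a node that was declared failed, but is still running, from overwriting newer data.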

The underlying storage area network might use any of a number of block-level protocols, including SCSI, iSCSI, HyperSCSI, ATA over Ethernet (AoE), Fibre Channel, and InfiniBand.

There are different architectural approaches to a shared-disk file system. Some distribute file information across all the servers in a cluster (fully distributed). Others use a centralized metadata server. Both achieve the same result: every server can access all the data on a shared storage device.[citation needed]
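The two approaches can be contrasted with a minimal sketch. Everything here is illustrative: the class names are hypothetical, and choosing the owning server by hashing the path is an assumption made for brevity, not a description of how any specific product distributes its metadata.

```python
import hashlib


class CentralMetadataServer:
    """Hypothetical single server that maps each file path to its block list."""

    def __init__(self):
        self._table = {}

    def register(self, path, blocks):
        self._table[path] = list(blocks)

    def lookup(self, path):
        return self._table[path]


class DistributedMetadata:
    """Hypothetical fully distributed variant: each path's metadata lives on one
    server, chosen by hashing the path (an assumption for this sketch)."""

    def __init__(self, server_names):
        self._names = sorted(server_names)
        self.servers = {name: {} for name in self._names}

    def owner(self, path):
        digest = hashlib.sha256(path.encode("utf-8")).digest()
        index = int.from_bytes(digest[:8], "big") % len(self._names)
        return self._names[index]

    def register(self, path, blocks):
        self.servers[self.owner(path)][path] = list(blocks)

    def lookup(self, path):
        return self.servers[self.owner(path)][path]


central = CentralMetadataServer()
spread = DistributedMetadata(["srv1", "srv2", "srv3"])
for catalog in (central, spread):
    catalog.register("/home/a.txt", [120, 121, 87])

# Either way, a client ends up with the same block list for the shared device.
same_result = central.lookup("/home/a.txt") == spread.lookup("/home/a.txt")
```

The centralized form is simpler but concentrates load and failure risk on one server; the distributed form spreads both at the cost of a placement rule that every node must agree on.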

Usage

Scientists working at ALICE will be using a 4 Gbit/s Fibre Channel SAN with a clustered file system to store the massive amount of data generated by the experiment (estimated at 1 GB/second for a month). Reasons quoted for this choice include "performance, scalability and vendor independence".[1]
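For a sense of scale, the quoted figures can be turned into rough totals. The calculation below is a back-of-the-envelope sketch that assumes a 30-day month, decimal units (1 PB = 1,000,000 GB), and nominal line rate with no protocol or encoding overhead; the function names are illustrative.

```python
import math

SECONDS_PER_DAY = 24 * 60 * 60


def total_volume_gb(rate_gb_per_s, days):
    """Data volume in GB produced at a constant rate over the given number of days."""
    return rate_gb_per_s * days * SECONDS_PER_DAY


def links_needed(rate_gb_per_s, link_gbit_per_s):
    """Minimum number of links at nominal line rate (overhead ignored)."""
    link_gb_per_s = link_gbit_per_s / 8   # 8 bits per byte
    return math.ceil(rate_gb_per_s / link_gb_per_s)


volume_gb = total_volume_gb(1, 30)   # 2592000 GB, about 2.6 PB
links = links_needed(1, 4)           # 2 links of 4 Gbit/s each
```

Under these assumptions, a month of data taking amounts to roughly 2.6 PB, and a single 4 Gbit/s link (0.5 GB/s nominal) could not carry the full 1 GB/s stream on its own.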

Distributed file systems

Distributed file systems do not share block-level access to the same storage; instead, clients reach the data through a network protocol.

Network attached storage

Network-attached storage (NAS) provides both storage and a file system, like a shared-disk file system on top of a SAN. NAS typically uses file-based protocols (as opposed to block-based protocols) such as NFS (popular on UNIX systems), SMB/CIFS (Server Message Block/Common Internet File System, used with Microsoft Windows systems), or AFP (used with Apple Macintosh computers).
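The difference between file-based and block-based access comes down to where a request such as "read 200 bytes at offset 4000" is translated into disk block numbers: on the NAS device itself, or on the client node in the shared-disk case. A minimal sketch of that translation follows, assuming a fixed 4096-byte block size and a simple per-file list that maps logical block index to physical block number (both are assumptions for illustration, as is the function name).

```python
BLOCK_SIZE = 4096  # assumed block size in bytes


def blocks_for_read(block_map, offset, length, block_size=BLOCK_SIZE):
    """Translate a file-level read (offset, length) into physical block numbers.

    block_map[i] is the physical block that stores logical block i of the file.
    """
    if length <= 0:
        return []
    first = offset // block_size
    last = (offset + length - 1) // block_size
    return [block_map[i] for i in range(first, last + 1)]


# A four-block file scattered across the disk.
block_map = [120, 121, 87, 300]

# Bytes 4000..4199 straddle logical blocks 0 and 1.
needed = blocks_for_read(block_map, offset=4000, length=200)   # [120, 121]
```

With a file-based protocol, the client sends only the path, offset, and length over the network; with a block-based protocol, the client performs this lookup itself and requests the resulting blocks.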

Shared-nothing vs. single point of failure

Disk hardware can be a single point of failure whose loss results in data loss. To avoid this problem, a shared-nothing architecture can be employed. Each storage node communicates changes to other nodes or to a master for replication purposes. If a single disk fails, other copies can be used to reconstruct or replace it on the fly, so "nothing" is lost. To enable this feature, clients must be unaware of the physical location of a file. A single global file system is presented to clients, so the file system itself deals with allocations and low-level failures. Examples of this type of file system are found in products such as Ceph, Lustre, Isilon, IBRIX Fusion, and Hadoop.[2]
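The replication idea can be illustrated with a small in-memory model. This is a sketch under stated assumptions, not the design of any of the products named above: the class names are hypothetical, placement uses a trivial byte-sum rule, and failures are simulated with an `alive` flag.

```python
class StorageNode:
    """Hypothetical storage node with its own private disk (a dict here)."""

    def __init__(self, name):
        self.name = name
        self.files = {}
        self.alive = True


class ReplicatedNamespace:
    """Hypothetical global namespace that keeps several copies of every file.

    Clients address files by path only; the placement of copies is internal.
    """

    def __init__(self, nodes, replicas=2):
        if replicas > len(nodes):
            raise ValueError("need at least as many nodes as replicas")
        self.nodes = list(nodes)
        self.replicas = replicas
        self._placement = {}   # path -> nodes holding a copy

    def write(self, path, data):
        # Simplification: pick consecutive nodes starting at a path-derived index.
        start = sum(path.encode("utf-8")) % len(self.nodes)
        chosen = [self.nodes[(start + i) % len(self.nodes)]
                  for i in range(self.replicas)]
        for node in chosen:
            node.files[path] = data
        self._placement[path] = chosen

    def read(self, path):
        # Serve the read from the first copy whose node is still alive.
        for node in self._placement[path]:
            if node.alive:
                return node.files[path]
        raise IOError("all replicas of %s are unavailable" % path)


nodes = [StorageNode("n1"), StorageNode("n2"), StorageNode("n3")]
fs = ReplicatedNamespace(nodes, replicas=2)
fs.write("/results/run1.dat", b"payload")

nodes[0].alive = False                    # simulate losing one node
survived = fs.read("/results/run1.dat")   # still b"payload"
```

Because the client never names a node, the system is free to serve the read from any surviving copy, which is the property the paragraph above relies on.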

History

IBM mainframes in the 1970s could share physical disks and file systems if each machine had its own channel connection to the drives' control units. In the 1980s, Digital Equipment Corporation's TOPS-20 and VAX/VMS clusters included shared-disk file systems.[3]

References

Wikimedia Foundation. 2010.
