Grid-oriented storage

Grid-oriented Storage (GOS) is a dedicated data storage architecture which can be connected directly to a computational grid to support advanced data bank services and reservoirs for data that can be shared among multiple computers and end users on the grid.

Description

In the Grid Computing era, GOS is a successor to the widely used network-attached storage (NAS) products. Four elements motivate and enable this architecture: a GOS-specific File System (GOS-FS), a P2P clustering technique, a single-purpose GOS operating system (OS), and a browser-based management console. GOS is described as the first demonstration that office, database, Web and media applications can be accelerated tenfold in real-world tests. Like traditional file servers, GOS systems usually contain one or more hard disks, often arranged into logical, redundant storage containers or RAIDs (redundant arrays of independent disks).

GOS is designed to deal with long-distance, cross-domain and single-image file operations, which are typical in Grid environments. To any entity on the grid, GOS behaves like a file server that is accessed via the file-based GOS-FS protocol. Inspired by the success of GridFTP, GOS-FS integrates a parallel stream engine and the Grid Security Infrastructure (GSI). Because it conforms to the universal VFS (Virtual Filesystem Switch), GOS-FS can be used pervasively as an underlying platform that makes the best use of the increased transfer bandwidth and accelerates NFS/CIFS-based applications. GOS can also run over SCSI, Fibre Channel or iSCSI without affecting the acceleration performance, so the same system offers both file-level protocols and block-level protocols for a Storage Area Network (SAN).
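The idea behind a parallel stream engine can be illustrated with a minimal sketch. It is not the GOS-FS implementation: it assumes that a transfer is split into contiguous byte ranges and that each range is moved by its own worker, with local files standing in for the network endpoints. The names `split_ranges`, `_copy_range` and `parallel_copy` are hypothetical and do not come from GOS-FS or GridFTP.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def split_ranges(size, streams):
    """Divide `size` bytes into up to `streams` contiguous (offset, length) ranges."""
    if size < 0 or streams < 1:
        raise ValueError("size must be >= 0 and streams >= 1")
    base, extra = divmod(size, streams)
    ranges, offset = [], 0
    for i in range(streams):
        # The first `extra` ranges take one additional byte each.
        length = base + (1 if i < extra else 0)
        if length == 0:
            continue
        ranges.append((offset, length))
        offset += length
    return ranges


def _copy_range(src_path, dst_path, offset, length, chunk=64 * 1024):
    """Copy one byte range; each call stands in for one independent stream."""
    with open(src_path, "rb") as src, open(dst_path, "r+b") as dst:
        src.seek(offset)
        dst.seek(offset)
        remaining = length
        while remaining > 0:
            data = src.read(min(chunk, remaining))
            if not data:
                break
            dst.write(data)
            remaining -= len(data)
    return length - remaining


def parallel_copy(src_path, dst_path, streams=4):
    """Copy src to dst with several concurrent range copies; return bytes copied."""
    size = os.path.getsize(src_path)
    # Pre-size the destination so every worker can seek to its own offset.
    with open(dst_path, "wb") as dst:
        dst.truncate(size)
    ranges = split_ranges(size, streams)
    if not ranges:
        return 0
    with ThreadPoolExecutor(max_workers=len(ranges)) as pool:
        copied = pool.map(lambda r: _copy_range(src_path, dst_path, *r), ranges)
        return sum(copied)
```

On a long-distance link, several streams can help because a single connection is often limited by its window size and round-trip time rather than by the raw bandwidth. In this local stand-in, the threads only demonstrate the range bookkeeping, not any speed-up.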

The geographically distributed nature of the resources that make up the grid infrastructure, along with their heterogeneity and the different control policies in different domains, makes the availability of these resources dynamic and conditional upon local constraints. Centralized management techniques for these resources are limited in their scalability, in terms of both execution efficiency and fault tolerance. Providing services across such platforms therefore requires a distributed resource management mechanism. The P2P-clustered GOS appliances allow a single storage image to continue to expand even when a single GOS appliance reaches its capacity limit. The cluster shares a common, aggregate presentation of the data stored on all participating GOS appliances, while each GOS appliance manages its own internal storage space. The major benefit of this aggregation is that users can access clustered GOS storage as a single mount point.
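The single-image behaviour described above can be pictured with a small, purely illustrative model. It assumes a first-fit placement rule and an in-memory path index, neither of which is taken from the GOS design. The classes `Appliance` and `Cluster` and all of their methods are hypothetical names.

```python
class Appliance:
    """One storage node that manages only its own space (hypothetical model)."""

    def __init__(self, name, capacity):
        self.name = name
        self.capacity = capacity  # bytes this node can hold
        self.files = {}           # path -> bytes stored on this node

    def free(self):
        return self.capacity - sum(len(data) for data in self.files.values())


class Cluster:
    """Aggregate view: callers see one namespace, not individual appliances."""

    def __init__(self):
        self.appliances = []
        self.index = {}  # path -> appliance that holds the file

    def join(self, appliance):
        """Add a node; the single image grows by that node's capacity."""
        self.appliances.append(appliance)

    def capacity(self):
        return sum(a.capacity for a in self.appliances)

    def write(self, path, data):
        """Place a new file on the first node with room (assumed policy)."""
        if path in self.index:
            raise FileExistsError(path)
        for appliance in self.appliances:
            if appliance.free() >= len(data):
                appliance.files[path] = data
                self.index[path] = appliance
                return appliance.name
        raise OSError("no appliance has enough free space")

    def read(self, path):
        """Resolve the path through the shared index, wherever the data lives."""
        return self.index[path].files[path]

    def listdir(self):
        """One listing that covers every participating appliance."""
        return sorted(self.index)
```

In this model, a write that fails because the only appliance is full succeeds once a second appliance joins, and `listdir` and `read` keep working without the caller knowing which node holds a given file.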

GOS products fit the thin-server categorization. Compared with traditional “fat server”-based storage architectures, thin-server GOS appliances offer several advantages: they alleviate potential network/grid bottlenecks, their CPU and OS are optimized for I/O only, and they are easy to install, remotely manageable, low in maintenance and cost, and Plug and Play. Other devices that follow the same thin-server approach include NAS, printers, fax machines, routers and switches.

An Apache server is installed in the GOS operating system and provides HTTPS-based communication between the GOS server and an administrator using a Web browser. Remote management and monitoring make it easy to set up, manage, and monitor GOS systems.

History

Grid-oriented Storage (GOS) was proposed by Prof. Frank Wang and Dr. Na Helian in 2003 in a funding proposal to the UK government, titled “Grid-Oriented Storage (GOS): Next Generation Data Storage System Architecture for the Grid Computing Era”. The proposal was approved and granted one million pounds in 2004. The first prototype was constructed in 2005 at the [http://www.cranfield.ac.uk/soe/amac/computing/ Centre for Grid Computing, Cambridge-Cranfield High Performance Computing Facility]. The first conference presentation was at the IEEE Symposium on Cluster Computing and Grid (CCGrid), 9-12 May 2005, Cardiff, UK. As one of the five best work-in-progress papers, it was included in IEEE Distributed Systems Online. In 2006, the GOS architecture and its implementations were published in IEEE Transactions on Computers under the title “Grid-oriented Storage: A Single-Image, Cross-Domain, High-Bandwidth Architecture”. From January 2007, the GOS invention led to invitations to present demonstrations at Princeton University, Cambridge University (Computer Lab), IBM, Rolls Royce, BBC, Xerox, Carnegie Mellon University, CERN, and EADS, among others. A number of collaborations with large industrial companies have followed.

External links

[http://www.cranfield.ac.uk/soe/amac/computing/ Grid Computing is already moving from a research vision into production]

Academic papers

• Frank Wang, Na Helian, Sining Wu, Yuhui Deng, Yike Guo, Steve Thompson, Ian Johnson, Dave Milward & Robert Maddock, Grid-Oriented Storage, IEEE Distributed Systems Online, Volume 6, Issue 9, Sept. 2005.

• Frank Wang, Sining Wu, Na Helian, Andy Parker, Yike Guo, Yuhui Deng, Vineet Khare, Grid-oriented Storage: A Single-Image, Cross-Domain, High-Bandwidth Architecture, IEEE Transactions on Computers, Vol.56, No.4, pp. 474-487, 2007.

• Frank Zhigang Wang, Sining Wu, Na Helian, An Underlying Data-Transporting Protocol for Accelerating Web Communications, International Journal of Computer Networks, Elsevier, 2007.

• Frank Zhigang Wang, Sining Wu, Na Helian, Yuhui Deng, Vineet Khare, Chris Thompson and Michael Parker, Grid-based Data Access to Nucleotide Sequence Database with 6x Improvement in Response Times, New Generation Computing, No.2, Vol.25, 2007.

• Frank Wang, Yuhui Deng, Na Helian, Evolutionary Storage: Speeding up a Magnetic Disk by Clustering Frequent Data, IEEE Transactions on Magnetics, Issue.6, Vol.43, 2007.

• Frank Zhigang Wang, Na Helian, Sining Wu, Yuhui Deng, Vineet Khare, Chris Thompson and Michael Parker, Grid-based Storage Architecture for Accelerating Bioinformatics Computing, Journal of VLSI Signal Processing Systems, No.1, Vol.48, 2007.

• Yuhui Deng and Frank Wang, A Heterogeneous Storage Grid Enabled by Grid Service, ACM Operating Systems Review, No.1, Vol.41, 2007.

• Yuhui Deng & Frank Wang, Optimal Clustering Size of Small File Access in Network Attached Storage Device, Parallel Processing Letters, No.1, Vol.17, 2007.


Wikimedia Foundation. 2010.
