Lustre high-performance storage and file system now and in the future

Last time, we introduced Lustre, a new clustered storage architecture that offers an open parallel file system with extreme scalability. Today, we’ll talk about how it is being used in the scientific world and how it could eventually be used in the commercial sector.

Lustre is object-based, runs on Linux, and provides storage to high performance computing (HPC) environments that require ultra-fast I/O. The plan is for Lustre to scale to the point where it can incorporate tens of thousands of nodes to provide parallel I/O to a grid of application servers. As such, it plays a key role in the ongoing development of data grids.

Lustre is still in the relatively early stages of development, but it is already in production use. Ultimately, Lustre is meant to have automatic failover and server reboot, have no single point of failure, work with tens of thousands of clients and thousands of application servers, and be application-transparent. But what is its present status?

Release 1, in production for about a year now, is up and running at a number of HPC sites, including Pacific Northwest National Labs (53 terabytes of storage serving 1,280 dual-CPU Intel Xeon clients) and the National Center for Supercomputing Applications (150 terabytes of storage serving 1,280 dual-CPU Xeon clients and 104 server nodes, all on a Gigabit Ethernet backbone). Lustre now runs on i386, ia64, and x86-64 platforms, and it is being tested on the PowerPC processor. An OS X client version is under development.

Lustre is aimed at serving large computer clusters, but with minor variations in the implementation it will work in smaller commercial environments as well. Key points regarding the file system topology are:

* Data and metadata are stored separately. Data resides on an “object storage target” (OST), which includes both “object storage servers” and storage devices. Data is addressed using metadata services that reside on “metadata servers.”
* Clients (ultimately in the tens of thousands) can reside on any of several types of LANs (Gigabit Ethernet and InfiniBand, for example) simultaneously.
* Data is accessed by first getting a file’s metadata from an active-passive pair of metadata servers (MDS). These use a journaling file system and act as the front end to a dedicated metadata database. All file system namespace operations, such as file lookups, file creation, and file and directory attribute manipulation, take place here. This is a high availability solution: should the active MDS fail, the standby server takes over immediately.
* Sitting between the data and the clients is a series of object storage servers (OSS), which manage the storage located on the storage devices.
* The data itself resides on storage devices behind the OSSes. Storage devices are treated as object-based storage, and may be of any sort: RAID, JBOD or individual disks. They may be connected to the OSSes directly or by a networked connection.

All files fall within a global namespace, which means that the file system presents the many directories from multiple file servers as a single unified directory tree. The value of this is that the single directory tree is valid from every workstation and remains valid when configurations are updated.

Why should commercial users be interested in a technology that is clearly designed for high performance technical environments? Because the benefits of high performance computing often find their way to commercial applications sooner than we might think. Even today, there are reports (unconfirmed, but from a very credible source) that one commercial site is putting together a large-scale Lustre implementation.
And besides, despite the fact that Lustre is intended for use in a high performance computing file system, Lustre runs on commodity hardware, which would make things easier when the time comes for it to move over to the commercial world. Until then, look for Lustre to appear on a data grid near you.
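
For readers who want to see the topology in concrete terms, the access path described in the key points (ask a metadata server where a file lives, then fetch the data from the object storage servers, with a standby metadata server taking over if the active one fails) can be sketched in a few lines of Python. This is a toy model under stated assumptions, not Lustre code: the class names, the `lookup` and `read` methods, the sample path, and the idea of a file split across exactly two objects are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class MetadataServer:
    """Toy MDS: maps a path to a list of (oss_index, object_id) pairs."""
    name: str
    alive: bool = True
    layouts: dict = field(default_factory=dict)

    def lookup(self, path):
        if not self.alive:
            raise ConnectionError(f"{self.name} is down")
        return self.layouts[path]


@dataclass
class ObjectStorageServer:
    """Toy OSS: serves objects by ID and knows nothing about file names."""
    objects: dict = field(default_factory=dict)

    def read(self, object_id):
        return self.objects[object_id]


class Client:
    def __init__(self, active, standby, oss_list):
        self.mds_pair = [active, standby]
        self.oss_list = oss_list

    def _lookup(self, path):
        # Try the active MDS first, then fall back to the standby.
        for mds in self.mds_pair:
            try:
                return mds.lookup(path)
            except ConnectionError:
                continue
        raise ConnectionError("no metadata server reachable")

    def read_file(self, path):
        layout = self._lookup(path)  # step 1: metadata from the MDS pair
        # Step 2: fetch each object directly from the OSS that holds it.
        chunks = [self.oss_list[i].read(obj) for i, obj in layout]
        return b"".join(chunks)


# Hypothetical setup: both metadata servers share one namespace table,
# standing in for the shared metadata store behind an active-passive pair.
layouts = {"/scratch/run1.dat": [(0, "obj-a"), (1, "obj-b")]}
active = MetadataServer("mds-active", layouts=layouts)
standby = MetadataServer("mds-standby", layouts=layouts)
oss = [
    ObjectStorageServer({"obj-a": b"hello "}),
    ObjectStorageServer({"obj-b": b"grid"}),
]
client = Client(active, standby, oss)

data_before = client.read_file("/scratch/run1.dat")
active.alive = False  # simulate failure of the active MDS
data_after = client.read_file("/scratch/run1.dat")
print(data_before == data_after)
```

The point of the sketch is the division of labor: the metadata servers never touch file contents, and the storage servers never see file names, so a client keeps reading the same data when the standby metadata server takes over.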