Loading…
This event has ended. Create your own event → Check it out
This event has ended. Create your own
View analytic
Wednesday, March 2 • 2:30pm - 3:00pm
Disruptive Technology 3: BeeGFS - A Parallel File System to Solve I/O Problems

Sign up or log in to save this to your schedule and see who's attending!

WATCH THE PRESENTATION

With the increasing size of parallel computers and the increasing speed of individual nodes ( CPU+GPU) the challenges for parallel file systems with respect to I/O pattern, bandwidth, latency, robustness and scalability are becoming more obvious. When the first dual core CPU´s hit the market we started to develop a parallel file system from scratch with full scalability for Data & Metadata, ease of use, robustness and high flexibility in mind. As the CPU roadmap was clearly pointing towards many core CPU´s one central development requirement was to follow a strict multithreaded approach to keep the software overhead low and allow the software to run on dedicated servers, on the compute nodes and adapt to new architectures on the rise like ARM and its variants.

Our own test cases for the development of BeeGFS ( former FhGFS) during the past 10 years have been a broad range of O&G codes mostly developed next door.
The paper presents an architectural overview over BeeGFS with special focus on scalability, metadata performance and reliability in large installations. As the BeeGFS server components are efficient multithreaded user space programs, which work on every underlying POSIX file system BeeGFS supports a variety of hardware and software solutions. As a special use case the paper will explain BeeOND: the BeeGFS on demand file system. SSD´s (NVRAM) in every compute node are delivering high speed, low latency I/O. With BeeOND we create a private parallel file system (../myscratch/ ) for very compute job on the corresponding nodes that fully utilizes the NVRAM capabilities and acts as a burst buffer for most of the temporal I/O behavior present in today´s applications. The paper will report about BeeOND O&G use cases and present benchmarks.

As the amount of storage grows data resilience and self-healing capabilities are essential requirements in a storage system. BeeGFS has ist own approach to this topic based on software robustness and its build-in data mirroring capabilities. . The paper shortly cover these HA aspects of BeeGFS and outlines the future BeeGFS roadmap which includes erasure coding as well as a non -POSIXI. The last section of the talk is related to the BeeGFS approach to Exascale.

Speakers
CM

Christian Mohrbacher

Fraunhofer ITWM
Christian Mohrbacher studied computer sciences and afterwards joined Fraunhofer's Competence Center for High Performance Computing in 2008. He is currently part of the parallel file system group, which drives the development of BeeGFS.



Wednesday March 2, 2016 2:30pm - 3:00pm
BioScience Research Collaborative Building (BRC), Room 103

Attendees (8)