High-Performance Computing in the Asia-Pacific Region, International Conference on

Abstract

An important capability of checkpointing and recovery techniques is file checkpointing, i.e., saving and restoring the state of a process's user files. This paper describes the design and implementation of a file checkpointing approach called Modification Operation Buffering. The approach buffers all modification operations issued after one checkpoint until the next, making the operations between two checkpoints atomic as a whole. By dynamically choosing a suitable size for the memory buffer and by hiding the latency of flushing the buffer, the approach achieves lower overhead than other approaches.
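The buffering idea can be illustrated with a minimal Python sketch. It is not the paper's implementation: the class `BufferedFile`, its methods, and the `demo` helper are hypothetical names chosen for illustration, and the dynamic buffer sizing and latency hiding mentioned in the abstract are not modeled. The sketch only shows how deferring writes until a checkpoint lets a rollback discard everything issued since the previous one.

```python
import os
import tempfile


class BufferedFile:
    """Hypothetical wrapper that defers writes until the next checkpoint."""

    def __init__(self, path):
        self.path = path
        # (offset, data) pairs issued since the last checkpoint.
        self.pending = []

    def write(self, offset, data):
        # Record the operation in memory; the file on disk is untouched.
        self.pending.append((offset, bytes(data)))

    def read(self):
        # The process's view: on-disk contents with pending writes applied.
        with open(self.path, "rb") as f:
            view = bytearray(f.read())
        for offset, data in self.pending:
            end = offset + len(data)
            if end > len(view):
                view.extend(b"\0" * (end - len(view)))
            view[offset:end] = data
        return bytes(view)

    def checkpoint(self):
        # Apply every buffered operation in order, then empty the buffer.
        # Assumption: this simple flush is not itself crash-atomic.
        with open(self.path, "r+b") as f:
            for offset, data in self.pending:
                f.seek(offset)
                f.write(data)
            f.flush()
            os.fsync(f.fileno())
        self.pending.clear()

    def rollback(self):
        # Recovery: drop all operations issued after the last checkpoint.
        self.pending.clear()


def demo():
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        with open(path, "wb") as f:
            f.write(b"hello world")
        bf = BufferedFile(path)

        bf.write(0, b"HELLO")
        in_memory = bf.read()            # buffered write is visible to the process
        with open(path, "rb") as f:
            on_disk_before = f.read()    # the file itself is unchanged

        bf.rollback()
        after_rollback = bf.read()       # buffered write has been discarded

        bf.write(6, b"WORLD")
        bf.checkpoint()
        with open(path, "rb") as f:
            on_disk_after = f.read()     # write reaches disk at the checkpoint
        return in_memory, on_disk_before, after_rollback, on_disk_after
    finally:
        os.remove(path)
```

Because the on-disk file always reflects the last checkpoint in this sketch, restoring file state after a failure amounts to discarding the in-memory buffer. A real system would also need to bound the buffer's size and make the flush itself recoverable, which is where the abstract's buffer sizing and latency hiding would come in.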