Extending TCP for Accelerating Replication on Cluster File Systems over SDNs

12/27/2018
by Sungheon Lim, et al.

This paper explores the changes TCP requires to efficiently support cluster file systems such as the Hadoop Distributed File System (HDFS) when the storage nodes are connected through a software-defined network (SDN). Traditional chain replication in these file systems incurs large delays and uses the network inefficiently. An SDN can cooperate with the cluster file system to address these problems by pre-arranging a distribution tree, which opens the possibility of parallel replication. Unfortunately, parallel replication cannot be realized without extending TCP to accommodate parallel transfer at the transport layer. This paper discusses how to extend TCP to make this possible and demonstrates feasibility by implementing a prototype in the Linux kernel. The prototype reduces data replication time by 25% while substantially reducing network use.
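
To make the network-use argument concrete, the following Python sketch counts how many link transmissions one block needs under chain replication versus a pre-arranged distribution tree. It is only an illustration under stated assumptions: the two-rack topology, the node names, and the helper functions are hypothetical and do not come from the paper, and the sketch models link usage only, not the TCP extension itself.

```python
from collections import deque

# Hypothetical two-rack topology (an assumption, not from the paper):
# hosts attach to top-of-rack switches s1 and s2, joined by a core switch.
LINKS = [
    ("client", "s1"), ("d1", "s1"),
    ("d2", "s2"), ("d3", "s2"),
    ("s1", "core"), ("s2", "core"),
]

def build_graph(links):
    """Build an undirected adjacency map from a list of links."""
    graph = {}
    for a, b in links:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph

def shortest_path(graph, src, dst):
    """Return the links on a BFS shortest path from src to dst."""
    parent = {src: None}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            break
        for nxt in sorted(graph[node]):  # sorted for deterministic paths
            if nxt not in parent:
                parent[nxt] = node
                queue.append(nxt)
    path = []
    node = dst
    while parent[node] is not None:
        path.append(frozenset((parent[node], node)))
        node = parent[node]
    return path[::-1]

def chain_link_uses(graph, client, replicas):
    """Chain replication: client -> r1 -> r2 -> ...; every hop resends the block."""
    hops = [client] + replicas
    return sum(len(shortest_path(graph, a, b)) for a, b in zip(hops, hops[1:]))

def tree_link_uses(graph, client, replicas):
    """Distribution tree: each link on the tree carries the block once."""
    used = set()
    for replica in replicas:
        used.update(shortest_path(graph, client, replica))
    return len(used)

graph = build_graph(LINKS)
replicas = ["d1", "d2", "d3"]
print(chain_link_uses(graph, "client", replicas))  # 8
print(tree_link_uses(graph, "client", replicas))   # 6
```

In this assumed topology the chain sends the block over eight links, because the data travels back up from d1 through s1 before crossing to the second rack, while the tree uses each of six links once. The tree's branches can also forward in parallel, which is the opportunity the abstract describes.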
