Leader Confirmation Replication for Millisecond Consensus in Geo-distributed Private Chains
Geo-distributed private chains and databases place higher performance requirements on consistency models. However, with millisecond network latency between nodes, the widely used leader-based SMR models retransmit logs frequently because the leader cannot learn the replication status of logs in time, which costs the leader substantial network and computing resources. To address this problem, we propose a Leader Confirmation based Replication (LCR) model. First, we design the Future Log Replication model, in which followers are responsible for replicating non-transactional logs; it reduces the leader's network load by using a signal log. Second, we design a Generation Re-replication strategy, which ensures the security and consistency of future logs when the number of nodes changes. Finally, we implement LCR-Raft and evaluate it experimentally. The results show that in environments with single-ms network latency, LCR-Raft provides higher TPS (1.5x to 1.9x) and reduces the leader's network traffic by 20%-30%, with acceptable network traffic and CPU cost on followers. In addition, since LCR changes neither the number of leaders nor the leader election process, it has good portability.