Heterogeneous Replica for Query on Cassandra

10/02/2018
by   Jialin Qiao, et al.
0

Cassandra is a popular structured storage system with high-performance, scalability and high availability, and is usually used to store data that has some sortable attributes. When deploying and configuring Cassandra, it is important to design a suitable schema of column families for accelerating the target queries. However, one schema is only suitable for a part of queries, and leaves other queries with high latency. In this paper, we propose a new replica mechanism, called heterogeneous replica, to reduce the query latency greatly while ensuring high write throughput and data recovery. With this replica mechanism, different replica has the same dataset while having different serialization on disk. By implementing the heterogeneous replica mechanism on Cassandra, we show that the read performance of Cassandra can be improved by two orders of magnitude with TPC-H data set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2021

MORTAL: A Tool of Automatically Designing Relational Storage Schemas for Multi-model Data through Reinforcement Learning

Considering relational databases having powerful capabilities in handlin...
research
05/17/2019

High Throughput Push Based Storage Manager

The storage manager, as a key component of the database system, is respo...
research
08/07/2023

A Polystore Architecture Using Knowledge Graphs to Support Queries on Heterogeneous Data Stores

Modern applications commonly need to manage dataset types composed of he...
research
05/20/2021

Interactive Query Formulation using Spider Queries

Effective information disclosure in the context of databases with a larg...
research
08/30/2021

MultiCategory: Multi-model Query Processing Meets Category Theory and Functional Programming

The variety of data is one of the important issues in the era of Big Dat...
research
02/02/2021

Interactive Query Formulation using Point to Point Queries

Effective information disclosure in the context of databases with a larg...
research
10/17/2019

An LSM-based Tuple Compaction Framework for Apache AsterixDB

Document database systems store self-describing records, such as JSON, "...

Please sign up or login with your details

Forgot password? Click here to reset