Graphs cannot be indexed in polynomial time for sub-quadratic time string matching, unless SETH fails

02/03/2020
by Massimo Equi, et al.

We consider the following string matching problem on a node-labeled graph G=(V,E): given a pattern string P, decide whether there exists a path in G whose concatenation of node labels equals P. This is a basic primitive in various problems in bioinformatics, graph databases, and networks. The hardness results of Backurs and Indyk (FOCS 2016) imply that, under the Orthogonal Vectors Hypothesis (OVH), this problem cannot be solved in time polynomially better than O(|E||P|), and this holds even under various restrictions on the graph (Equi et al., ICALP 2019). In this paper we consider its offline version, in which we are allowed to index the graph in order to support time-efficient string matching queries. It was tempting for the string matching community to believe that sub-quadratic time queries could be achieved, e.g. at the cost of high-degree polynomial-time indexing. We disprove this belief, showing that, under OVH, no polynomial-time index can support querying P in time O(|E|^δ|P|^β), with either δ < 1 or β < 1. We prove this tight bound by employing a known self-reducibility technique, used e.g. in the field of dynamic algorithms, which translates conditional lower bounds for an online problem to its offline version. As a side contribution, we formalize this technique with the notion of linear independent-components reduction, allowing for a simple proof of our result. As another illustration of our technique, we also translate the quadratic conditional lower bound of Backurs and Indyk (STOC 2015) for the problem of matching a query string inside a text, under edit distance. We obtain an analogous tight quadratic lower bound for its offline version, improving the recent result of Cohen-Addad, Feuilloley and Starikovskaya (SODA 2019), but with a slightly different boundary condition.
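
To make the online problem concrete, the following is a minimal sketch of the straightforward quadratic-time approach that the O(|E||P|) bound refers to. It is an illustration under stated assumptions, not the paper's construction: each node is assumed to carry a single-character label, a path is assumed to be allowed to revisit nodes, and the names `graph_matches`, `labels`, and `edges` are hypothetical.

```python
def graph_matches(labels, edges, pattern):
    """Decide whether some path in a node-labeled directed graph spells `pattern`.

    Assumptions (not from the source): `labels` maps each node to a
    single-character label, `edges` is a list of directed (u, v) pairs,
    and a path may visit the same node more than once.
    """
    if not pattern:
        return True  # convention chosen here: the empty pattern always matches

    # Build adjacency lists once: O(|V| + |E|).
    adj = {v: [] for v in labels}
    for u, v in edges:
        adj[u].append(v)

    # `current` holds the nodes at which a path spelling the pattern
    # prefix processed so far can end.
    current = {v for v, c in labels.items() if c == pattern[0]}

    # Each pattern character scans the out-edges of the current node set,
    # so the loop costs O(|E|) per character and O(|E||P|) overall.
    for ch in pattern[1:]:
        current = {w for v in current for w in adj[v] if labels[w] == ch}
        if not current:
            return False
    return bool(current)


# Small example: a directed 3-cycle labeled a -> b -> c -> a.
labels = {0: "a", 1: "b", 2: "c"}
edges = [(0, 1), (1, 2), (2, 0)]
print(graph_matches(labels, edges, "abca"))
print(graph_matches(labels, edges, "ac"))
```

The sketch performs no preprocessing that could be reused across patterns; the result stated above says that, under OVH, no polynomial-time index can push the query time to O(|E|^δ|P|^β) with δ < 1 or β < 1.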


research
02/03/2020

Conditional Indexing Lower Bounds Through Self-Reducibility

We provide a general technique to turn a conditional lower bound result ...
research
02/10/2019

On the Complexity of Exact Pattern Matching in Graphs: Determinism and Zig-Zag Matching

Exact pattern matching in labeled graphs is the problem of searching pat...
research
01/16/2019

On the Complexity of Exact Pattern Matching in Graphs: Binary Strings and Bounded Degree

Exact pattern matching in labeled graphs is the problem of searching pat...
research
02/25/2021

Algorithms and Complexity on Indexing Founder Graphs

We study the problem of matching a string in a labeled graph. Previous r...
research
09/11/2021

The Labeled Direct Product Optimally Solves String Problems on Graphs

Suffix trees are an important data structure at the core of optimal solu...
research
12/15/2022

Parameterized Algorithms for String Matching to DAGs: Funnels and Beyond

The problem of String Matching to Labeled Graphs (SMLG) asks to find all...
research
03/09/2023

Elastic Founder Graphs Improved and Enhanced

Indexing labeled graphs for pattern matching is a central challenge of p...
