Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

04/26/2022
by Edoardo Manino, et al.

Metamorphic testing has recently been used to check the safety of neural NLP models. Its main advantage is that it does not rely on a ground truth to generate test cases. However, existing studies are mostly concerned with robustness-like metamorphic relations, limiting the scope of linguistic properties they can test. We propose three new classes of metamorphic relations, which address the properties of systematicity, compositionality and transitivity. Unlike robustness, our relations are defined over multiple source inputs, thus increasing the number of test cases that we can produce by a polynomial factor. With them, we test the internal consistency of state-of-the-art NLP models, and show that they do not always behave according to their expected linguistic properties. Lastly, we introduce a novel graphical notation that efficiently summarises the inner structure of metamorphic relations.
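
To make the idea of a relation over multiple source inputs concrete, here is a minimal Python sketch of a transitivity check. It is an illustration under stated assumptions, not the authors' formal definition: `shares_word` is a hypothetical toy stand-in for an NLP model's pairwise judgement, and `transitivity_violations` is a name introduced here only for this example. The check needs no ground-truth labels; it only compares the model's outputs with each other. With n source inputs, a three-input relation yields n(n-1)(n-2) ordered test cases rather than n, which is one way to read the polynomial increase mentioned above.

```python
from itertools import permutations
from math import perm
from typing import Callable, List, Tuple

# A pairwise judgement such as "a is similar to b" or "a entails b".
Relation = Callable[[str, str], bool]


def shares_word(a: str, b: str) -> bool:
    """Hypothetical toy 'model': two texts are related if they share a word."""
    return bool(set(a.lower().split()) & set(b.lower().split()))


def transitivity_violations(
    model: Relation, sources: List[str]
) -> List[Tuple[str, str, str]]:
    """Return every ordered triple (a, b, c) of distinct sources where the
    model accepts (a, b) and (b, c) but rejects (a, c)."""
    return [
        (a, b, c)
        for a, b, c in permutations(sources, 3)
        if model(a, b) and model(b, c) and not model(a, c)
    ]


sources = ["red apple", "apple pie", "pie chart"]
violations = transitivity_violations(shares_word, sources)

# perm(n, 3) counts the ordered triples of distinct sources that are tested.
print(len(violations), "violations out of", perm(len(sources), 3), "test cases")
# prints: 2 violations out of 6 test cases
```

The toy relation is deliberately non-transitive, so the check flags the two chains that run through "apple pie". A real study would replace `shares_word` with calls to the model under test and choose the relation to match the linguistic property being examined.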

research
10/01/2019

Adaptive Metamorphic Testing with Contextual Bandits

Metamorphic Testing is a software testing paradigm which aims at using n...
research
05/13/2022

AEON: A Method for Automatic Evaluation of NLP Test Cases

Due to the labor-intensive nature of manual test oracle construction, va...
research
04/15/2023

Can Metamorphic Relations be Automatically Discovered?

Metamorphic testing alleviates the test oracle problem by defining oracl...
research
10/14/2022

TestAug: A Framework for Augmenting Capability-based NLP Tests

The recently proposed capability-based NLP testing allows model develope...
research
07/13/2023

Crucible: Graphical Test Cases for Alloy Models

Alloy is a declarative modeling language that is well suited for verifyi...
research
02/11/2023

MTTM: Metamorphic Testing for Textual Content Moderation Software

The exponential growth of social media platforms such as Twitter and Fac...
research
03/08/2021

Testing Autonomous Systems with Believed Equivalence Refinement

Continuous engineering of autonomous driving functions commonly requires...
