Great Models Think Alike: Improving Model Reliability via Inter-Model Latent Agreement

05/02/2023
by Ailin Deng, et al.

Reliable application of machine learning is of primary importance to the practical deployment of deep learning methods. A fundamental challenge is that models are often unreliable due to overconfidence. In this paper, we estimate a model's reliability by measuring the agreement between its latent space and the latent space of a foundation model. However, it is challenging to measure the agreement between two different latent spaces due to their incoherence, e.g., arbitrary rotations and different dimensionality. To overcome this incoherence issue, we design a neighborhood agreement measure between latent spaces and find that this agreement is surprisingly well-correlated with the reliability of a model's predictions. Further, we show that fusing neighborhood agreement into a model's predictive confidence in a post-hoc way significantly improves its reliability. Theoretical analysis and extensive experiments on failure detection across various datasets verify the effectiveness of our method in both in-distribution and out-of-distribution settings.
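
The abstract does not spell out the agreement measure, so the following is only a minimal sketch of one way a neighborhood agreement score could be computed. It assumes the score for a sample is the fraction of its k nearest neighbors (Euclidean) shared between the two latent spaces. The function names `knn_indices`, `neighborhood_agreement`, and `fuse_confidence`, the choice of k, and the multiplicative fusion rule are all hypothetical illustrations, not the authors' definitions. Because only neighbor identities are compared, the score is unaffected by rotations or by a difference in dimensionality between the two spaces.

```python
import numpy as np

def knn_indices(embeddings: np.ndarray, k: int) -> np.ndarray:
    """Return the indices of the k nearest neighbors (Euclidean) of each row."""
    sq_norms = np.sum(embeddings ** 2, axis=1)
    # Squared pairwise distances via ||a||^2 + ||b||^2 - 2 a.b
    dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * embeddings @ embeddings.T
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbor
    return np.argsort(dists, axis=1)[:, :k]

def neighborhood_agreement(z_model: np.ndarray, z_ref: np.ndarray, k: int = 5) -> np.ndarray:
    """Per-sample fraction of k-NN shared between two latent spaces.

    z_model and z_ref hold embeddings of the same samples (same row order)
    but may have different dimensionality.
    """
    nn_model = knn_indices(z_model, k)
    nn_ref = knn_indices(z_ref, k)
    return np.array([len(set(a) & set(b)) / k for a, b in zip(nn_model, nn_ref)])

def fuse_confidence(confidence: np.ndarray, agreement: np.ndarray) -> np.ndarray:
    """Hypothetical post-hoc fusion: scale confidence by the agreement score."""
    return confidence * agreement

# Toy check: a rotated, zero-padded copy of a space has identical neighborhoods.
rng = np.random.default_rng(0)
z = rng.normal(size=(50, 8))
rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))        # random orthogonal matrix
z_rotated_padded = np.hstack([z @ rotation, np.zeros((50, 4))])  # 8 -> 12 dims
scores = neighborhood_agreement(z, z_rotated_padded, k=5)
```

In this toy setting every score should equal 1, since rotation and zero-padding preserve pairwise distances; an unrelated random space would typically score much lower.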

research
06/15/2018

Measuring intergroup agreement and disagreement

This work is motivated by the need to assess the degree of agreement bet...
research
11/11/2016

Improving Reliability of Word Similarity Evaluation by Redesigning Annotation Task and Performance Measure

We suggest a new method for creating and using gold-standard datasets fo...
research
10/10/2019

Rate-Distortion Optimization Guided Autoencoder for Generative Approach with quantitatively measurable latent space

In the generative model approach of machine learning, it is essential to...
research
02/06/2023

Trust, but Verify: Using Self-Supervised Probing to Improve Trustworthiness

Trustworthy machine learning is of primary importance to the practical d...
research
12/15/2022

Reliable Measures of Spread in High Dimensional Latent Spaces

Understanding geometric properties of natural language processing models...
research
07/23/2019

An ordinal measure of interrater absolute agreement

A measure of interrater absolute agreement for ordinal scales is propose...
research
06/26/2023

Inter-Annotator Agreement in the Wild: Uncovering Its Emerging Roles and Considerations in Real-World Scenarios

Inter-Annotator Agreement (IAA) is commonly used as a measure of label c...