Probing Taxonomic and Thematic Embeddings for Taxonomic Information

01/25/2023
by   Filip Klubička, et al.
0

Modelling taxonomic and thematic relatedness is important for building AI with comprehensive natural language understanding. The goal of this paper is to learn more about how taxonomic information is structurally encoded in embeddings. To do this, we design a new hypernym-hyponym probing task and perform a comparative probing study of taxonomic and thematic SGNS and GloVe embeddings. Our experiments indicate that both types of embeddings encode some taxonomic information, but the amount, as well as the geometric properties of the encodings, are independently related to both the encoder architecture, as well as the embedding training data. Specifically, we find that only taxonomic embeddings carry taxonomic information in their norm, which is determined by the underlying distribution in the data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2023

Idioms, Probing and Dangerous Things: Towards Structural Probing for Idiomaticity in Vector Space

The goal of this paper is to learn more about how idiomatic information ...
research
09/21/2022

Representing Affect Information in Word Embeddings

A growing body of research in natural language processing (NLP) and natu...
research
10/21/2022

Probing with Noise: Unpicking the Warp and Weft of Embeddings

Improving our understanding of how information is encoded in vector spac...
research
02/28/2022

Magnitude-aware Probabilistic Speaker Embeddings

Recently, hyperspherical embeddings have established themselves as a dom...
research
09/10/2020

Multi-modal embeddings using multi-task learning for emotion recognition

General embeddings like word2vec, GloVe and ELMo have shown a lot of suc...
research
07/10/2023

Substance or Style: What Does Your Image Embedding Know?

Probes are small networks that predict properties of underlying data fro...
research
02/02/2018

Submodularity-inspired Data Selection for Goal-oriented Chatbot Training based on Sentence Embeddings

Goal-oriented (GO) dialogue systems rely on an initial natural language ...

Please sign up or login with your details

Forgot password? Click here to reset