EmbNum: Semantic labeling for numerical values with deep metric learning

06/26/2018
by   Phuc Nguyen, et al.
0

Semantic labeling is a task of matching unknown data source to labeled data sources. The semantic labels could be properties, classes in knowledge bases or labeled data are manually annotated by domain experts. In this paper, we presentEmbNum, a novel approach to match numerical columns from different table data sources. We use a representation network architecture consisting of triplet network and convolutional neural network to learn a mapping function from numerical columns toa transformed space. In this space, the Euclidean distance can be used to measure "semantic similarity" of two columns. Our experiments onCity-Data and Open-Data demonstrate thatEmbNumachieves considerable improvements in comparison with the state-of-the-art methods in effectiveness and efficiency.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset