GIANT: Scalable Creation of a Web-scale Ontology

by Bang Liu, et al.

Understanding what online users pay attention to is key to content recommendation and search services. These services benefit from a highly structured, web-scale ontology of entities, concepts, events, topics and categories. While existing knowledge bases and taxonomies contain a large volume of entities and categories, we argue that they fail to discover properly grained concepts, events and topics expressed in the language style of the online population, nor do they maintain a logically structured ontology among these notions. In this paper, we present GIANT, a mechanism to construct a user-centered, web-scale, structured ontology containing a large number of natural language phrases that conform to user attention at various granularities, mined from a vast volume of web documents and search click graphs. Various types of edges are also constructed to maintain a hierarchy in the ontology. We present the graph-neural-network-based techniques used in GIANT and evaluate the proposed methods against a variety of baselines. GIANT has produced the Attention Ontology, which has been deployed in various Tencent applications involving over a billion users. Online A/B testing performed on Tencent QQ Browser shows that the Attention Ontology can significantly improve click-through rates in news recommendation.


A User-Centered Concept Mining System for Query and Document Understanding at Tencent

Concepts embody the knowledge of the world and facilitate the cognitive ...

Ontology-driven Event Type Classification in Images

Event classification can add valuable information for semantic search an...

Use of OWL and Semantic Web Technologies at Pinterest

Pinterest is a popular Web application that has over 250 million active ...

CEVO: Comprehensive EVent Ontology Enhancing Cognitive Annotation

While the general analysis of named entities has received substantial re...

LB2CO: A Semantic Ontology Framework for B2C eCommerce Transaction on the Internet

Business ontology can enhance the successful development of complex ente...

Extracting Domain-specific Concepts from Large-scale Linked Open Data

We propose a methodology for extracting concepts for a target domain fro...

MeLinDa: an interlinking framework for the web of data

The web of data consists of data published on the web in such a way that...
