WebChild: Harvesting and Organizing Commonsense Knowledge from the Web - IMT - Institut Mines-Télécom Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

WebChild: Sammeln und Organisieren von Wissen aus dem Internet

WebChild: Harvesting and Organizing Commonsense Knowledge from the Web

Résumé

This paper presents a method for automatically constructing a large commonsense knowledge base, called WebChild 1 , from Web contents. WebChild contains triples that connect nouns with adjectives via fine-grained relations like hasShape, hasTaste, evokesEmotion, etc. The arguments of these assertions, nouns and adjectives, are disambiguated by mapping them onto their proper WordNet senses. Our method is based on semi-supervised Label Propagation over graphs of noisy candidate assertions. We automatically derive seeds from WordNet and by pattern matching from Web text collections. The Label Propagation algorithm provides us with domain sets and range sets for 19 different relations, and with confidence-ranked assertions between WordNet senses. Large-scale experiments demonstrate the high accuracy (more than 80 percent) and coverage (more than four million fine grained disambiguated assertions) of WebChild.
Fichier principal
Vignette du fichier
wsdm2014.pdf (293.22 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01699891 , version 1 (02-02-2018)

Identifiants

Citer

Niket Tandon, Gerard de Melo, Fabian M. Suchanek, Gerhard Weikum. WebChild: Harvesting and Organizing Commonsense Knowledge from the Web. WSDM, Feb 2014, New York, United States. ⟨10.1145/2556195.2556245⟩. ⟨hal-01699891⟩
215 Consultations
441 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More