6533b873fe1ef96bd12d5dcd
RESEARCH PRODUCT
Automatic Taxonomy Induction based on Word-embedding of Neural Nets
Zafar BushraImran AyeshaAsghar MuhammadCochez MichaelHämäläinen Timosubject
sanasemantiikkatekstinlouhintataxonomy inductionneuroverkottiedonlouhintaword-embeddinghyponym-hypernym relationsdescription
Taxonomy is a knowledge management tool that presents useful information in a well-ordered structure prevents overloading of information on its access and making the information access qualitative. This article is concerned with automatically extracting asymmetrical hierarchical relations from a large corpus and subsequent taxonomy construction by domain independent and semi-supervised system. The methodology relies on the term’s distributional semantics. The algorithm utilizes the word-embedding generated from the vector space model. The model is trained over a large corpus to generate word-embedding of each word in a corpus. Then, the system finds and extracts the hypernyms by using the genetic algorithm based on distributional semantics calculations. In the last step, the system adds hyponym-hypernym relations extracted from the string comparison module. Gold Standards taxonomies are used to evaluate the system’s taxonomies for each domain. Our system achieved significant results across each domain. peerReviewed
year | journal | country | edition | language |
---|---|---|---|---|
2018-01-01 |