Abstract:
Taxonomies play a crucial role in organizing knowledge for various natural language processing tasks. Recent advancements in LLMs have opened new avenues for automating taxonomy-related tasks with greater accuracy.
In this paper, we explore the potential of contemporary LLMs in learning, evaluating and predicting taxonomic relations across multiple lexical semantic tasks.
We propose novel method for taxonomy-based instruction dataset creation, encompassing multiple graph relations. With the use of this datasetwe build TaxoLLaMA, a unified model fine-tuned on datasets exclusively based on English WordNet 3.0, designed to handle a wide range of taxonomy-related tasks such as Taxonomy Construction, Hypernym Discovery, Taxonomy Enrichment, and Lexical Entailment. The experimental results demonstrate that TaxoLLaMA achieves state-of-the-art performance on 11 out of 16 tasks and ranked second on 4 other tasks.
We also explore LLM ability for constructed taxonomies graph refinement and present comprehensive ablation study and thorough error analysis supported by both manual and automated techniques.