ReqTagger: A rule-based tagger for automatic Glossary of Terms extraction from ontology requirements
[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] employee
2022
scientific article
english
- Competency Questions
- Ontology Requirements
- Ontology
- Information Extraction
- Part-of-Speech tagging
EN Glossary of Terms extraction from textual requirements is an impor- tant step in ontology engineering methodologies. Although initially it was intended to be performed manually, last years have shown that some degree of automatization is possible. Based on these promising approaches, we introduce a novel, human inter- pretable, rule-based method named ReqTagger, which can extract candidates for ontology entities (classes or instances) and relations (data or object properties) from textual requirements automatically. We compare ReqTagger to existing automatic methods on an evaluation benchmark consisting of over 550 requirements and tagged with over 1700 entities and relations expected to be extracted. We discuss the quality of ReqTagger and provide details showing why it outperforms other methods. We also publish both the evaluation dataset and the implementation of ReqTagger.
23.02.2022
65 - 86
CC BY-NC-ND (attribution - noncommercial - no derivatives)
open journal
final published version
at the time of publication
40
1,1