ANEA: Automated (Named) Entity Annotation for German Domain-Specific Texts

2021-12-13 | conference paper

Original publication

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​ANEA: Automated (Named) Entity Annotation for German Domain-Specific Texts​
Zhukova, A. ; Hamborg, F. & Gipp, B. ​ (2021)
​EEKE 2021 - Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents​

Documents & Media

paper1.pdf2.49 MBUnknown

License

GRO License GRO License

Details

Authors
Zhukova, Anastasia ; Hamborg, Felix; Gipp, Bela 
Abstract
Named entity recognition (NER) is an important task that aims to resolve universal categories of named entities, e.g., persons, locations, organizations, and times. Despite its common and viable use in many use cases, NER is barely applicable in domains where general categories are suboptimal, such as engineering or medicine. To facilitate NER of domain-specific types, we propose ANEA, an automated (named) entity annotator to assist human annotators in creating domain-specific NER corpora for German text collections when given a set of domain-specific texts. In our evaluation, we find that ANEA automatically identifies terms that best represent the texts' content, identifies groups of coherent terms, and extracts and assigns descriptive labels to these groups, i.e., annotates text datasets into the domain (named) entities.
Issue Date
13-December-2021
Conference
EEKE 2021 - Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents
Event start
2021

Reference

Citations