Person-Centric Mining of Historical Newspaper Collections

2016 | book part. A publication with affiliation to the University of Göttingen.

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Coll Ardanuy M, Knauth J, Beliankou A, van den Bos M, Sporleder C. ​Person-Centric Mining of Historical Newspaper Collections​. ​In: Fuhr N, Kovacs L, Risse T, Nejdl W​, editors. ​Research and Advanced Technology for Digital Libraries. ​Cham: ​Springer; ​2016. p. 320​-331​. ​(Lecture Notes in Computer Science;9819). doi:10.1007/978-3-319-43997-6_25. 

Documents & Media

License

GRO License GRO License

Details

Authors
Coll Ardanuy, Mariona; Knauth, Jürgen; Beliankou, Andrei; van den Bos, Maarten; Sporleder, Caroline 
Editors
Fuhr, N.; Kovacs, L.; Risse, T.; Nejdl, W.
Abstract
We present a text mining environment that supports entity-centric mining of terascale historical newspaper collections. Information about entities and their relation to each other is often crucial for historical research. However, most text mining tools provide only very basic support for dealing with entities, typically at most including facilities for entity tagging. Historians, on the other hand, are typically interested in the relations between entities and the contexts in which these are mentioned. In this paper, we focus on person entities. We provide an overview of the tool and describe how person-centric mining can be integrated in a general-purpose text mining environment. We also discuss our approach for automatically extracting person networks from newspaper archives, which includes a novel method for person name disambiguation, which is particularly suited for the newspaper domain and obtains state-of-the-art disambiguation results.
Issue Date
2016
Publisher
Springer
Series
Lecture Notes in Computer Science 
ISBN
978-3-319-43996-9
978-3-319-43997-6
ISSN
0302-9743; 1611-3349
Language
English

Reference

Citations


Social Media