Marcel Bollmann (2012). (Semi-)Automatic Normalization of Historical Texts using Distance Measures and the Norma tool. In: Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2), pp. 3–14. Lisbon, Portugal. [PDF]
Marcel Bollmann, Stefanie Dipper, Julia Krasselt, and Florian Petran (2012). Manual and Semi-automatic Normalization of Historical Spelling – Case Studies from Early New High German. In: Proceedings of the First International Workshop on Language Technology for Historical Text(s) (LThist2012). Vienna, Austria. [PDF]
Marcel Bollmann, Florian Petran, and Stefanie Dipper (2011). Rule-Based Normalization of Historical Texts. In: Proceedings of the RANLP Workshop on Language Technologies for Digital Humanities and Cultural Heritage, pp. 34–42. Hissar, Bulgaria. [PDF]
Marcel Bollmann, Florian Petran, and Stefanie Dipper (2011). Applying Rule-Based Normalization to Different Types of Historical Texts – An Evaluation. In: Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (LTC 2011). Poznan, Poland. [PDF]
A revised version of this paper is available below.
Marcel Bollmann, Florian Petran, and Stefanie Dipper (2015). Applying Rule-Based Normalization to Different Types of Historical Texts – An Evaluation. In: Zygmunt Vetulani and Joseph Mariani (Eds.), Human Language Technology Challenges for Computer Science and Linguistics. 5th Language and Technology Conference, LTC 2011. Revised Selected Papers, pp. 166–177. Lecture Notes in Artificial Intelligence 8387. Springer. (Final publication available at SpringerLink)
Stefanie Dipper, Julia Krasselt, and Simone Schultz-Balluff (2015). Creating synopses of ‘parallel’ historical manuscripts and early prints. Alignment guidelines, evaluation, and applications. In: Historical Corpora, Challenges and Perspectives (= CLIP 5). Tübingen: Narr. [PDF preprint]
Stefanie Dipper and Bettina Schrader (2008). Computing Distance and Relatedness of Medieval Text Variants from German. In: Angelika Storrer, Alexander Geyken, Alexander Siebert, and Kay-Michael Würzner (Eds.), Text Resources and Lexical Knowledge. Proceedings of KONVENS-08 (Konferenz zur Verarbeitung natürlicher Sprache), pp. 39–51. Berlin: Mouton de Gruyter. [PDF preprint]
Stefanie Dipper and Simone Schultz-Balluff (2013). The Anselm Corpus: Methods and Perspectives of a Parallel Aligned Corpus. In: Proceedings of the NODALIDA Workshop on Computational Historical Linguistics. Oslo, Norway. [PDF]
Julia Krasselt, Marcel Bollmann, Stefanie Dipper, and Florian Petran (2015). Guidelines für die Normalisierung historischer deutscher Texte / Guidelines for Normalizing Historical German Texts. Bochumer Linguistische Arbeitsberichte: 15. [PDF]
Florian Petran (2012a). Aligning the un-alignable – a pilot study using a noisy corpus of nonstandardized, semi-parallel texts. In: Alexander Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing, Vol. 2. Berlin/Heidelberg: Springer. [PDF preprint] (Final publication available at SpringerLink.)
Florian Petran (2012b). Segmentation of Historical Texts: Sentences or Chunks? In: Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2). Lisbon, Portugal. [PDF]
Simone Schultz-Balluff and Stefanie Dipper (2013). St. Anselmi Fragen an Maria – Schritte zu einer (digitalen) Erschließung, Auswertung und Edition der gesamten deutschsprachigen Überlieferung (14.-16. Jh.). In: Anne Bohnenkamp et al. (Eds.), Medienwandel/Medienwechsel in der Editionswissenschaft. 13. internationale Tagung der Arbeitsgemeinschaft für germanistische Edition 17.-20. Februar 2010, Beihefte zu editio, pp. 173–191.
Poster presentations
Marcel Bollmann, Stefanie Dipper, Julia Krasselt, and Florian Petran (2012). The Anselm Project: Tools for Automatic Analysis of a Parallel Corpus in Early New High German. 34. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft (DGfS). Frankfurt, Germany. [Poster]