Mats Wirén Photo: C Stensson

Mats Wirén


Visa sidan på svenska
Works at Department of Linguistics
Telephone 08-16 12 44
Visiting address Universitetsvägen 10 C, plan 2-3
Room C 361
Postal address Institutionen för lingvistik 106 91 Stockholm

About me

Associate Professor
Head of the Section for Computational Linguistics




Corpus-based Methods, LIM024, 7.5 ECTS credits [in English]
Mathematical Methods for Linguists, LIN420, 7.5 ECTS credits [in Swedish]
Independent Project for the Degree of Master, LIM050, 15 ECTS credits [in English]



A selection from Stockholm University publication database
  • 2018. Adam Ek (et al.). 11th edition of the Language Resources and Evaluation Conference

    This paper describes an approach to identifying speakers and addressees in dialogues extracted from literary fiction, along with a dataset annotated for speaker and addressee. The overall purpose of this is to provide annotation of dialogue interaction between characters in literary corpora in order to allow for enriched search facilities and construction of social networks from the corpora. To predict speakers and addressees in a dialogue, we use a sequence labeling approach applied to a given set of characters. We use features relating to the current dialogue, the preceding narrative, and the complete preceding context. The results indicate that even with a small amount of training data, it is possible to build a fairly accurate classifier for speaker and addressee identification across different authors, though the identification of addressees is the more difficult task.

  • 2018. Dan Rosén, Mats Wirén, Elena Volodina. Proceedings of CLARIN Annual Conference 2018, 181-184

    Error coding of second-language learner text, that is, detecting, correcting and annotating errors, is a cumbersome task which in turn requires interpretation of the text to decide what the errors are. This paper describes a system with which the annotator corrects the learner text by editing it prior to the actual error annotation. During the editing, the system automatically generates a parallel corpus of the learner and corrected texts. Based on this, the work of the annotator consists of three independent tasks that are otherwise often conflated in error coding: correcting the learner text, repairing inconsistent alignments, and performing the actual error annotation.

Show all publications by Mats Wirén at Stockholm University

Last updated: January 15, 2019

Bookmark and share Tell a friend