Profiles

Kristina Nilsson Björkenstam

Kristina Nilsson Björkenstam

Studierektor

Visa sidan på svenska
Works at Department of Linguistics
Telephone 08-16 17 61
Email kristina.nilsson@ling.su.se
Visiting address Universitetsvägen 10 C, plan 2-3
Room C 366
Postal address Institutionen för lingvistik 106 91 Stockholm

About me

Researcher in Computational Linguistics.

Director of Studies for General Linguistics, Phonetics and Computational Linguistics (first and second level). Co-ordinating Director of Studies for Linguistics and Sign Language.

Research

Projects:
Workshops:

Publications

A selection from Stockholm University publication database
  • 2018. Paul Ibbotson, Rose M. Hartman, Kristina Nilsson Björkenstam. Language, Cognition and Neuroscience 33 (6), 1-15

    We present an open-access analytic tool, which allows researchers to simultaneously control for and combine language data from the child, the caregiver, multiple languages, and across multiple time points to make inferences about the social and cognitive factors driving the shape of language development. We demonstrate how the tool works in three domains of language learning and across six languages. The results demonstrate the usefulness of this approach as well as providing deeper insight into three areas of language production and acquisition: egocentric language use, the learnability of nouns versus verbs, and imageability. We have made the Frequency Filter tool freely available as an R-package for other researchers to use at https://github.com/rosemm/FrequencyFilter.

  • 2017. Sofia Strömbergsson (et al.). Proceedings of the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), Stockholm: The International Speech Communication Association (ISCA), 2017., 2214-2217

    Child-directed spoken data is the ideal source of support for claims about children’s linguistic environments. However, phonological transcriptions of child-directed speech are scarce,compared to sources like adult-directed speech or text data. Acquiring reliable descriptions of children’s phonological environments from more readily accessible sources would mean considerable savings of time and money. The first step towards this goal is to quantify the reliability of descriptions derived from such secondary sources. We investigate how phonological distributions vary across different modalities (spoken vs. written), and across the age of the intended audience (children vs. adults). Using a previously unseen collection of Swedish adult- and child-directed spoken and written data, we combine lexicon look-up and grapheme-to-phonemeconversion to approximate phonological characteristics. The analysis shows distributional differences across datasets both for single phonemes and for longer phoneme sequences. Some of these are predictably attributed to lexical and contextual characteristics of text vs. speech.The generated phonological transcriptions are remarkably reliable. The differences in phonological distributions between child-directed speech and secondary sources highlight a need for compensatory measures when relying on written data or onadult-directed spoken data, and/or for continued collection ofactual child-directed speech in research on children’s language environments.

  • 2017. Kristina Nilsson Björkenstam, Gintaré Grigonyté. Språktidningen (2), 24-27
  • 2016. Kristina Nilsson Björkenstam, Mats Wirén, Robert Östling. The 54th Annual Meeting of the Association for Computational Linguistics, 82-90

    How do infants learn the meanings of their first words? This study investigates the informativeness and temporal dynamics of non-verbal cues that signal the speaker's referent in a model of early word–referent mapping. To measure the information provided by such cues, a supervised classifier is trained on information extracted from a multimodally annotated corpus of 18 videos of parent–child interaction with three children aged 7 to 33 months. Contradicting previous research, we find that gaze is the single most informative cue, and we show that this finding can be attributed to our fine-grained temporal annotation. We also find that offsetting the timing of the non-verbal cues reduces accuracy, especially if the offset is negative. This is in line with previous research, and suggests that synchrony between verbal and non-verbal cues is important if they are to be perceived as causally related.

  • 2014. Kristina Nilsson Björkenstam, Sofia Gustafson Capková, Mats Wirén. Strindberg on International Stages/Strindberg in Translation

    We have approached the works of August Strindberg from  a computational linguistic point of view, resulting in The Stockholm University Strindberg Corpus, consisting of seven of Strindberg's autobiographical works with linguistic annotation. The corpus is freely available for research. We use this corpus for three quantitative studies of Strindberg’s work: in the first, we describe the novels included in the corpus by keywords; in the second, we compare Strindberg’s use of emotionally charged words with selected prose of both his contemporaries and present-day authors; in the third, we explore the semantic prosody of KVINNA (“woman”) and MAN (“man”).

  • Article SUC-CORE
    2013. Kristina Nilsson Björkenstam. Northern European Journal of Language Technology (NEJLT) 3 (2), 19-39

    This paper describes SUC-CORE, a subset of the Stockholm Umeå Corpus and the Swedish Treebank annotated with noun phrase coreference. While most coreference annotated corpora consist of texts of similar types within related domains, SUC-CORE consists of both informative and imaginative prose and covers a wide range of literary genres and domains.This allows for exploration of coreference across different text types, but it also means that there are limited amounts of data within each type. Future work on coreference resolution for Swedish should include making more annotated data available for the research community.

Show all publications by Kristina Nilsson Björkenstam at Stockholm University

Last updated: June 7, 2018

Bookmark and share Tell a friend