Corpus-Based Methods

7.5 credits

This course deals with corpus-based methods, that is, the large-scale study of written text, or spoken or signed utterances.

Contents: Data, methods and evidence in different linguistic traditions. Quantitative properties of language, frequencies, n-grams. Data collection for different types of corpora (including traditional sample corpora, monitor corpora and web corpora) and modalities (text, speech, signing). Representation of corpora in XML. Overview of computational linguistic methods for automatic segmentation and annotation of text, including tokenisation, part-of-speech tagging and syntactic analysis. Searching corpora using regular expressions. Analysis of corpora based on occurrences and co-occurrences. Relationship between corpus material and research questions. Ethics, copyright, licenses.

Course structure

Teaching format

The course consists of lectures and lab sessions.

Assessment

The course is examined through written exams and reports.

Examiner

Beata Megyesi
Schedule
The schedule will be available no later than one month before the start of the course. We do not recommend print-outs as changes can occur. At the start of the course, your department will advise where you can find your schedule during the course.
- Schedule LIM024 AU 2023
Course literature
Note that the course literature can be changed up to two months before the start of the course.
- Reading list LIM024 - valid from Autumn 2021
See syllabus archive for all reading lists
Course reports
- Autumn 2023: Course_report_LIM024_AU23.pdf
- Autumn 2022: Course_report_LIM024_AU22.pdf
- Course report LIM024 AU 2020
Contact
For more contact details, see our education pages:

Contact information for student affairs
Department of Linguistics
Student Office
exp@ling.su.se

08-16 23 46
Visiting address

C378

Office hours

Tuesdays 9–10
Wednesdays 13–15
Thursdays 13–16

Phone hours

Tuesdays 9–10
Wednesdays 13–15
Thursdays 9–11 and 13–16
