2024-2025 / INFO0943-1

Textual corpora : analytical principles

Duration

30h Th

Number of credits

 Master en langues et lettres anciennes et modernes, à finalité approfondie5 crédits 
 Master in ancient languages and literatures : classics, research focus5 crédits 
 Master in ancient languages and literatures : Oriental studies, research focus5 crédits 
 Master in ancient languages and literatures : classics, teaching focus5 crédits 
 Master in ancient and modern languages and literatures, teaching focus5 crédits 
 Master en linguistique, à finalité spécialisée en analyse des données textuelles5 crédits 
 Master en langues et lettres anciennes et modernes, à finalité spécialisée en édition et métiers du livre5 crédits 
 Master en langues et lettres anciennes, orientation classiques, à finalité spécialisée en édition et métiers du livre5 crédits 
 Master in ancient languages and literatures : Oriental studies, professional focus in languages and civilisation of Far East : China-Japan5 crédits 
 Master in linguistics, professional focus in word processing and analysis of textual data (joint degree programme) (Double diplomation)5 crédits 
 Master en langues et lettres anciennes, orientation orientales, à finalité spécialisée en langues, cultures et sociétés de l'Asie orientale : Chine-Japon5 crédits 
 Master en langues et lettres anciennes et modernes5 crédits 
 Master in ancient languages and literatures : classics (60 ECTS)5 crédits 

Lecturer

Dominique Longrée, Julien Perrez

Language(s) of instruction

French language

Organisation and examination

Teaching in the first semester, review in January

Schedule

Schedule online

Units courses prerequisite and corequisite

Prerequisite or corequisite units are presented within each program

Learning unit contents

Introduction to the principles of collecting and annotating textual corpora: historical overview of the development of corpus linguistics, definition of key concepts, discussion of different methods of corpus annotation (metadata, lemmatization, part-of-speech tagging,...), using as well semi-automatic as fully autpmatic tools and methods, and presentation of techniques making it possible to transform a corpus into a textual database; data mining and analysis.  

Learning outcomes of the learning unit

The main objective of this course is to introduce the students 1st year students of the Master's degree in Linguistics (à finalité spécialisée en Traitement automatique des textes et analyse statistique des données textuelles) to principles of constitution and preparation of corpora or textual databases, so as to make it possible for them to to integrate these principles and techniques into further disciplinary research.

Prerequisite knowledge and skills

None.

Planned learning activities and teaching methods

Lectures and practical exercises

Mode of delivery (face to face, distance learning, hybrid learning)

1th semester

Course materials and recommended or required readings

Exam(s) in session

Any session

- In-person

written exam ( open-ended questions ) AND oral exam

- Remote

written exam ( open-ended questions ) AND oral exam

Written work / report


Additional information:

Written and oral examination.

Work placement(s)

Organisational remarks and main changes to the course

1h taeching with J. Perrez et 1h teaching with D. Longrée

Contacts

Julien.Perrez@ulg.ac.be
dominique.longree@ulg.ac.be

Association of one or more MOOCs

Items online

eCampus course
eCampus course