Duration
30h Th
Number of credits
Lecturer
Language(s) of instruction
French language
Organisation and examination
Teaching in the first semester, review in January
Schedule
Units courses prerequisite and corequisite
Prerequisite or corequisite units are presented within each program
Learning unit contents
Introduction to the principles of collecting and annotating textual corpora: historical overview of the development of corpus linguistics, definition of key concepts, discussion of different methods of corpus annotation (metadata, lemmatization, part-of-speech tagging,...), using as well semi-automatic as fully autpmatic tools and methods, and presentation of techniques making it possible to transform a corpus into a textual database; data mining and analysis.
Learning outcomes of the learning unit
The main objective of this course is to introduce the students 1st year students of the Master's degree in Linguistics (à finalité spécialisée en Traitement automatique des textes et analyse statistique des données textuelles) to principles of constitution and preparation of corpora or textual databases, so as to make it possible for them to to integrate these principles and techniques into further disciplinary research.
Prerequisite knowledge and skills
None.
Planned learning activities and teaching methods
Lectures and practical exercises
Mode of delivery (face to face, distance learning, hybrid learning)
1th semester
Course materials and recommended or required readings
Exam(s) in session
Any session
- In-person
written exam ( open-ended questions ) AND oral exam
- Remote
written exam ( open-ended questions ) AND oral exam
Written work / report
Additional information:
Written and oral examination.
Work placement(s)
Organisational remarks and main changes to the course
1h taeching with J. Perrez et 1h teaching with D. Longrée
Contacts
Association of one or more MOOCs
Items online
eCampus course
eCampus course