Voraussetzungen
For this class, you will need to bring your laptop. Familiarity with Python is essential. You will also need to have installed Microsoft Visual Code and a Python IDE (such as Spyder or PyCharm, both of which can be accessed on Anaconda).
Offizielle Kursbeschreibung
This exercise seminar provides an introduction to creating a gold-standard dataset involving preprocessing data, manual and automatic annotation of text for features such as part-of-speech and lemma, evaluating data formats such as TSV and XML/TEI and checking and correcting the annotations using taggers such as SpaCy. In this class, we will annotate a medical text, the Surgery by Guy de Chauliac, written in Old French in the 12th century using the annotation tool, INCEpTION.
Prior knowledge of Old French and other ancient languages is not required. Knowledge of modern French is useful but also not required. Resources such as dictionaries, grammar books (in English), lemma lists and an English translation of the text will be provided on Moodle.
The class is organised in association with the Knowledge Networks in Medieval Romance Speaking Europe (ALMA) project, based at the Heidelberg Academy of Sciences and Humanities and Heidelberg University. You can find more information about the project here: https://www.hadw-bw.de/en/research/research-center/knowledge-networks-medieval-romance-speaking-europe-alma
Exam:
To pass this course, you will need to complete the group assignments throughout the semester, along with a final exam/paper. More details will be announced in the first session.
Resources for learning Python:
The "Python for Everybody" specialisation by Charles Severance on Coursera or other learning platforms. The first two courses, Programming for Everybody (Getting Started with Python) and Python Data Structures are sufficient.
Introduction to programming in Python: https://swcarpentry.github.io/python-novice-gapminder/
Online-Angebote
moodle
Preconditions
For this class, you will need to bring your laptop. Familiarity with Python is essential. You will also need to have installed Microsoft Visual Code and a Python IDE (such as Spyder or PyCharm, both of which can be accessed on Anaconda).
Official Course Description
This exercise seminar provides an introduction to creating a gold-standard dataset involving preprocessing data, manual and automatic annotation of text for features such as part-of-speech and lemma, evaluating data formats such as TSV and XML/TEI and checking and correcting the annotations using taggers such as SpaCy. In this class, we will annotate a medical text, the Surgery by Guy de Chauliac, written in Old French in the 12th century using the annotation tool, INCEpTION.
Prior knowledge of Old French and other ancient languages is not required. Knowledge of modern French is useful but also not required. Resources such as dictionaries, grammar books (in English), lemma lists and an English translation of the text will be provided on Moodle.
The class is organised in association with the Knowledge Networks in Medieval Romance Speaking Europe (ALMA) project, based at the Heidelberg Academy of Sciences and Humanities and Heidelberg University. You can find more information about the project here:
[url]https://www.hadw-bw.de/en/research/research-center/knowledge-networks-medieval-romance-speaking-europe-alma[/url]
[b]Exam:[/b]
To pass this course, you will need to complete the group assignments throughout the semester, along with a final exam/paper. More details will be announced in the first session.
[b]Resources for learning Python:[/b]
The "Python for Everybody" specialisation by Charles Severance on Coursera or other learning platforms. The first two courses, Programming for Everybody (Getting Started with Python) and Python Data Structures are sufficient.
Introduction to programming in Python: [url]https://swcarpentry.github.io/python-novice-gapminder/[/url]
- Lehrende: Ragini Menon