This course introduces basic methods for extraction of text contents from different file formats, discusses possibilities of their annotation on different linguistic levels and shows a basic analysis in various corpus search engines. The course also covers the basics of processing of spoken and parallel corpora.
The course gives an introduction to programming in Python, fundamental methods of automatic processing of text data, and foundations of statistical language processing. The course is aimed at linguists.
- Teacher: Petra Bago
The course gives an introduction to different types of lexicons.
- Teacher: Nives Mikelic Preradovic
Today’s localization industry is faced with the task of translating a huge volume of texts to produce high-quality localized products in a short turnaround time, to satisfy the needs of global and local marketplaces. Such an undertaking would be inconceivable without the use of technology and, in recent decades, the development of localization tools has been instrumental and given rise to changes in collaborative workflows. This course aims to equip learners with a critical understanding towards the use of these tools, namely computer assisted translation (CAT) tools.