AI4DH Spring Workshops: Registration Now Open
In April and May 2026, two workshops will be offered for digital humanities and social sciences scholars who want to deepen their knowledge of AI and apply it in their research.
All seminars and workshops are free of charge.
Practical text mining in Orange
- Description: The workshop offers a practical demonstration of a workflow for text clustering. During the workshop, we will become familiar with the basic preprocessing of textual data and the transformation of documents into a vector space. We will implement a simple workflow for clustering texts and then interactively explore and interpret the results.
- Date and time: 13 April 2026, 11:00 – 15:00
- Location: Faculty of Computer and Information Science (Večna pot 113, 1000 Ljubljana), Lecture room P03
- Instructor: Dr Ajda Pretnar Žagar
- Duration: 2 x 90 minutes
- Difficulty: beginner
- Language of the workshop: Slovenian (or English in case of foreign applicants)
- Outcomes/skills gained:
  - Knows the terminology of text mining.
  - Performs preprocessing of textual data.
  - Performs profiling or vectorisation of texts.
  - Builds a simple workflow for text clustering.
  - Ability to build a pipeline for data preprocessing.
  - Ability to transform texts into a vector representation.
  - Ability to build a workflow for clustering.
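For readers who want a feel for the steps named in the description (preprocessing, transformation into a vector space, clustering), here is a minimal plain-Python sketch. It is not the Orange workflow used in the workshop. The stopword list, the TF-IDF weighting, the cosine threshold of 0.2, and the function names are all illustrative assumptions.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not a standard resource).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "on", "is", "to"}


def preprocess(text):
    """Lowercase the text, keep alphabetic tokens, drop stopwords."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


def tfidf_vectors(token_lists):
    """Map each tokenized document to a sparse TF-IDF vector (a dict)."""
    n = len(token_lists)
    doc_freq = Counter(t for tokens in token_lists for t in set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        vectors.append({
            # Smoothed inverse document frequency, so no weight is zero.
            t: c * (math.log((1 + n) / (1 + doc_freq[t])) + 1)
            for t, c in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(vectors, threshold=0.2):
    """Single-link clustering: merge documents whose similarity >= threshold."""
    labels = list(range(len(vectors)))
    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if cosine(vectors[i], vectors[j]) >= threshold:
                old, new = labels[j], labels[i]
                labels = [new if lab == old else lab for lab in labels]
    return labels


docs = [
    "The cat sat on the mat",
    "A cat and a kitten on a mat",
    "Stock markets fell sharply",
    "Markets and stock prices fell",
]
labels = cluster(tfidf_vectors([preprocess(d) for d in docs]))
print(labels)
```

Documents that receive the same label belong to the same cluster. Changing the threshold changes how readily documents are merged.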
Information retrieval from the Internet
- Description: An enormous amount of data is published online, and large companies collect it to build large language models. For a specific analysis, however, we may need data from only a particular part of the internet, so it is important to know how to obtain it as efficiently as possible. We will explore how data is structured on the web and look at tools that automate the crawling of websites and the extraction of data from them. For more advanced users, we will also demonstrate how to use similar tools programmatically.
- Date and time: 25 May 2026, 11:00 – 15:00
- Location: Faculty of Computer and Information Science (Večna pot 113, 1000 Ljubljana), Lecture room P03
- Instructor: Dr Slavko Žitnik
- Duration: 2 x 90 minutes
- Difficulty: beginner to intermediate. No prior knowledge is required, but a basic understanding of website structure and/or programming in Python is desirable.
- Language of the workshop: Slovenian (or English in case of foreign applicants)
- Outcomes/skills gained:
  - Knows the types and formats in which content is presented on websites.
  - Understands the concept of web scraping.
  - Knows and uses tools for the automated extraction of data from the web.
  - Builds a simple workflow for obtaining data from the web.
  - Ability to assess the possibilities of obtaining data from the web.
  - Ability to collect data from the web.
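As a rough idea of what extracting data from a web page through programming can look like, here is a minimal sketch that uses only Python's standard `html.parser` module. It is not the workshop's material or toolset. The class name, the `extract` helper, and the sample HTML are hypothetical, and the page is supplied as a string rather than downloaded.

```python
from html.parser import HTMLParser

HEADING_TAGS = ("h1", "h2", "h3")


class LinkAndHeadingExtractor(HTMLParser):
    """Collects link targets and heading texts from an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.headings = []
        self._in_heading = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) pairs.
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
        elif tag in HEADING_TAGS:
            self._in_heading = True
            self._buffer = []

    def handle_data(self, data):
        # Keep text only while inside a heading element.
        if self._in_heading:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag in HEADING_TAGS and self._in_heading:
            self.headings.append("".join(self._buffer).strip())
            self._in_heading = False


def extract(html):
    """Return (headings, links) found in the given HTML string."""
    parser = LinkAndHeadingExtractor()
    parser.feed(html)
    parser.close()
    return parser.headings, parser.links


# Hypothetical sample page, embedded so the sketch runs offline.
SAMPLE = (
    "<html><body><h1>News</h1>"
    "<p>See <a href='/a.html'>A</a> and "
    "<a href='https://example.org/b'>B</a>.</p>"
    "<h2>More <a href='/c'>items</a></h2></body></html>"
)

headings, links = extract(SAMPLE)
print(headings)
print(links)
```

In practice the HTML would come from a downloaded page. Before collecting data from a real site, it is worth checking its terms of use and its robots.txt file.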