Skip to content

AI4DH Workshop: Information retrieval from the internet

On Monday, 25 May 2026, we hosted a workshop for digital humanities researchers led by Dr Slavko Žitnik.

An enormous amount of data is published online. However, for specific analyses we may only need data from a particular part of the internet, so it is important to know how to obtain such data as efficiently as possible. The workshop provided a hands-on introduction to the ways in which data is structured on the web as well as tools that enable the automatic crawling and extraction of data from websites. For more advanced users, the workshop also demonstrated the possibility of using similar tools through programming. 

We invite you not to miss the third spring workshop, Visual Network Analysis for Humanities Research: Extracting Insights from Historical and Literary Corpora, which will take place on 15 June 2026. More information is available at this link