Centre for Digital Humanities


UB & CDH Workshop: Working with large text corpora in I-Analyzer for humanities staff

Event details

04 October 2022
13:00 - 16:00
Digital Humanities Workspace - Room 0.32
Drift 27 (Room 0.32), Utrecht, 3512 BR

This joint workshop by the Utrecht University Library and the Centre for Digital Humanities will give an overview of how text corpora are processed in order to be used in I-Analyzer, and how to search, filter and visualize results.

I-Analyzer, developed by the Digital Humanities Lab, is an online text and data mining application that combines online availability and ease of use with flexibility. This application allows researchers to quickly hook up their own corpus and/or dataset.

In this workshop developers Mees van Stiphout and Jelte van Boheemen (Digital Humanities Lab) will first give an overview of how you process text corpora to be used in I-Analyzer. Next, Information- and Collection Specialist Ruben Schalk (Utrecht University Library), will demonstrate an open-source pipeline for importing, analyzing and geotagging subsets of I-Analyzer results. After this, they will dive into a hands-on session on how to search, filter and visualize data.

Using I-Analyzer, researchers can perform full text searches, filter, create visualizations and export subsets for further research in the following sources:

  • Dutch newspaper collection, Royal Library (Delpher)
  • ECCO (Eighteenth Century Collections Online)
  • The Times, newspaper archives 1785-2010
  • The Guardian-Observer, archive 1791-2003
  • Periodicals, archive 19th century
  • The Throne Speech, 1814-2018
  • Hebrew epigraph collection
  • Annual reports (1957 – 2008) of 304 companies

It might be possible to add a document collection of your choice in I-Analyzer. If you are interested, contact cdh@uu.nl to discuss the options. 

Admission is free, but the number of participants is limited, so please register as soon as possible (first come, first served). If you are unable to attend, please cancel your registration by sending an email to CDH@uu.nl, so another participant can take your place.

Please note: the Centre for Digital Humanities aims to promote digital literacy amongst staff-members and therefore compensates attendance to these courses in terms of DCU (22 hours = 1 DCU). This course (including preparation) counts for 4 hours, which is equal to 0,18 DCU. The DCU’s will be automatically settled with your department at the end of the course.