UB & CDH Workshop: Working with large text corpora in I-Analyzer – introduction workshop for humanities staff
This entry-level workshop, jointly organized by the University Library and the Centre for Digital Humanities, aims to provide participants with an overview of the functionalities and corpora available in I-Analyzer, an intuitive online text and data mining application for exploring large collections of texts, developed and maintained in-house.
I-Analyzer is a user-friendly online tool designed for exploring large collections of texts. Researchers can utilize this application to easily perform full-text searches, filter data, and export subsets of datasets for further analysis using other programs. The software also enables users to generate various visualizations, such as Ngrams and word models, allowing for a deeper understanding and presentation of broader trends within the corpus. Additionally, users have the option to upload their own corpora and datasets into the tool. Developed in-house by the CDH Research Software Lab, I-Analyzer is meticulously maintained, regularly improved, and continuously expanded to meet the evolving needs of researchers.
In this interactive workshop, developers from the RSLab will provide an overview of the pre-processing steps required to prepare text corpora for analysis in I-Analyzer. The DH Team of the University Library will showcase the available corpora within the tool. Following these presentations, participants will have the opportunity to actively engage with the tool by conducting their own searches, filtering data, and generating visualizations.
If you are interested in having your own corpus added to I-Analyzer, either for general use or specifically for this workshop, please feel free to contact us at email@example.com to discuss the possibilities.
Currently available corpora in I-Analyzer
- Dutch newspaper collection, Royal Library (Delpher)
- ECCO (Eighteenth Century Collections Online)
- The Times, newspaper archives 1785-2010
- The Guardian-Observer, archive 1791-2003
- Periodicals, archive 19th century
- Speeches by Dutch monarchs (Troonredes), 1814-2018
- Hebrew epigraph collection
- Dutch annual reports (1957 – 2008) of 304 companies
- Open data of (anonymised) court rulings of the Dutch judicial system
In order to ensure the viability of the training and allocate the necessary resources, we require a minimum of ten registrations for the course to proceed. We will inform all registered participants of the status at the latest two weeks in advance, confirming whether the course will definitively take place.
If you are a teacher at the Faculty of Humanities, the CDH will automatically settle a DCU compensation of 4 hours with your department after completion, which equals 0,18 DCUs.
To secure your spot, we encourage you to register as soon as possible. Registrations will be processed on a first-come, first-served basis. In the event that you are unable to attend the workshop, we kindly request that you cancel your registration by sending an email to CDH@uu.nl. This will allow us to offer the vacant spot to another interested participant. Thank you for your cooperation.