Corpora
Datahub SSH
AnnCor and Multiword Expression Identifier
The central goal of this project is to create a Multiword (e.g. De plaat poetsen.) Expression Identifier for Dutch (MWEIDD) and enrich various Dutch text corpora with annotations based on this Identifier.
Digital methods
Datahub SSH
Visual pattern recognition and (meta) data extraction
The project created a workbench that can be used for computationally analyzing large datasets of visual material. It is designed to cater to the needs of researchers in a range of studies who work with digitized visual material.
Text mining
Datahub SSH
Footprinter, a tool for the analysis of the Oceans of Light corpus
Footprinter is designed to discover the Qur’an citations in this corpus of legal texts.
Corpora
Datahub SSH
Historical Newspapers
During the Historical Newspapers project, a large corpora of newspaper articles was aggregated, along with tools for sentiment analysis and spatial analysis.