Corpora
The U-Blad corpus now available in I-Analyzer
The CDH Research Software Lab has added all print editions of the U-Blad to the online text and data mining tool I-Analyzer. The U-Blad served as Utrecht University’s (UU) independent magazine from 1969 to 2010. Launched on 5 September 1969, under the title U utrechtse universitaire reflexen, it was later renamed the U-Blad in 1974….
Read moreMODIFED Workshop
20 June 2024 – 21 June 2024 @ All Day – The Re-examining Dialect Syntax Network (REEDs) and Leiden University Centre for Digital Humanities (LUCDH) are organizing a Morphosyntactic Dialect Feature Detection (MODIFED) workshop on Thursday 20 June and Friday 21 …
Read morePeople & Parliament
A digital toolset for text- and datamining and natural language processing for a political-historical research project.
Read moreMind your Manner Adverbials!
A linguistic database housing manner adverbials from 15 languages and dialects.
Read moreDutch Dialect Idioms database
The ‘Dutch Dialect Idioms’ database is an online database of idioms in 13 Dutch dialects, publicly accessible and easily searchable.
Read moreLooking for patterns in readers’ reviews of translated books using DIOPTRA-L
What do English readers expect from translated literature? And how about the Dutch? Professor Haidee Kotze and her research team used big data to look for patterns in the way that ordinary readers review translated literature. Together with the Digital Humanities Lab they developed DIOPTRA-L, a corpus with 280.000 reviews of more than 150 books…
Read moreThe Digital Humanities Lab develops digital toolset for major European research project ‘People and Parliament’
The Utrecht Digital Humanities Lab will collaborate with the University of Jyväskylä (Finland) on the political-historical research project ‘People and Parliament’. This collaboration between software developers and historians enables groundbreaking research into parliamentary data. ‘People and Parliament’ is an ambitious project. The research focuses on the use of political language in the national parliaments of…
Read moreAnnCor and Multiword Expression Identifier
The central goal of this project is to create a Multiword (e.g. De plaat poetsen.) Expression Identifier for Dutch (MWEIDD) and enrich various Dutch text corpora with annotations based on this Identifier.
Read moreHistorical Newspapers
During the Historical Newspapers project, a large corpora of newspaper articles was aggregated, along with tools for sentiment analysis and spatial analysis.
Read more