Centre for Digital Humanities

Digital tools

Numerous digital humanities tools are publicly available to support research. Below you will find a list of commonly used tools. The list is not complete, but it offers a starting point.

Text analysis

  • AntConc is a freeware corpus analysis toolkit for concordancing and text analysis.
  • I-Analyzer is an online text exploration tool that allows researchers to easily perform searches within extensive text corpora, visualize results, and download text and metadata for further analysis. I-Analyzer is developed and maintained by the CDH Research Software Lab.
  • Voyant is an open-source, web-based application for text analysis.
  • Constellate is a cloud-based lab where you can perform text analysis in Jupyter notebooks on scholarly and primary source content from JSTOR, Portico, and partners.

Data cleaning

  • OpenRefine is an open-source, easy-to-use tool for cleaning, structuring, transforming, and enriching your digital sources.

Georeferencing

  • Georeferencer (version 4) is the software the Utrecht University Library uses for georeferencing: digitally positioning a scan of an old map over a modern reference map. This makes the scanned maps more geographically accessible.

Text recognition and transcription

  • ABBYY is intelligent process automation software that combines process mining with intelligent document processing.
  • Transkribus is a comprehensive tool for the digitization, AI-powered text recognition, transcription and searching of historical documents, from any place, any time, and in any language.
    Note: As a Utrecht University employee, you can now access Transkribus at a significant discount. Find out more about accessing the discount here.

Qualitative text analysis

  • NVivo is a software package specifically designed to assist researchers in organizing, analyzing, and gaining insights from unstructured or qualitative data.

Statistical analysis

  • IRaMuTeQ is software for statistical analysis of text corpora and of individuals/words tables. It is built on R and Python.
  • RStudio is an integrated development environment for the programming language R.
  • RDocumentation is a website that offers searchable documentation for R packages and their functions, including the datasets those packages ship with.

Network and visualization analysis

  • Gephi is open-source software for network analysis and visualization. It helps data analysts intuitively reveal patterns and trends, highlight outliers, and tell stories with their data.

Media tools

  • CLARIAH Media Suite facilitates research with large Dutch media collections by providing advanced search and analysis tools.
  • CoW is a tool, provided by CLARIAH, to convert a .csv file into Linked Data.

Storage

  • Yoda is a research data management service that enables researchers from Utrecht University and their partners to securely deposit, share, publish and preserve large amounts of research data during all stages of a research project.
  • DataverseNL is a research data repository where researchers can deposit, share, and publish their datasets openly. This platform is available to all employees of Utrecht University.

Speech recognition

  • BAS Web Services, funded and carried out by CLARIN-D, are a set of tools for speech sciences and technology where you can find tools and tutorials for tasks such as speech recognition, subtitling, and phonetic alignment.
  • Amberscript is a user-friendly automatic transcription tool for audio and video files in multiple languages. This tool is approved for use with sensitive data and is therefore available to all employees of Utrecht University.
  • Whisper, created by OpenAI, is an open-source automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web.

Writing tool

  • LiNT (an abbreviation for ‘Leesbaarheidsinstrument voor Nederlandse Teksten’, or readability instrument for Dutch texts) is an instrument that helps writers, writing coaches, and communication consultants assess the comprehensibility of texts. LiNT is developed and maintained by the CDH Research Software Lab.

Computational linguistics

  • #LancsBox is a corpus analysis tool that brings together many methods from computational linguistics. It lets you compare corpora and supports part-of-speech (POS) tagging.

Data capturing

  • Qualtrics is an online survey tool that allows researchers to set up and distribute surveys, collect data, and analyze responses. Privacy and security are well taken care of. All staff and students at Utrecht University can use Qualtrics free of charge.

Digital Humanities platforms

  • Nodegoat is a web-based research environment developed for the humanities. It allows you to set up a database and link, model, and analyze this data spatially and chronologically.
  • The Gale Digital Scholar Lab is a cloud-based digital research environment designed to support digital humanities research and to make it easy to create, manage, organize, and share content sets. You can access content and OCR data from Gale Primary Sources as well as your own plain text files, and analyze these with text and data mining tools.
  • The KB Lab hosts all tools that are built for and by users of the digital collections of the Koninklijke Bibliotheek.

Software provided by Utrecht University

Other tools

Are you looking for a specific tool? Visit our weekly digital humanities walk-in hours to consult with a DH expert, or send an email to cdh@uu.nl. Do you need a tool that doesn’t exist yet? The CDH Research Software Lab builds tailor-made tools and advises on the ICT solutions best suited to a specific research or educational problem. Take a look at their portfolio.