CRITT TPR-DB

The CRITT Translation Process Research Database (CRITT TPR-DB) consists of two sections. In private section, researchers can upload, process, analyze, download and also deleted their Translog-II compatible data. Privat data can also be added to the public section of the TPR-DB, if requested. A license for a private account can be obtained upon request. 

The second section of the TPR-DB is a publicly available database of recorded text processing sessions (mostly translation). It is available under a creative commons license  (see license). The TPR-DB consists of a data lake (raw data) of user activity data (UAD) from translation (and other text processing) studies recorded with Translog 2006, Translog-II, and with the CASMACAT workbench. This data acquisition software logs keystrokes and gaze data during text perception and text production. 

In addition to the raw logging data, a post-processed version of the database (TPR-DB) can be downloaded which consists of several tab-separated summary tables that can be more easily processed by various visualization and (statistical) analysis tools. 

More detailed information is available below and under these links: 


Post your technical, methodological, and theoretical questions and comments here

Download public studies via the TPR-DB management tool

pre-compiled summary tables can be downloaded from the CRITT TPRDB management tool:

Download raw TPR-DB data from sourceforge

Alternatively, the raw logging and aligned data for all sessions are also available on sourceforge https://sourceforge.net/projects/tprdb/ and can be checked out via svn (approx. 50 GB!)

On Linux (or cygwin):

On Windows:

Earlier versions of post-processed and zipped TPR-DB tables can be downloaded from here: https://sourceforge.net/projects/tprdb/files/

newer versions of the tables should be downloaded via the TPR-DB management tool: 

Generating a TPR-DB 

Visualizing TPR-DB data

Documentation

For documentation on how to extract and convert the raw logging data into the TPR-DB format, read this document. The document describes how to run the scripts in the "bin" in the TPR study folder. The database compilation process also requires external tools & resources:

License

The CRITT Translation Process Research Database (TPR-DB) by the Center for Research and Innovation in Translation and Translation Technology is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

We would like to thank all contributors and participants for their work.