Columbia Games Corpus » ToBI Labeling Guidelines

PLEASE NOTE

Recent changes are highlighted in yellow.

Things highlighted in red are waiting for confirmation, and subject to change. Please ignore them.

  » 0. Files and Links
  » 1. Getting started
  » 2. Getting the annotation files
  » 3. Correcting common mistakes in the .words files
  » 4. Guidelines and examples
  » 5. Labeling ambiguities
  » 6. How to save and submit your work

  « Back to the main page


0. Files and Links

ToBI Documentation



1. Getting started

  1. Go to the File Locking page.
  2. Look for the wav file you want to label and check the following: its "Orthographic Transcription" and "Word Alignment" tasks must have "finished" status, and its "ToBI Labeling" task must have "not started" status and no person assigned to it.
    If these conditions are not met, do not work on this file.
  3. Click "EDIT" in the "ToBI Labeling" task of the wav file you want to label.
  4. On the next screen, change its status to "in progress", and enter your name.


2. Getting the annotation files



3. Correcting common mistakes in the .words files

In the ortographic tier you might find some common mistakes. If you do, please correct them and submit the .words file along with the rest.



4. Guidelines and examples



5. Labeling ambiguities



6. How to save and submit your work

Once you have finished labeling a complete .wav file, follow these steps:

  1. WaveSurfer: Send the new ".breaks", ".tones", and ".misc" files to agus [at] cs.columbia.edu, maintaining the original file names. If you modified the ".words" file, submit it too.
    Praat: Send the new ".TextGrid" file to agus [at] cs.columbia.edu.
  2. Go to the File Locking page.
  3. Click "EDIT" in the "ToBI Labeling" task of the wav file you have labeled.
  4. On the next screen, change the task's status to "finished", and enter the current date.


« Back to the previous page