Anda di halaman 1dari 1


Truth is a complete transcription of everything in an audio file, including errors, noises, ums, etc. Here are some quick rules for typing Truth. For more details and explanations, or if you have questions, please refer to Truth Guidelines for Transcription NVC OVS.

DO change or delete any word that is NOT the one dictated. DO use normal dictionary capitalization. DO capitalize names, e.g. John, December, Canada, English. DO type a.m. and p.m., NOT A.M. , AM , am etc. DO write initials and spelled letters in upper case with periods, e.g. J. P. Type spoken modifying words like capital as separate words. DO type stock symbols as a single token of upper-case letters with no spaces, e.g. stock price for GOOG. DO transcribe single words for parts of URLs like.www., .com, .de, etc. DO type out words like doctor and mister, do not abbreviate them; capitalize them when part of a name e.g. Doctor Smith. DO type out address words like street and drive, do not abbreviate them; capitalize them when part of an address. DO type out numbers, including years, how they are pronounced as separate words without hyphens. DONT write digits. For example twenty one not twenty-one or 21. DO type fractions and decimals as the actual words that were said. Do not add slashes or hyphens that were not dictated.

DO Special words and forms

DO transcribe spoken punctuation with a backslash form, e.g. .\period. Do not insert punctuation that was not dictated. DO type abbreviations, acronyms, and contractions as spoken, DONT expand them. DO guess how a word is spelled as \sp:guessed spelling, where guessed spelling is your best guess of the spelling. DO include pronunciations for emoticons, e.g. :-)\smiley-face. DO type pause fillers as \pf:pronunciation, e.g. \pf:um. If the speaker begins a word and then stops, use a tilde (~) to show where the word is broken. If you know what s/he was saying, indicate the rest after a backslash, with the missing part of the word in square brackets [ ]. E.g. Califor~\Califor[nia]. If you cant understand what was said, type \unintelligible. DO type non-speech noises as \noise:name, where name is the name of the noise, e.g. \noise:sneeze, \noise:background_noise DO write markers like \noise in English for all languages, e.g.for French write \noise:music rather than \bruit:musique. DO type mispronunciations as intended word\mp:pronunciation, where pronunciation is an attempt to write what the person actually said. E.g. you\mp:cha and doing\mp:doin. DONT use an apostrophe to show missing sounds (doin)! DO mark files that are tests, blank files, audio errors messages, no dictation, and other languages with [g] [o] [ae] [bad wave] [p] and [i]. [g] gibberish [o] caller on the line but no speech [ae] audio error message [bad wave] open mic recording with no intended dictation [p] good speech surrounded by open mic [i] speech in another language DO mark all speakers after the first in a file with <intruderN> speech </intruderN> where N is a number starting with 1 and increasing for additional speakers.

DONT remove underscores, hyphens, or backslashes, so long the word reflects what was dictated. DONT fix a speakers incorrect grammar, false starts, or repetitions. DONT make stylistic changes to spelling or formatting. DONT capitalize a word that is generally lowercase, even if it occurs at the beginning of a sentence. DONT change a possessive suffix that looks like 's\z\r. But DO type possessives from scratch normally, e.g. Johns. DONT use an apostrophe for a plural! Use apostrophe only for possessives. Spell the plural of acronyms by adding a lowercase s with no apostrophe.