|
Understanding the Output of DRC
DRC creates a number of output files, depending on the command line options specified. A directory will be created to contain these output files, the name of which will usually be the concatenation of either the word being simulated, or the name of the batch file, and the extension ".drc". For example, a simulation of the single word "kyoot", will result in a directory called "kyoot.drc". A batch job executed from a batch file named "mybatch" will result in a directory named "mybatch.drc". If a directory with the same name already exists, DRC will append "-1" or "-2" etc. to the directory name until it finds a name for a directory that does not exist. Inside the ".drc" directory, there will be at least two files: a "parameters" file, and a ".RT" file. If the "-a" command-line option was used, there will also be one ".acts" file for each word simulated. The parameters file is a text file containing the parameters used during the simulation. The format of the other files is outlined below. The .RT fileThe .RT file contains a preamble, specifying a few important pieces of information about the simulations run. Each line in the preamble is preceded by a semi-colon. Here is an example:
Following this preamble there will be one line for each word simulated. Each line has 5 or 6 fields. They are:
For field 3, LOWAC indicates that activation levels in the phoneme layer did not reach the critical levels needed for the input to be considered "named". The critical levels required are defined by the MinNamingPhonology parameter. Field 5 contains an indicator of whether the final pronunciation produced by DRC was correct. If the input was a word in DRC's vocabulary, CORRECT means that the pronunciation produced by DRC matched one of the pronunciations for that word in the vocabulary. If the input was a nonword, CORRECT means that the final pronunciation produced by DRC matches the pronunciation produced by DRC's GPC rules for the nonword. If the extra output option (-e) is used, the output will contain the following fields.
In field 2, HETEROPHONIC indicates that the input was a word in DRC's vocabulary which is a heterophonic homograph. The correct pronunciation given in field 8 will be the regular (GPC rule based) pronunciation if the input was a nonword. If the input was a word then it will be either the pronunciation for the input that matches DRC's output, if DRC's output was correct, or the pronunciation with the most matching phonemes, with higher frequency pronunciations selected in the case of a tie. The .acts FilesThese files contain information about the levels of activation that occured during the simulation. There will be a preamble, similar to the preamble for the .RT file, at the start of each .acts file. Following that will be many lines indicating activation levels at different stages of the simulation. Only activation levels greater than or equal to the minimum are reported. At the end of each cycle, the letter layer, orthographic lexicon, phonological lexicon, GPC route and phoneme layer will have activation levels reported in order. After this a set of totals for each layer will be reported. For the letter layer, GPC route, and phoneme layer, there will be one line for each letter or phoneme with a sufficiently high activation. Each line will contain 4 fields:
For example: Cycle140 L2 0.999600 A Indicates that at cycle 140, the letter 'A' at position 2 in the letter layer had an activation level of 0.999600. For the two lexicons, there will be a line for each lexical unit with sufficiently high activation. Each line will contain 4 fields:
Examples:
Indicate that at cycle 140, the word THAW in the orthographic lexicon had an activation of 0.016596, while the phonology /D{t/ in the phonological lexicon had an activation of 0.793794. Prior to the reporting of the output activation level of the GPC route, a single line will appear showing which GPC rules are being used by the GPC route. This line has 4 fields:
For example:
Inidicates that at cycle 140, the input to the GPC route was THAT++++ (+ is the 'blank' indicator) and matching rules were: Letters TH produce output phoneme /T/, the letter A produces output phoneme /{/ and the letter T produces output phoneme /t/. The totals are reported one on each line. There are 3 fields on the totals lines:
Examples: TL Cycle140 8.735193 TO Cycle140 1.046514 TP Cycle140 0.793794 TGPC Cycle140 6.868971 TPh Cycle140 5.274763 Inidcate that at cycle 140 the letter layer had a total activation of 8.735193, the Orthographic Lexicon has a total activation of 1.046514, the Phonological lexicon had a total activation of 0.793794, the total output activation of the GPC route was 6.868971 and the total activation in the phoneme layer was 5.274763. |