1) The first time you open TMAT, set/update TMAT Configuration Settings a) On the File menu, select "Import" and "Configuration File". b) Choose the file samples/sample_config.xml == Opening a new Text File == 1) On the File menu, select "Import" and "Text File". 2) Choose your text file. There is a sample Nahuatl text file in the samples/ directory. Make sure your file is in utf-8; TMAT is not currently very graceful in handling other encodings! == Editing / Verifying Tokenization 1) In an unparsed text, each parsable token will be highlighted by a grey(*) box. Unparsable tokens will be displayed on a white background. 2) To view the details of a token, click on the token text in the left text panel. 3) The text of the token will be displayed in the top text field on the right. The "Parsable" check box will indicate whether the token should be submitted to the parser. 4) To change the text of the token (e.g. to correct a typo), you can edit the text in the top right text field directly. 5) To split a token into two tokens, click in the token in the top right text field where you want to split the token and click the "Split" button. 6) To merge a token and its following token into a single token, click the "Merge with next" button. 7) To merge a token and its preceding token into a single token, click the "Merge with previous" button. 8) To delete a token from the text entirely, click the "Delete" button. 9) If the text has already been parsed and you would like to re-submit the current token to the parser, click the "Parse Now" button when you are done making changes to the token text. 10) If you have to manually correct the tokenization, you may want to check the "Tokenization Error" box and enter a note describing the problem. == Changing your TMAT Settings == 1) On the Edit menu, select "Preferences". 2) The "Plugins" tab lets you choose which Parser and Tokenizer plugin TMAT should use. See the "Plugins" section for more information. 3) The "Display" tab lets you set preferences relating to how documents look in TMAT. a) The "Show Morphemes" and "Show Glosses" checkboxes indicate which layers of the interlinear text display should be visible in parsed documents, while the "Show Parses" checkbox indicates whether colored highlighting should be used to show the parsing status of each token. b) The "Text Direction" section lets you tell TMAT if your source text should be displayed left-to-right or right-to-left. c) The "Spacing" section lets you adjust the space between lines, paragraphs, and tokens in the document display. d) The "Highlight Colors" lets you customize the highlight color used for tokens of different parsing statuses. To change one of the colors, click on the color block and select a new color from the "Select Color" dialog. 4) The "Problem Notes" tab lets you customize the list of problem note types that are available (as a list of checkboxes) for each token. a) By default, "Missing Dictionary Entry", "Parser Error", "Spelling Error/Typo", and "Tokenization Error" are available. b) You can edit the text of any existing note type by selecting the note text and clicking the "Edit" button. c) You can delete a note type by selecting its text and clicking the "-" button. d) You can add a new note type by clicking the "+" button and entering your text in the dialog box. 5) The "Ignore in Parsing" tab lets you enter regular expressions. Tokens matching any of these will be marked as non-parsing by TMAT and will not be submitted to the parser. a) To enter specific words to be ignored, you can just list the word. By default, "enero" is listed as a token that should be ignored. b) To enter a pattern to be ignored, you can enter a regular expression. "\d+" will ignore all numerals while "\.\.\.w+" will ignore all tokens that start with "...". c) Any regular expressions recognized by python are valid. You can view a summary of regular expressions in python at http://docs.python.org/lib/module-re.html 6) To change the display font in TMAT, on the Edit menu select "Font". == Working with Parses == 1) To parse all parseable (not-white(*)) tokens, on the "Tools" menu select "Parse" and "Parse Document". 2) The progress dialog will indicate the number of tokens that have been parsed. Clicking "Stop" on the progress dialog will cancel the parsing process. 3) Once parsing is complete, each token will be highlighted by a color indicating its parsing status. Colors can be customized in the TMAT Settings. 4) When a token with parses is selected, each of the parses returned by the parser is listed in the "Glosses and Parses" section. Selecting a radio button in this section confirms the corresponding parse as the correct parse for the token. 5) To manually add a parse to a token, click the "Add/Edit Manually" button and enter custom Gloss and Parse text for the token. 6) To remove a previously-selected parse decision, select the "None" radio button. == Adding and editing Problem Notes == 1) The last panel on the right side of the tool lists categories of problem notes. These categories can be customized through the TMAT Settings. 2) To add a Problem Note, click the corresponding check box. You will then be able to add text in the corresponding text field if you wish. 3) To remove a Problem Note, uncheck the corresponding check box. (*) The color used in the display can be changed through the Preferenced dialog, but the default colors are used in descriptions for this file.