Tryag File Manager
Home
-
Turbo Force
Current Path :
/
home
/
cluster1
/
data
/
bu01
/
1121861
/
html
/
jlex
/
test
/
Upload File :
New :
File
Dir
//home/cluster1/data/bu01/1121861/html/jlex/test/data_cleanser.html
<html> <body> <h3>Data Cleansing Module</h3> <form action="php4/data_cleanser.php4" method="post"> Project Name: <input type="text" name="project" size="25"/>e.g. nahuatl<br/> Head Tag: <input type="text" name="head_tag" size="25"/>e.g. refgroup<br/> Id Field: <input type="text" name="id_field" size="25"/> e.g. ref<br/> Cross Reference Field <input type="text" name="cross_ref" size="25"/> e.g. mref<br/> Cross Reference Dialect Field <input type="text" name="cross_ref_d" size="25"/> e.g. mref_d<br/> Fields to search for each dialect: Create rules to associate the value of a dialect field of a cross-reference with a headword field. This rules must be in the form X = Y, e.g. Am = lxam.<br/> <textarea name="dialect_fields" rows="5" cols="30"></textarea><br/> <input type="submit"/> </form> <hr/> <h3>Embedded XML Cleanser</h3> Note: This module searches the xml file for embedded tags. Then, it searches the MySQL database for entries containing the content within the embedded tags.<br/> This means that you must put the xml file in the directory: jlex/nahuatl/php4. <form action="php4/embedded_xml_cleanser.php4" method="post"> <b>Project Name:</b> <input type="text" name="project" size="25" value="nahuatl"/>e.g. nahuatl<br/> <b>XML file: </b><input type="text" name="db_xml" size="50"/>e.g. ActiveNahuatl_2005.xml<br/> <b>Head Tag:</b> <input type="text" name="head_tag" size="25" value="refgroup"/>e.g. refgroup<br/> <b>Id Field:</b> <input type="text" name="id_field" size="25" value="ref"/> e.g. ref<br/> <br/> Please provide a start ID and an end ID. This reduces the set of entries which are cleansed. Remember that you have over 70,000 tags.<br/> Checking the content of every tag will be very time consuming (for the computer). Its better to check only a subset at a time.<br/> However, by leaving the Start ID and End ID empty, all records will be checked. <br/> <b>Start ID:</b> <input type="text" name="id_start" size="25"/> e.g. 1<br/> <b>End ID:</b> <input type="text" name="id_end" size="25" /> e.g. 100<br/> <br/> <b>Fields to search for each embedded xml tag:</b> Create rules to associate an embedded xml tag with the existence of an entry containing a particular column(s).<br/> For example, for the embedded tag <nlao>, there should be an entry containing a value for the lxam field and the lxoa field.<br/> To express this rule, write: <b>nlao = lxam,lxoa</b><br/><br/> You may have embedded xml tags signifying an entry should not exist containing a value for a specified field. Use an <b>!</b> to signify a column should be empty.<br/> For example, for the embedded tag <nla>, there should be an entry containing a value for the lxam field but NOT for the lxoa field.<br/> To express this rule, write: <b>nla = lxam,!lxoa</b><br/> <b>NOTE: Seperate fields with by commas (with no spaces in between), e.g. lxam,lxoa<br/> <textarea name="db_cols" rows="5" cols="30"> i l n na nao nam namoa nba nbo nbao nl nla nlo nlao nlam nlamoa no nr nt spn r</textarea><br> <input type="submit"/> </form> </body> </html>