Prefix Table

I. Inputs

  • LEXICON.ascii
  • normInflVarEui.data

II. Outputs

  • File: prefixTable.data
  • Format:
    normalized inflVarEUICitation Formspelling var 1spelling var 2...

III. Algorithm

  • Get Prefix table:
    shell>GetPrefixTable normInflVarEui.data LEXICON.ascii prefixTable.data.org
    • Read in LEXICON.ascii and load it into Hashtable<EUI, LexRecordObj>
    • Read in normInflVarEui.data (ASCII)
      • This table is used to replace prefix table
      • It is linked from ${STMT_DIR}/PreProcess/data/Lex/{$YEAR}Ascii
      • It is done by ${STMT_DIR}/PreProcess/bin/1.PreStmt
    • Go through each line and print out (to prefixTable.data.org)

  • Sort and unify
    sort -u prefixTable.data.org > prefixTable.data

  • Check pure ASCII