Add/Evaluate SD-Rules

This section describes how to add and evaluate an SD-Rule. Once this step is done, the next step is to derive an optimized SD-Rules set.

  • SD-Rules sources:
    • Original SD-Rules (done)
    • Derived from high frequency nomD-Pairs
      ln -sf ${NOM_D_DIR}/data/${YEAR}/data/nomD.yes.data.type.S nomD.yes.S.data
      shell> cd ${SUFFIX_D_DIR}/bin
      shell> GetSdRule ${YEAR}
      2
      nomD
    • Derived from high frequency original SD-Facts
      ln -sf ../../2013/data/orgD.yes.S.data ${PRG_D_DIR}/data/${YEAR}/data/orgD.yes.S.data
      shell> cd ${SUFFIX_D_DIR}/bin
      shell> GetSdRule ${YEAR}
      2
      orgFacts
    • Suggested by users, experts, and linguists.
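The linking steps above can be wrapped in a small helper that refuses to create a dangling link. This is only a sketch: link_input is a hypothetical helper name and is not part of the SD-Rules tools. A relative target is checked relative to the directory of the link, which is how a symbolic link resolves it.

```shell
#!/bin/sh
# link_input TARGET LINK  (hypothetical helper, not part of the SD-Rules tools)
# Creates LINK as a symbolic link to TARGET, but only if TARGET exists.
link_input() {
    target=$1
    link=$2
    # A relative symlink target is resolved from the link's own directory.
    case $target in
        /*) resolved=$target ;;
        *)  resolved=$(dirname "$link")/$target ;;
    esac
    if [ ! -e "$resolved" ]; then
        echo "missing input file: $resolved" >&2
        return 1
    fi
    ln -sf "$target" "$link"
}
```

For example, the first link could be created with: link_input "${NOM_D_DIR}/data/${YEAR}/data/nomD.yes.data.type.S" nomD.yes.S.data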

  • Verify:
    • Test each new rule against the previous SD-Rule set:
      • Set up the following data before running:
        • ${SUFFIX_D_DIR}/data/${YEAR}/dataOrg/sdRules.data.2017
        • ${SUFFIX_D_DIR}/data/${YEAR}/dataR/SdRulesCheck/${YEAR}
      • Check duplication, parents, child rules:
        shell> cd ${SUFFIX_D_DIR}/bin
        shell> GetSdRule ${YEAR}
        5
        SD_YEAR (${YEAR})
        TEST_RULE (es$|noun|ic$|noun)
        => Make sure the output reports:
            -- It is a root parent-rule (OK)!
            -- Good: no parents, duplicated, children rules found!
        		
    • Manually add non-duplicated new rules to sdRules.data.${YEAR}
      • Verify the updated sdRules.data.${YEAR}:
        shell> cd ${SUFFIX_D_DIR}/bin
        shell> GetSdRule ${YEAR}
        4
        year
        ${YEAR}
        => Follow the on-screen instructions to confirm that the file is OK
    • Get the SD-pairs list:
      • Get SD-pairs for each new SD-Rule:
        shell> cd ${SUFFIX_D_DIR}/bin
        shell> GetSuffixD ${YEAR}
        10
        suffix-1|cat1|suffix-2|cat2|status|source|relation

        Save ./data/suffixD.tbd.data.option to ./data/newRules

    • Send the above SD-pair lists to linguists for tagging.
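
Before running the GetSdRule checks in the Verify steps, a candidate rule can be pre-screened for format and exact duplication. The following is a minimal sketch and does not replace the GetSdRule checks (it knows nothing about parent or child rules). Both function names are hypothetical, and the sketch assumes that the rules file holds one rule per line with "|"-separated fields whose first four fields are suffix-1|cat1|suffix-2|cat2, as in the TEST_RULE example above.

```shell
#!/bin/sh
# Hypothetical pre-check helpers; they do not replace the GetSdRule checks.

# rule_is_well_formed RULE
# Succeeds if RULE has exactly four non-empty "|"-separated fields
# (suffix-1|cat1|suffix-2|cat2) and both suffix fields end in "$".
rule_is_well_formed() {
    printf '%s\n' "$1" | awk -F'|' '
        NF != 4 { exit 1 }
        $1 == "" || $2 == "" || $3 == "" || $4 == "" { exit 1 }
        $1 !~ /\$$/ || $3 !~ /\$$/ { exit 1 }
    '
}

# rule_in_file RULE FILE
# Succeeds if some line of FILE has the same first four fields as RULE.
# Assumption: FILE stores one "|"-separated rule per line.
rule_in_file() {
    awk -F'|' -v rule="$1" '
        ($1 "|" $2 "|" $3 "|" $4) == rule { found = 1; exit }
        END { exit !found }
    ' "$2"
}
```

A rule that passes rule_is_well_formed and fails rule_in_file is only a candidate; the duplication, parent, and child checks reported by GetSdRule remain the authoritative test.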

  • Add/Evaluate SD-Rules log