Package gov.nih.nlm.nls.nlp.textfeatures

Interface Summary
TokenInterface TokenInterface defines what a token should be.
 

Class Summary
AVL AVL Wed Oct 11 10:33:18 EDT 2000, mork Initial Version Notes:
Candidate Candidate Wed Oct 11 10:33:18 EDT 2000, mork Initial Version
CandidateListbyPosComparator  
CandidateListbyScoreComparator  
CandidateToken A CandidateToken is a Token that comes from a UMLS_String, which has been tokenized by the wordTokenizer().
Category Category or parts of speech, such as noun, verb, adjective, adverb.
Chunk Chunk A Chunk is a component of a Sentence.
Collection Collection is a collection of documents.
Cui Cui is a class to convert Cui strings to a hash and visa versa.
Document Document is the overarching object that holds all information about a document and it's analysisis.
DocumentFactory NLS Document Factory Created: Wed Jul 23 13:17:31 2003
FinalCandidateListbyScoreComparator  
FinalListbyCheckSumComparator  
FinalMapping FinalMapping is a container for a set of Candidates that cover a phrase.
FinalMappingsByScoreComparator  
Inflection Inflection is an enumeratio of all known inflections.
LexicalElement A lexicalElement is a meaningful sequence of tokens.
LexicalEntry LexicalEntry Fri Jun 02 17:46:45 EDT 2000, divita Initial Version
MatchMap MatchMap Mon Oct 16 09:51:03 EDT 2000, mork Initial Version
MatchMapComparator  
MmObject MmObject contains those attibutes and features that are common to the abstract objects associated with analyzing text.
Msu Msu Minimal Syntatic Unit Mon Jun 19 14:37:55 PDT 2000, divita Initial Version
Phrase A phrase is made up of a sequence of lexicalElements.
Section The Section class holds sections of medLine citations, sections of papers such as Title, abstract, intro, methods summary.
SemanticTypesChecker SemanticTypesChecker Thu May 05 09:49:12 GMT-05:00 2005, divita Initial Version
Sentence Sentence A sentence is made up of a set of phrases.
Shape Shape Thu Jun 27 13:06:00 EDT 2002, divita Initial Version
SourceInfoComparator SourceInfoComparator compares UMLSSourceInfo objects based on their presidence Thu May 12 17:43:07 GMT-05:00 2005, divita Initial Version
Span Span records the offsets of the mmObjects in terms of character positions.
Sui Sui is a class to convert Sui strings to a hash and visa versa.
Token Token is the atomic meaningful piece of text.
TokenChars  
TokenFactory TokenFactory returns a new Token of the appropriate type There could be multiple kinds of tokens - thus the interface This is currently used only for the wordTokenizer, so that candidate concepts can get tokenized, and have the candidateConcept tokens contain back pointers to the candidate Concept they refer to.
Tui Tui Wed Nov 10 11:13:08 GMT-05:00 2004, divita Initial Version
UiUtils UiUtils maps chars to hash numbers and visa versa Wed Nov 10 11:13:08 GMT-05:00 2004, divita Initial Version
UMLS_ConceptComparator This is a comparator compares two UMLS_ConceptPointers to sort them based on their score.
UMLS_ConceptPointer UMLS_ConceptPointer Tue Sep 10 11:47:12 EDT 2002, divita Initial Version
UMLS_SemanticTypePointer UMLS_SemanticTypePointer is a container that holds info for a UMLS Semantic Type, including a semantic type name, an abbreviation of it, and its TUI.
UMLS_StringPointer UMLS_StringPointer Tue Sep 10 11:47:12 EDT 2002, divita Initial Version
UMLSSourceInfo UMLSSourceInfo is a container for UMLS Source information.
Variant Variant Fri Jun 02 17:46:46 EDT 2000, divita Initial Version
VariantConceptTokenMapping VariantConceptTokenMapping Mon Mar 21 16:55:19 GMT-05:00 2005, divita Initial Version
VocabularySourceChecker VocabularySourceChecker is a container of those sources to include or exclude Thu May 05 09:49:11 GMT-05:00 2005, divita Initial Version
WordCount WordCount is a node in the wordFrequency table.
 



The use and distribution of this material is subject to the terms and conditions included in the file SPECIALIST_NLP_TOOLS_TERMS_AND_CONDITIONS.TXT, located in the root directory of the distribution.