meaning | format | from | to | |
1 PREFIX | values for FMT, LEX, LEX1, IDs1, POSspW1, IDsinW1, POSs1, nWORDs, nIDs, nPOSs | Text | 1 | LEX1 - 1 |
2 LEXICON | alfabetical list of indexed words | Text | LEX1 | IDs1 - 1 |
3 IDs | list of documents indexed | Text | IDs1 | POSspW1 - 1 |
4 POSpW | for each word: count + positions | int4 | POSspW1 | IDsinW1 - 1 |
5 IDsinW | for each word: 1 bit per ID
int4_count= CEILING(nIDs/32) |
int4 | IDsinW1 | POSs1 - 1 |
6 POSs | sequential for each ID: count, positions in document | int4 | POSs1 | LEN_TRIM(InvIdx) |
PREFIX | FMT=hex3 LEX=UPDATE LEX1=129 IDS1=149 POSSPW1=158 IDSINW1=174 POSS1=190 NWORDS=5 NIDS=3 NPOSS=22 | LEX1-1 characters
default FMT=int4 |
LEXICON | a banana is it what | 5 Words, sorted alfanum, space separated |
IDs | T1.T2.T2. | 3 IDs in order of call, . denote linefeed characters |
written with FMT | a | banana | is | it | what | |
POSspW | 2 | 2 | 7 | 7 | 4 | occurences (includes count) |
IDsinW | 4 | 4 | 1+2+4=7 | 1+2+4=7 | 1+2=3 | Bitsum of ID numbers that have word. |
POSs | 2 7 | 2 9 | 3 4 12 2 6 2 4 | 3 1 D 2 9 2 1 | 2 8 2 1 | count and positions in ID, HEX |