|
|
|
|
|
by marginalia_nu
1165 days ago
|
|
1) At this moment about 70 million documents. I've had it at about 110 million, dunno what the actual limit is. 2) Yes. Everything is in-house. Do you build a word index by document and find documents that match all words in the query?) Yeah. It's actually got three indices; * One is a forward index with `document id -> document metadata` * One is a priority term index with `term -> document id`. * One is a full index with `term -> (document, term metadata)` They're all based on static b-trees. |
|