|
|
|
|
|
by gioark
3292 days ago
|
|
hi, I am the author of the article. @devhead: To import the data into ES we used a custom application to extract the text from the OCR'd documents.
This is required to support our bookreader software. A complete ingestion takes a few days; we rate-limit indexing in order not to overload the cluster, and maintain reasonable search performance. |
|