	by 0bit 1390 days ago
	I would recommend using Apache Tika to extract the text from the PDFs and using Solr (or Elasticsearch) to index and search them.
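A rough sketch of what that pipeline could look like, using only the Python standard library against the HTTP interfaces of Tika Server and Elasticsearch. The host/port values are the usual defaults but are assumptions here, and the index name `pdfs`, the field name `content`, and all function names are hypothetical choices for illustration, not anything the comment specifies. Solr would work along similar lines with its own endpoints.

```python
import json
import urllib.parse
import urllib.request
from pathlib import Path

TIKA_URL = "http://localhost:9998/tika"  # assumption: Tika Server on its default port
ES_URL = "http://localhost:9200"         # assumption: Elasticsearch on its default port
INDEX = "pdfs"                           # hypothetical index name


def tika_request(pdf_bytes: bytes) -> urllib.request.Request:
    """Build a PUT request asking Tika Server for the plain text of a PDF."""
    return urllib.request.Request(
        TIKA_URL,
        data=pdf_bytes,
        method="PUT",
        headers={"Accept": "text/plain", "Content-Type": "application/pdf"},
    )


def index_request(doc_id: str, text: str) -> urllib.request.Request:
    """Build a PUT request that stores the extracted text as one document."""
    body = json.dumps({"content": text}).encode("utf-8")
    safe_id = urllib.parse.quote(doc_id, safe="")  # escape spaces, slashes, etc.
    return urllib.request.Request(
        f"{ES_URL}/{INDEX}/_doc/{safe_id}",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


def search_request(query: str) -> urllib.request.Request:
    """Build a full-text match query against the hypothetical 'content' field."""
    body = json.dumps({"query": {"match": {"content": query}}}).encode("utf-8")
    return urllib.request.Request(
        f"{ES_URL}/{INDEX}/_search",
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def send(req: urllib.request.Request) -> bytes:
    """Send a prepared request and return the raw response body."""
    with urllib.request.urlopen(req) as resp:
        return resp.read()


def index_pdf(path: Path) -> None:
    """Extract text from one PDF via Tika, then index it under the file stem."""
    text = send(tika_request(path.read_bytes())).decode("utf-8")
    send(index_request(path.stem, text))
```

With both services running, something like `for p in Path("docs").glob("*.pdf"): index_pdf(p)` followed by `json.loads(send(search_request("invoice")))` would index a folder and run a search. Building the requests separately from sending them keeps the sketch easy to inspect without a live server.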