|
|
|
|
|
by julien040
1098 days ago
|
|
Thank you for reading it. The hit rate is low. I've only tried to get embeddings for stories with a score greater than 100.
SQL Query "SELECT count(*) FROM story WHERE score > 100;" gives me 155,228 stories and the corpus size is 108,477 stories. 108,477/ 155,228 = 0,6988236658 The main problems were 404 links and posts that weren't articles (such as tweets). |
|