by gcbw3
2172 days ago
Considering the close relationship between Google and Wikimedia (https://en.wikipedia.org/wiki/Google_and_Wikipedia) and the considerable money Google gives them, how can one not see this project as "crowdsourcing better training datasets for Google"? Can the data be licensed as GPL-3 or similar?
Almost every research group and company doing NLP work uses Wikipedia. I'd say it is a fantastic donation by Google, one that improves science generally.
> Can the data be licensed as GPL-3 or similar?
It's under CC BY-SA and (with a few exceptions) the GNU Free Documentation License.