Y
Hacker News
new
|
ask
|
show
|
jobs
by
pabs3
426 days ago
It is definitely doable to get openly licensed data, you just have to do it via voluntary participation of crowdsourced data acquisition programs. For example the RNNoise model was retrained from such crowdsourced data.
1 comments
tedivm
426 days ago
IBM did it with their Granite models.
link
pabs3
426 days ago
The data used for training Granite doesn't sound like it would be under FOSS licenses.
https://en.wikipedia.org/wiki/IBM_Granite
link