Y
Hacker News
new
|
ask
|
show
|
jobs
by
sharemywin
984 days ago
distilbert was trained from Bert. there might be an angle using another model to train the model especially if your trying to get something to run locally.