Hacker News
by EGreg 538 days ago
I meant deep neural networks with a transformer architecture and self-attention, which is what lets them be trained in parallel on GPUs. They don't have to be "large language" models specifically, if that's your hangup.