by xg15
185 days ago
I think it could be useful to combine the two paradigms to get a better understanding of what transformers can and cannot learn. E.g. would it be possible to create an algorithm that takes a grammar (and maybe a desired context window size) as input and constructs a transformer network that generates sentences exactly from that grammar? ("Construct" here means setting the weights directly, without any iterative training process.)
|
>E.g. would it be possible to create an algorithm that takes a grammar (and maybe a desired context window size) as input and constructs a transformer network that generates sentences exactly from that grammar?
You don't need transformers for what you describe. That's theory of computation 101, the class where you learn about automata, grammars, parsers, and generators.
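To make the "generators" part concrete, here is a rough sketch of a grammar-driven sentence generator with no neural network involved. Everything in it is made up for illustration: the toy grammar for a^n b^n, the names `GRAMMAR`, `generate`, and `sentence`, and the depth cutoff, which assumes the shortest alternative of each rule terminates (true for this grammar, not in general).

```python
import random

# Hypothetical toy grammar for the language a^n b^n (n >= 1).
# Keys are nonterminals; each maps to a list of alternative right-hand sides.
GRAMMAR = {
    "S": [["a", "S", "b"], ["a", "b"]],
}


def generate(symbol, grammar, rng, max_depth=10):
    """Expand `symbol` into a list of terminals by random derivation."""
    if symbol not in grammar:
        return [symbol]  # anything without a rule is treated as a terminal
    alternatives = grammar[symbol]
    if max_depth <= 0:
        # Depth budget used up: force the shortest alternative.
        # Assumption: that alternative terminates, as it does here.
        alternatives = [min(alternatives, key=len)]
    rhs = rng.choice(alternatives)
    out = []
    for s in rhs:
        out.extend(generate(s, grammar, rng, max_depth - 1))
    return out


def sentence(seed=None):
    """Return one random sentence of the toy grammar as a string."""
    rng = random.Random(seed)
    return "".join(generate("S", GRAMMAR, rng))


if __name__ == "__main__":
    for seed in range(5):
        print(sentence(seed))
```

Every string this produces is in the language by construction, since it only ever applies the grammar's own rules. It does not address the original question of encoding the same thing in transformer weights.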