|
|
|
|
|
by energy123
178 days ago
|
|
An example of why a basic understanding is helpful: A common sentiment on HN is that LLMs generate too many comments in code. But comment spam is going to help code quality, due to the way causal transformers and positional encoding works. The model has learned to dump locally-specific reasoning tokens where they're needed, in a tightly scoped cluster that can be attended to easily, and forgetting about just as easily later on. It's like a disposable scratchpad to reduce the errors in the code it's about to write. The solution to comment spam is textual/AST post-processing of generated code, rather than prompting the LLM to handicap itself by not generating as much comments. |
|
Like I said, it's a trap to reason from architecture alone to behavior.