|
|
|
|
|
by code51
60 days ago
|
|
Why on earth is nobody here talking about the sudden jump to use von Mangoldt function? The reasoning trace never types Λ, never types "von Mangoldt", and never invokes ∑_{q|n} Λ(q) = log n. There is a clear discontinuity at play. I remember an article on this, maybe a comment by Terence Tao himself, seen here, but cannot find it. |
|
There is a relationship between the tokens in the output in the model's vector space, that is the most important, and something hidden we will never see.