|
|
|
|
|
by the_duke
95 days ago
|
|
What context length and related performance are you getting out if this setup? At least 100k context without huge degradation is important for coding tasks. Most "I'm running this locally" reports only cover testing with very small context. |
|
The models can be frustrating to use if you expect long contexts to behave like they do on SOTA models. In my trials I could give them strict instructions to NOT do something and they would follow it for a short time before ignoring my prompt and doing the things I told it not to do.