| > That's why you tell claude code to write tests, and use them I've seen Python unit tests emitted by LLM that, for a given class under test, start with. def test_foo_can_be_imported(self):
try:
from a.b.c import Foo
except ImportError:
self.fail()
def test_foo_can_be_instantiated(self):
from a.b.c import Foo
instance = Foo()
self.assertNotNull(instance)
self.assertTrue(isinstance(instance, Foo)
def test_other_stuff_that_relies_on_importing_and_instantiating_foo(self)
...
And I've watched Cursor do multiple rounds of"1: The tests failed! I better change the code. 2: The tests failed! I better change the tests. GOTO 1" until it gets passing tests, sometimes by straight out deleting tests, or hardcoding values to make them pass. So I don't have the same faith in LLM-authored tests as you do. > I feel like hallucinations have become a cop out, an excuse, for people who don't want to learn how to use these new tools anyway. I feel like you've taken that attitude so you can dismiss concerns you don't agree with, without having to engage with them. It's disappointing. |