Hacker News new | ask | show | jobs
by EdwardDiego 265 days ago
> That's why you tell claude code to write tests, and use them

I've seen Python unit tests emitted by LLM that, for a given class under test, start with.

    def test_foo_can_be_imported(self):
        try:
            from a.b.c import Foo
        except ImportError:
            self.fail()


    def test_foo_can_be_instantiated(self):
        from a.b.c import Foo
        instance = Foo()
        self.assertNotNull(instance)
        self.assertTrue(isinstance(instance, Foo)

   def test_other_stuff_that_relies_on_importing_and_instantiating_foo(self)
        ...
And I've watched Cursor do multiple rounds of

"1: The tests failed! I better change the code. 2: The tests failed! I better change the tests. GOTO 1"

until it gets passing tests, sometimes by straight out deleting tests, or hardcoding values to make them pass.

So I don't have the same faith in LLM-authored tests as you do.

> I feel like hallucinations have become a cop out, an excuse, for people who don't want to learn how to use these new tools anyway.

I feel like you've taken that attitude so you can dismiss concerns you don't agree with, without having to engage with them. It's disappointing.