| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by benschulz 1800 days ago
	My understanding is that it has been manually tested. I.e. it has produced correct results to previously intractable problems. I'm not sure how much automated testing would add at that point.

1 comments

dmos62 1800 days ago

Unit testing usually isn't easily replaced by manual testing. If you have, for example, 3 units that can be in 2 different modes each, that's 2^3 different combinations, but only 2*3 unit modes. Testing the end result is more work than testing the units.

link

dekhn 1800 days ago

Discovery science is different from web software engineering. Most discovery scientists use manual testing, not unit testing. Very few actually do integration tests or system tests (this is something I'm trying to change).

And, given the external results of the application, it's unclear to me how much additional value would come from a rigorous testing system.

link

dmos62 1800 days ago

> Very few actually do integration tests or system tests (this is something I'm trying to change).

Care to expand on what you're trying to do?

link

dekhn 1800 days ago

Sure, I'm trying to take the idea of merging continuous integration with workflow/pipelines. It's all stuff that I learned at Google and is non-proprietary. The idea is have presubmit checks that invoke a full instance of a complex pipeline, but on canned (synthetic or pseudoanonymized or somehow not directly connected to the prod system) data, as an integration test. This catches many errors that would be hard to debug later in a prod workfflow.

In a sense, I see software testing/big web data and modern large scale data processing in science as a continuum and I want to bring the practices from the big web data and testing fields to bear on science pipelines.

link

dmos62 1800 days ago

Apart from a shift in mental attitude, is it primarily about getting a dataset for the integration test?

link

dekhn 1800 days ago

also making sure the testing is hermetic (not breaking prod) and all the components are actually reproducible.

link