by ahakanbaba
1240 days ago
Fair enough, agreed.
It is tricky to “mock”, as you said. Our CI/CD platform and its owners get unhappy if we spawn an ad hoc Spark session for testing purposes. There is also a general expectation that unit tests are self-contained and portable, so you can run them on macOS, Linux, and ARM without much effort. Another point was that we needed to make this mocking or test setup easy, because data scientists and ML modellers are the most important personas who, ideally, should be writing these tests. So mocking the data source with an abstraction layer and passing pandas DataFrames worked reasonably well for our use case.
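To make that concrete, here is a rough sketch of what such an abstraction layer could look like. It is not our actual code: `DataSource`, `InMemorySource`, `average_order_value`, and the `orders` table are all made-up names for illustration. The idea is just that the logic under test only sees an interface that returns pandas DataFrames, so a test can hand it an in-memory fake and never needs a Spark session.

```python
from __future__ import annotations

from typing import Protocol

import pandas as pd


class DataSource(Protocol):
    """Hypothetical interface: anything that can return a table as a pandas DataFrame."""

    def load(self, table: str) -> pd.DataFrame: ...


class InMemorySource:
    """Test double that serves pre-built DataFrames instead of querying Spark."""

    def __init__(self, tables: dict[str, pd.DataFrame]) -> None:
        self._tables = tables

    def load(self, table: str) -> pd.DataFrame:
        # Return a copy so a test cannot mutate the fixture by accident.
        return self._tables[table].copy()


def average_order_value(source: DataSource) -> float:
    """Example logic under test; it depends only on the DataSource interface."""
    orders = source.load("orders")
    return float(orders["amount"].mean())


def test_average_order_value() -> None:
    # No Spark session needed: the fake source is plain pandas.
    fake = InMemorySource({"orders": pd.DataFrame({"amount": [10.0, 20.0, 30.0]})})
    assert average_order_value(fake) == 20.0
```

In production, a separate Spark-backed class would implement the same `load` method, so that class is the only place where Spark is needed.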