You can implement it with something like this and simply train it against a watercolor image: https://github.com/yusuketomoto/chainer-fast-neuralstyle
Haven’t tried it with Stable Diffusion, but you’d probably have more control and get better results with a CNN like the one I linked.
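
In case it helps to see what "train it against a watercolor image" means in practice: fast style transfer networks of this kind are typically trained to match Gram matrices of CNN feature maps taken from the style image. Here is a rough NumPy sketch of just that style loss. It is not code from the linked repo; the function names are my own, and random arrays stand in for the feature maps a pretrained network would produce.

```python
import numpy as np

def gram_matrix(features):
    """Channel-by-channel correlations of a (C, H, W) feature map, shape (C, C)."""
    c, h, w = features.shape
    flat = features.reshape(c, h * w)
    # Summing over all spatial positions discards layout and keeps texture statistics.
    return flat @ flat.T / (c * h * w)

def style_loss(generated_features, style_features):
    """Mean squared difference between the two Gram matrices."""
    diff = gram_matrix(generated_features) - gram_matrix(style_features)
    return float(np.mean(diff ** 2))

# Hypothetical stand-ins for feature maps; a real setup would extract these
# from a pretrained CNN for the watercolor image and for the network output.
rng = np.random.default_rng(0)
style = rng.standard_normal((8, 16, 16))
output = rng.standard_normal((8, 16, 16))

print(style_loss(style, style))   # identical features give zero loss
print(style_loss(output, style))  # different features give a positive loss
```

Because the Gram matrix throws away where things are in the image, minimizing this loss pushes the output toward the watercolor's textures and colors rather than copying its composition.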