One of the samples had vocals. Could the approach be used to create solely vocals?
Could it be used for speech? If so, could the speech be directed or would it be random?