If it works with a small model I can run locally, I might think of this approach, otherwise I'll skip