Nice job, OP. I noticed that you used word2vec for the embedding space. Do you think more modern approaches like seq2seq or BERT might give better performance?
Yeah, I'm currently looking into them. I'm new to the field and exploring existing solutions to see what works best and is also computationally feasible, since most end users would have limited access to powerful hardware.