LLMs are really sensitive to bad or even slightly ambiguous grammar. I wonder if the numbers would differ significantly with a prompt like "Reply only with the tags, in the following format".
The semantics of the topics/tags could surely be improved with a more detailed prompt.