The use of novel puzzles is frankly awesome: there's a much lower chance of training-data contamination from previously published puzzles, so we get to see how much generalization the models have actually achieved.
GPT-4 says: A more accurate and balanced title might be: "Comparing Bard and ChatGPT in Puzzle Solving: An Examination within the Context of a Word Game"