Completely agree on the reliability front...but I don't think mentioning it on some guy's 3rd party GitHub project is going to help all that much with that.
Yes, fair enough. I was just venting some frustration on how brittle and unstable Claude is proving to be. For all the warts that ChatGPT have, at least in comparison is reliable and rock-solid. Outputting higher-quality results in synthetic benchmarks might be nice but it's meaningless if the service is unusable.