by reckless 1333 days ago
I feel like the comments on backwards compatibility are due to the absolute shitshow of TF2 compatibility for TF1 code and models.

Also, the threat of PyTorch can be seen when reading between the lines, especially since it's now run by a foundation and is the darling of diffusion model development.

1 comment

PyTorch has been the darling of almost every noteworthy open-source model for the past 3-4 years (BLOOM, GPT-J, StyleGAN3, Detectron, etc.). Personally, I've only seen people use TensorFlow/XLA if they got free TPU credits from Google (GPT-Neo), or if the model was released by Google (T5).
GPT-J is actually JAX.
My mistake, I was thinking of GPT-NeoX. Thanks for the correction.