by sbierwagen 1023 days ago
phi-1 supposedly gets 50.6% pass@1 on HumanEval with 1.3B parameters (Python only): https://arxiv.org/abs/2306.11644

Weights haven't been released, though.

2 comments

phi-1 is a code-specific base model, with further finetuning on top of that. This is a general language base model, not really comparable.
No code or dataset has been released for phi-1 either.