by yosai
668 days ago
We have a team of domain experts who vet the instruction dataset. We do typical RLHF (reinforcement learning from human feedback) and connect it back to our SFT (supervised fine-tuning) loop. That's why we call ourselves hardware and human in the loop. Humans play an important role in ensuring the quality and accuracy of our dataset.