|
|
|
|
|
by mnkv
259 days ago
|
|
> the generation of 281,128 augmented examples, from which 1,000 were
held out as a benchmark test set. This model is trained on a custom dataset of 280k examples then tested on 1k very similar examples from the same dataset. Of course it is specialized to outperform general models on this specific task in this specific domain with this specific json format for output. This is a reasonable hobby project and interesting approach to synthetic data generation but not impressive research. At minimum you should test your model on other benchmarks that have similar tasks e.g. docbench |
|
It is probably obvious to most who follow the space closely, but you'd be surprised how many engineers don't recognize this.