by rwitten
949 days ago
This is a great series of questions, and it isn't our goal to prove you wrong! We work with customers interested in training models who run their own ablations, including over batch size and learning rate. Based on that, we demonstrate workloads that we think will be interesting to potential customers. Absolutely agreed that this workload uses a larger batch size than the public literature suggests.