Hacker News
by guishen90 2204 days ago
Sounds interesting! I'm a bit confused about how you can profile the system's performance automatically. If I understand correctly, the profiling process needs a client to mock real requests in a production environment, and the clients for serving deep learning models vary from one model to another.
1 comment

Yes. Actually, for clients of deep learning models, the only difference is the I/O type of the test data. There are image classification models, language models, and so on, but all of the inputs have to be converted to tensors, so we generate the test data according to the format of the model's input. We have some clients in the hub that you can use if you want to profile models automatically when you register them in the database. We also provide a parent client class that you can subclass to implement your own.
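
To make the idea concrete, here is a rough sketch of what such a parent client class might look like. This is not the project's actual API: the names `BaseProfilerClient`, `random_tensor`, `make_batch`, `send`, and `profile` are all hypothetical, and the "tensor" is a plain nested list so the example runs with only the standard library. A real client would presumably build framework tensors and send them to a serving endpoint.

```python
import random
import statistics
import time
from abc import ABC, abstractmethod


def random_tensor(shape):
    """Build a nested list of random floats with the given shape (stand-in for a tensor)."""
    if not shape:
        return random.random()
    return [random_tensor(shape[1:]) for _ in range(shape[0])]


class BaseProfilerClient(ABC):
    """Hypothetical parent client: generates test data from the model's input
    shape and times each request. Subclasses only decide how to send a batch."""

    def __init__(self, input_shape, batch_size=1):
        self.input_shape = tuple(input_shape)
        self.batch_size = batch_size

    def make_batch(self):
        # Test data follows the model's declared input format.
        return random_tensor((self.batch_size, *self.input_shape))

    @abstractmethod
    def send(self, batch):
        """Send one batch to the served model and return its response."""

    def profile(self, n_requests=20):
        latencies = []
        for _ in range(n_requests):
            batch = self.make_batch()
            start = time.perf_counter()
            self.send(batch)
            latencies.append(time.perf_counter() - start)
        return {
            "requests": n_requests,
            "mean_latency_s": statistics.mean(latencies),
            "max_latency_s": max(latencies),
        }


class FakeImageClient(BaseProfilerClient):
    """Illustrative subclass: a local stub instead of a real serving endpoint."""

    def send(self, batch):
        # Pretend the model returns one class id per input in the batch.
        return [0 for _ in batch]


client = FakeImageClient(input_shape=(3, 4, 4), batch_size=2)
report = client.profile(n_requests=5)
print(report["requests"], report["mean_latency_s"])
```

In this sketch the data generation and timing live in the parent class, so a client for a different model type only has to override `send`.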