|
|
|
|
|
by claytonjy
3032 days ago
|
|
Thanks for this; I really appreciate the detail here. There seems to be a lack of these kinds of explanations around. One piece did give me a bit of surprise: You might also need to implement the inference code as well to get the speed you need
I've never had the super-low-latency requirements you have, but as you point out this seems amazingly error-prone. I'd love to hear anything else you can share about the cost-benefit analysis you do before deciding to go this route, and if there's any tools or languages you've had a better time with than others for this. |
|
Also like GP, I used to work in advertising. The low latency bits are because RTB servers want you to respond within a certain amount of time. When I was in advertising, our solution was to precalculate a whole bunch of things, and throw them into redis. It was then a simple lookup of the hash of a vector (for bidding related stuff)