With a similar probability focus, Pattern Recognition and Machine Learning by Christopher Bishop [1] is pretty good. If you are looking into deep learning specifically, I think François Chollet's Deep Learning with Python is one of the most accessible books.
I think "Understanding Deep Learning" is very nice - https://udlbook.github.io/udlbook/ (an covers almost all topics, it has maybe just a couple of omissions, such as Multimodal Learning, NERFs and Time Series Prediction)
[1] https://www.microsoft.com/en-us/research/uploads/prod/2006/0...