Yes, point 2 is really the elevator pitch of Kalman filters. It enables sensor fusion, averaging a fast noisy sensor with a slow accurate sensor, or even add a model as one less confident input to the filter.
As pointed out elsewhere in this thread, demonstrating a Kalman filter with only one input doesn’t really show their real potential.
I think state of the art can work with something different than Gaussian distribution, either in the input data or the predicted one (which, with non linear models can be very unregular). Isn't that the point of the unscented kalman filter and all the ones that generate lots of hypotheses to check the target distribution. I probably don't use the correct vocabulary here... Sorry.
But doesn't that only describe one part of it. Since it also gives you a way to automatically figure out the "how confident you are" about each value over time ?
As pointed out elsewhere in this thread, demonstrating a Kalman filter with only one input doesn’t really show their real potential.