Hacker News new | ask | show | jobs
by doppioandante 1753 days ago
The term nonlinear system here must be understood from the point of view of control engineering. In control theory, a system is usually described in a white-box manner by a differential equation and an observation equation: dx/dt = f(x, u, t)

y = h(x, u, t)

x is a vector of the so called state-variables, while y is the output vectors which usually represents what we can measure about the system, u is a function which represents the inputs to the system, and t is the time.

A very useful and simple modeling tool is that of linear time-invariant systems (LTI), where the above equations are specifically

dx/dt = Ax+Bu

y = Cx+Du

where (A,B,C,D) are matrices. Every bit of theory is known about LTI systems: how to control them, global properties, etc. Nonlinear systems are, loosely speaking, dynamical systems which are not linear in the above sense. Depending on the specific nonlinear system form, much less can be said than the linear system case.

1 comments

Thanks. So we are talking about ODE in state-space form, right?

So in my example, if x was a (largish) time constant integer (f(x,u,t) = 0) and the observation function was

h(x,u,t) = sha256sum(x,u)

What could a control theorist tell about that (nonlinear) system?

I think you might also be confused about the difference between system identification and observability.

The problem of constructing a model of a system's dynamics is system identification [0].

A state observer [1] tries to determine the internal state of the system given measurements of a system's inputs outputs and a model of its dynamics. The system is observable [2] if the state observer can fully reconstruct the internal state. One motivation for checking whether a system is observable is to determine whether a given combination of sensors (corresponding to a particular set of outputs) is sufficient to for the internal state to be determined, or whether more should be added.

The state observer of a process is another dynamical system whose inputs are the process input and process output, and whose output is an estimate of the process' internal state; if the system is observable, then this estimate converges to the true state over time. This whole formalism doesn't really apply to something like your example, which has no dynamics.

If you have the code for the hash function in your example, then determining the values that its internal variables take for an arbitrary input is trivial; if you don't then it is very difficult. In either case, your problem is different to the problem of observabilty in control theory.

[0]: https://en.wikipedia.org/wiki/System_identification

[1]: https://en.wikipedia.org/wiki/State_observer

[2]: https://en.wikipedia.org/wiki/Observability

Observability is interested in asking if you, as an external observer, are given a set of inputs and measurements are you able to recover the initial state of the system?

While identiability typically is just a person running an experiment and saying "hey look these state values are unable to be recovered / estimated when I ran the algorithm". This is why I never understood why this analysis was very helpful as it isn't very useful outside of the specific experiment / motion / inputs used.

For example from [1] has these reasonably straightforward definitions.

  Observability is a structural property of a control system dened as the possibility to deduce the state of the system from observing its input-output behaviour

  Identiability is the possibility to identify the parameters of a control system from its input-output behaviour.

  Parameter identiability is a special case of the observability problem.

[1] http://www.math.chalmers.se/Math/Research/Preprints/Doctoral...
Yes: both observability and parameter identifiability assume knowledge of the model structure; system identification includes methods for choosing a model structure for a black-box system.

> This is why I never understood why this analysis was very helpful

If a system is not identifiable/observable, there may be multiple values for the parameters/state (respectively) that are consistent with observations; if you try to infer these without checking for identifiability/observability, you may obtain values that are consistent with observations but very different to the true values, which can result in serious errors when you try to do something using them.

> it isn't very useful outside of the specific experiment / motion / inputs used

There are cases where some parameters cannot be identified regardless of what inputs are provided to the system. For example, when you write down the differential equations governing the system in a particular way, you might see that two parameters (k1 and k2) always appear together as a product (k1 * k2), never independently - so their individual values are not identifiable, though (k1 * k2) is; this could be fixed by re-parametrising, by replacing k1 * k2 by k3.

Also, some parameters have very little effect on the behaviour of a system, so are weakly constrained by observations ("sloppiness"), whilst others are more tightly constrained - it can useful to know which are which.

sha256sum is not a continuous function. These functions should be continuous.

Usually there are light restrictions on these functions to avoid what you are trying to do. Lyapunov bounded, C2 continuous, ...

Control engineers could use a piecewise approximation of an observer or controller, or use non-linear methods (like reinforcement learning, model predictive control, LQR, ...)

Thanks. That makes a lot of sense, indeed.
...nothing? Why makes you think they should be able to say anything?

I'm so perplexed by your question... you seem already quite familiar with ordinary differential equations and state-space models... which are concerned with the observability of continuous-time models, right? And yet you pose a counterexample using a cryptographic hash function, which is a (very!) discrete function, and one that is specifically designed to be unobservable? Isn't that even more jarring than asking physicists to predict human life expectancy from the Standard Model? What makes you think the answer might be anything other than "no prediction"...?

Sometimes someone asks a question in earnest. They really don’t know, and want to know more. They aren’t trying to score points or win a debate or strike a rhetorical victory.

I had just the same question as the commenter. I just don’t know much about this topic, and think that this question and its answers really helped my understanding.

I did not expect a positive answer here. I wanted to know what the implicit constraints are so I chose an example that was obviously impossible.

I worked across domains for a while (CS, simulation, mathematics) and one thing that was really problematic was that people from one domain used a lot of implicit constraints that the other partners weren't aware of. This then led to incorrect software and weird simulation results.