Hamilton Jacobi Bellman Equation
- The state equation
$$
\dot{x} = f(t, x(t), u(t)) \tag{1}$$ - The performance measure
$$
J = h(x(t_f), t_f) + \int_{t_0}^{t_f} F(\tau, x(\tau), u(\tau)) d\tau$$
The induction of the HJB equation
The performance measure is a function on $x(t), t$, and $u(t)$. However, we want to find a control that minimize the performance measure. Therefore, the optimal performance measure based on the optimal control $u(t)$ is as follows :
$$
J^*(x(t),t) = \underset{\underset{1 \leq \tau \leq t_f}{u(\tau)}}{\min} \left\{ J^*(x(t),t) + \frac{\partial J}{\partial t} \Delta + \frac{\partial J}{\partial x} \left(x(t+\Delta) – x(t)\right) + \int_{t}^{t+\Delta t} F(\tau, x(\tau), u(\tau)) d\tau \right\}$$
by the expansion of the taylor series
Now for small $\Delta t$, since the performance measure $J$ is only dependent on $x(t)$ and $t$, we can obtain
$$
J^*(x(t),t) = \underset{\underset{1 \leq \tau \leq t_f}{u(\tau)}}{\min} \left\{ J^*(x(t),t) + \frac{\partial J^*}{\partial t} \Delta t + \frac{\partial J^*}{\partial x} f(t,x,u) \Delta t + F \Delta t \right\}$$
(By $x(t+\Delta) – x(t) =\Delta x = f(t, x(t), u(t)) \Delta x$))
$$
0 = \frac{\partial J^*}{\partial t} \Delta t + \underset{\underset{1 \leq \tau \leq t_f}{u(\tau)}}{\min} \left\{\frac{\partial J^*}{\partial x} f(t,x,u) \Delta t + F \Delta t \right\}
\label{Ham_01}\tag{2}$$
Since the Hamiltonian $H$ can be defined as follows (즉 $\min$ 안에 있는 것이 Hamiltonian 실제 핵심인 이것만 최적화 시켜야 하며 그것이 Hamiltonian 최적화의 핵심 아이디어임 )
$$
H(t, x, u, \frac{\partial J^*}{\partial x}) = F(t, x, u) + \frac{\partial J^*}{\partial x} f(t,x,u)$$
그러므로 최적 $J^*$ 에 대한 Hamiltonian은
$$
H(t,x,u^*(x(t), \frac{\partial J^*}{\partial x}, t), \frac{\partial J^*}{\partial x}) = \underset{\underset{1 \leq \tau \leq t_f}{u(\tau)}}{\min} H(t, x, u, \frac{\partial J^*}{\partial x}) \label{Ham_02}\tag{3}$$
The HJB equation is defined as follows from ($\ref{Ham_01}$), ($\ref{Ham_02}$)
$$
0 = \frac{\partial J^*}{\partial t} + H(t, x, u^*(x(t), \frac{\partial J^*}{\partial x}), \frac{\partial J^*}{\partial x})$$