태그 보관물: Optimization

One Dimensional Optimization

One Dimensional Optimization

$$
\min_{\lambda \geq 0} \{f(x_i + \lambda h_i) \} \Rightarrow \textit{Let} \;\; \phi(\lambda)= f(x_i + \lambda h_i) – f(x_i) \Rightarrow \min_{\lambda} \phi(\lambda)$$

OD_01

Golden Search Method

Assumption

$g(\cdot)$ is unimode i.e. $\exists \hat{\lambda}$ such that $g'(\hat{\lambda}) = 0$ (It means that there exists unique golden minimum)

Observation

  • Suppose that we have additional point $a’, b’$ such that $a < a' < b' < b$. Either,

    $$
    \phi(a’) \leq \min \{ \phi(a), \phi(b) \} \;\;\textit{or}\;\;\phi(b’) \leq \min \{ \phi(a), \phi(b) \}$$

    OD_02

For $\phi:[a, b] \rightarrow \mathbb{R}$ and the global minimizer $\hat{\lambda} \in [a,b]$

  • $[a, b] \rightarrow [X]$ What is $X$
    • $X$ is smaller than $[a, b]$
    • $[X] \subset [a, b]$
    • $\hat{\lambda} \in [X]$

From Assumption and Observation

case 1

$\phi(a’) \leq \min \{ \phi(a), \phi(b)\}$

  • Suppose that $\phi(a’) \leq \phi(b’)$ then, by mean value theorem, $\exists \lambda_i \in [a’, b’]$ such that $\phi'(\lambda_i) \geq 0$. Since $\phi(a’) \leq \phi(a)$, by the mean value theorem$\exists \lambda_2 \in [a, a’]$ such that $\phi(\lambda_2) \leq 0$, thus

    $$
    a \leq \lambda_2 \leq a’ \leq \lambda_1 \leq b’ \leq b$$

    so pick $[a, b’]$ as next interval.

case 2

  • Suppose that $\phi(a’) > phi(b’)$. By the mean value theorem$\exists \lambda_1 \in [a’, b’], \; \lambda_1 \in [b, b’]$ such that $\phi'(\lambda_1) \leq 0, \; \phi'(\lambda_1) \geq 0$, ($\phi(b’) \leq \phi(b)$), Since $\hat{\lambda} \in [\lambda_1, \lambda_2$, and

    $$
    a < a' \leq \lambda_1 \leq b' \leq \lambda_2 \leq b$$ pick $[a’, b]$

  • Given $[a_i, b_i]$ find $a’_i, b’_i$ such that

    $$
    \begin{align}
    \phi (a’_i) &= \min \{ \phi (a_i), \phi (b_i) \} \\
    \phi (b’_i) &= \min \{ \phi (a_i), \phi (b_i) \}
    \end{align}
    \Rightarrow$$

    $$
    \begin{align}
    \textit{if} \;\; \phi (a_i) \leq \phi (b_i) \;\; \textit{then} \;\; [a_{i+1}, b_{i+1}] = [a_i, b’_i] \\
    \textit{if} \;\; \phi (a_i) \geq \phi (b_i) \;\; \textit{then} \;\; [a_{i+1}, b_{i+1}] = [a’_i, b_i]
    \end{align}$$

Golden Search Algorithm

OD_03

$$
\begin{align}
l_{i+1} &= F l_i \;\;\; F \in (0,1) \\
l_{i+1} – (1 _F) l_i &= (1 – F) l_{i+1}
\end{align}$$

Since $(1 – F) l_i = l_i – l_{i+1}$

$$
F l_i – (1-F) l_i = (1-F)F l_i \implies F – 1 + F = F – F^2 \implies F^2 + F – 1 =0, \;\; F = 0.618$$

$F = 0.618 $ 에서 Golden Search

Procedure of Golden Search Algorithm

Procesdure Processing
Data $x_o \in \mathbb{R}$
Step 0 Compute a bracket $[a_0, b_0]$ containing $\hat{\lambda}$, the minimizer of $\phi(\lambda)$ and Set $i=0$ (초기조건 $\phi(a_0) < 0, \phi(b_0) > 0$
Step 1 Set $l_i = b_i – a_i$ and Compute
$ a’_i = a_i + (1 – F) l_i$
$ b’_i = b_i – (1 – F) l_i$
Step 2 If $\phi(b’_i) \leq \phi(a’_i)$ set $a_{i+1} = a’_i, b_{i+1} = b_i$
else $\phi(b’_i) > \phi(a’_i)$ set $a_{i+1} = a_i, b_{i+1} = b’_i$
Step 3 Set i++ and goto step 1

Successive Quadratic Interpolation (SQI)

$$
\min_{\lambda \in \mathbb{R}_{+}} \phi(\lambda), \;\;\; \phi(\lambda) = f(x + \lambda h)$$

Assumption

$\phi(\cdot)$ is continuously differentiable and unimodal with unique minimizer $\hat{\lambda}$ ($\exists \hat{\lambda} \implies \phi'(\hat{\lambda}) = 0$)

Note

Given three distict point $z_1 < z_2 < z_3$, we can construct a inique quadaratic polynomial.

$$
q(\lambda) = a_1 \lambda^2 + a_2 \lambda + a_3 \;\;\textit(such that)\;\; q(z_1) = \phi(z_i) \;\; i=1,2,3$$

Largrangian Interpolation formula

$$
q(\lambda) = \phi(z_1) \frac{(\lambda – z_2)(\lambda – z_3)}{(z_1 – z_2)(z_1 – z_3)} + \phi(z_2) \frac{(\lambda – z_1)(\lambda – z_3)}{(z_2 – z_1)(z_2 – z_3)} + \phi(z_3) \frac{(\lambda – z_1)(\lambda – z_2)}{(z_2 – z_1)(z_3 – z_2)}$$

  • Given two distict points $z_1 < z_3$
    We can construct interpolating polynomial $q(\lambda)$ such that

    $$
    q(z_i) = \phi(z_i), \;\; i=1,3 \;\;\textit{and}\;\; q'(z_i) = \phi'(z_i), \;\; i=1,3$$

Case 1 : $z_1 = z_2 < z_3$

OD_04

$$
q(\lambda) = \phi(z_1) \frac{(\lambda – z_3)^2}{(z_1 – z_3)^2} + \phi(z_3) \frac{(\lambda – z_1)^2}{(z_3 – z_1)^2}$$

  • case 1-1

    $$
    \phi'(z_1) \leq 0, \;\; \phi(z_3) \geq \phi(z_1) \;\;\; z_1 = z_2 < z_3$$

  • case 1-2

    $$
    \phi'(z_3) \geq 0, \;\; \phi(z_1) \geq \phi(z_3)$$

Case 2 : $z_1 < z_2 = z_3$

Replace $z_1$ by $z_3$ and $z_3$ by $z_1$
OD_05

$$
\phi(z_2) = \min \{ \phi(z_1), \phi(z_3) \}$$

Conclusion

Let $z = (z_1, z_2, z_3) \in \mathbb{R}^3$, then the possible set of a vector $z$ such that $z_1 \leq \hat{\lambda} \leq z_3$ is

$$
\begin{align}
T &= \{ z \in \mathbb{R}^3 | \phi(z_2) \leq \min \{ \phi(z_1), \phi(z_3)\}, z_1 < z_2 < z_3 \} \\ &\cup \{ z \in \mathbb{R}^3 | z_1 = z_2 < z_3, \;\;\textit{and}\;\; \phi'(z_1) \leq 0, \;\;\textit{and}\;\; \phi(z_1) \leq \phi(z_3) \} \\ &\cup \{ z \in \mathbb{R}^3 | z_1 < z_2 = z_3, \;\;\textit{and}\;\; \phi'(z_3) \geq 0, \;\;\textit{and}\;\; \phi(z_1) \geq \phi(z_3) \} \\ &\cup \{ z \in \mathbb{R}^3 | z_1 = z_2 = z_3 = \hat{\lambda} \} \end{align}$$ 즉, $q(\lambda|z) = q(\lambda)|_{(z_1, z_2, z_3)}$

<

p class=”mathjax”>$$
\hat{\lambda} = \arg \min q(\lambda|z)$$

OD_06