Characterization of strict convexity [Convex functions]

Characterization of strict convexity

Recall that a function \(f:\mathbb{R}^{n}\rightarrow \mathbb{R}\) is strictly convex if

\(\hspace{2cm}∀x,y,x≠y,∀λ∈(0,1),\)\(f(λx+(1-λ))<λf(x)+(1-λ)f(y).\)

Like we mentioned before, if \(f\) is strictly convex, then \(f\) is convex (this is obvious from the definition) but the converse is not true, take for example \(f(x)=x,x∈\mathbb{R}^{n}.\)

Second order sufficient condition:

\(\hspace{5cm}∇^{2}f(x)>0,∀x∈Ω⇒f\) strictly convex on \(Ω\).

The converse is not true through.

First order characterization

A function \(f\) is strictly convex on \(Ω⊆\mathbb{R}^{n}\) if and only if

\(\hspace{5cm}f(y)>f(x)+∇f^{T}(x)(y-x),∀x,y∈Ω,x≠y.\)

There are similar characterizations for strong convex functions. For example, \(f\) is strongly convex if and only if there exists \(m>0\) such that
\(\hspace{1cm}f(y)≥f(x)+∇^{2}f(x)(y-x)+m‖y-x‖^{2},∀x,y∈dom(f),\)

or if and only if there exists \(m>0\) such that \(∇^{2}f(x)≥mI,∀x∈dom(f).\)

One of the main uses of strict convexity is to ensure uniqueness of the optimal solution.