The closure (闭包) of a set \(\text{cl}(C)\) is the collection of its limit points.
A set in a linear space is convex, if any line segment between two vectors in the set is also in the set. {Bertsekas2003, 1.2.1} Convex hull (凸包) \(\text{conv}(C)\) is the smallest (intersection of all) convex set containing a set. Convex combination is a nonnegative weighted average of vectors. In comparison, nonnegative combination is a linear combination with nonnegative coefficients. Convexity is closely related to expectation: If \( P(X \in C) = 1 \) and \( C \subseteq \mathbb{R}^n \) is convex, then \( \mathbf{E}X \in C \).
Affine set is a shifted subspace. Affine hull (仿射包) is the smallest (intersection of all) affine set containing a set. The affine dimension of a set is the dimension of its affine hull. A set of vectors are affinely independent if when picking one as the origin, the rest are linearly independent. The number of affinely independent vectors in a set cannot be more than one plus the affine dimension of these vectors, if finite. The relative interior of a set is the intersection of its interior and its affine hull.
A set is a cone, if it is scale invariant. A cone is pointed if it contains no line. A cone is solid if it has nonempty interior. A proper cone (真锥) is a pointed, closed solid, convex cone. Generated cone \(\text{cone}(C)\) is the cone generated by a set is the nonnegative span of it. A vector is a direction of recession of a convex set, if arbitrary shifts of the set along this vector are within the set. The recession cone of a set is the cone generated by its recession directions.
A generalized inequality is the partial ordering on \(\mathbb{R}^n\) associated with a proper cone \(K\). Vector inequality is a special case of generalized inequality where the associated cone is the non-negative cone.
\[\begin{align} x \preceq_K y, y \succeq_K x &\Leftrightarrow y−x \in K \ x \prec_K y, y \succ_K x &\Leftrightarrow y−x \in \text{int}(K) \end{align}\]
An element of a multidimensional set is the minimum element (最小元) w.r.t. a generalized inequality \(\preceq_K\), if it is no greater than any other element in the set in such partial ordering: \(S \subseteq \{x\} + K\). Similarly, An element of a multidimensional set is the maximum element w.r.t. a generalized inequality \(\succeq_K\), if it is no less than any other element in the set in such partial ordering: \(S \subseteq \{x\} - K\).
An element of a multidimensional set is a minimal element (极小元) w.r.t. a generalized inequality \(\preceq_K\), if it is no less than any element in the set in such partial ordering: \((\{x\} - K) \cap S = \emptyset\). Similarly, An element of a multidimensional set is a maximal element w.r.t. a generalized inequality \(\succeq_K\), if it is no greater than any element in the set in such partial ordering: \((\{x\} + K) \cap S = \emptyset\).
The dual generalized inequality associated with a proper cone is the generalized inequality associated with its dual cone. This concept can be used for a dual characterization of the minimum/minimal elements. An element of a multidimensional set is the minimum element w.r.t. a generalized inequality \(\preceq_K\), if for all elements \(\lambda\) in the interior of the dual cone \(K^\) it is the unique minimizer for \(\langle \lambda, \cdot \rangle\) in the set. An element of a multidimensional set is a *minimal element** w.r.t. a generalized inequality \(\succeq_K\), if it minimizes \(\langle \lambda, \cdot \rangle\) over the set for an element \(\lambda\) in the interior of the dual cone.
An extended real-valued function is a function whose range is the extended real line. The extended-value extension \(\tilde{f}: \mathbb{R}^n → \mathbb{R} ∪ \{+\infty\}\) of a convex function \(f : C \to \mathbb{R}\) is the function defined in its containing (Euclidean) space that takes value \(+\infty\) outside its original domain.
The graph of a function is the collection of point-value pairs: \( \{ (x, f(x)) | x \in \text{dom}f \} \). The epigraph (supergraph) of a function is the collection of points on and above its graph: \( \{ (x, y) | x \in \text{dom}f, y \ge f(x) \} \).
Consistent definitions of convex functions:
A function is concave if its negative is a convex function. While concave set is simply any non-convex set, concave functions exactly mirror convex functions.
A level curve is the feasible set of an equality constraint. A level set is the feasible set of an inequality constraint. A sub-level set is the feasible set of a less-than-or-equal-to constraint. A function is quasi-convex if all its sub-level sets (and thus its domain) are convex. A function is quasi-concave if its negative is quasi-convex. A function is unimodal if it is quasi-convex or quasi-concave. A function is quasilinear if it is both quasi-convex and quasi-concave.
A positive function is log-concave (log-concave) if its logarithm is concave (convex).
Given a proper cone \(K\) in its domain space, a real-valued function is \(K\)-nondecreasing if it is dominated by values in its \(K\)-cone: \( \forall x, y, x \preceq_K y : f(x) \le f(y) \). The function is \(K\)-increasing if it is strictly dominated by values in its K-cone other than the vertex: \( \forall x \ne y, x \preceq_K y : f(x) < f(y) \). \(K\)-nonincreasing and \(K\)-decreasing are similarly defined. As a special case, a real-valued function on the space of symmetric matrices is matrix increasing (decreasing) if it is increasing (decreasing) with respect to the positive semidefinite cone.
Given a proper cone \(K\) in its range space, a vector-valued function is \(K\)-convex if every line segment of the function is \(K\)-dominated by its linear interpolation: \( \mathbf{f}(\boldsymbol\lambda \cdot \mathbf{x}) \preceq_K \boldsymbol\lambda \cdot \mathbf{f}(\mathbf{x}) \). The function is strictly \(K\)-convex if the inequality is strict except on the endpoints. As a special case, a symmetric matrix valued function is matrix convex if it is convex with respect to the positive semidefinite cone.
Given the dual pair of vector spaces \(X, Y\) w.r.t a bilinear form \( \langle \cdot, \cdot \rangle \), functions \( f: X \to \mathbb{R} \) and \( f^: Y \to \mathbb{R} \) are *conjugate** if their sum is supported from below by the bilinear form pointwise in the dual space: \( f(x) + f^(y) \ge \langle y,x \rangle \) (Young's inequality), and \( \forall y \in Y, \exists x \in X: f(x) + f^(y) = \langle y,x \rangle \). Equivalently, \( f^(y) = \sup_{x \in X} \{ \langle y,x \rangle - f(x) \} \), also known as *Fenchel transformation** {Fenchel1949}. If a function is smooth, strictly convex, and increases at infinity faster than a linear function, its conjugate is equivalent to its Legendre transformation {Legendre1789}: \( f^∗(y) = \langle x^∗, \nabla f(x^∗) \rangle − f(x^∗) \), where \( y = \nabla f(x^∗) \). In case of notation conflict with optimal values, use star \(\star\).
Common convex sets:
Common convex functions:
Common non-trivial log-concave and log-convex functions:
Common matrix monotone functions: (Real-valued functions over the symmetric matrix space.)
Common matrix convex/concave functions:
Set relations: \( C \subseteq \text{cl}(C) \subseteq \text{conv}(C) \subseteq \text{cone}(C) \)
Operations that preserves set convexity: {Bertsekas2003, 1.2.1}
Properties of convex set: {Bertsekas2003, 1.4.1}
The dual cone of a proper cone is also proper.
Generated cone is convex.
Recession cone theorem {Bertsekas2003, 1.5.1}: For a (nonempty) closed convex set,
Caratheodory's theorem {Bertsekas2003, 1.3.1}: (minimal representation of generated cone and convex hull)
Properties of generalized inequalities:
For a generalized inequality, minimum and maximum elements of a set are unique if they ever exist, while a set can have multiple minimal and maximal elements.
Non-decreasing in each argument is equivalent to \(K\)-nondecreasing where \(K\) is the non-negative cone.
First-order conditions for \(K\)-monotonicity: A differentiable function on a convex domain is \(K\)-nondecreasing iff its gradient is non-negative in \(K\)-dual inequality: \(\nabla f(x) \succeq_{K^∗} 0\); It is \(K\)-increasing if its gradient is positive in \(K\)-dual inequality: \(\nabla f(x) \succ_{K^∗} 0\);
First-order conditions for \(K\)-convexity: A differentiable function on a convex domain is \(K\)-convex iff the function globally \(K\)-dominates its linear expansion: \( f(y) \succeq_K f(x) + Df(x)(y−x) \), here \( f: \mathbb{R}^n \to \mathbb{R}^m \) and \( Df(x) \in \mathbb{R}^{m \times n} \). This proposition also holds when both sides are strict.
Composition theorem: A \(K\)-nondecreasing convex function of a \(K\)-convex function is convex. Note the monotonicity restriction is on the extended-value extension of the outer function.
A convex function can have discontinuities only on its relative boundary.
A function is affine iff it is simultaneously convex and concave.
Operations that preserves (vector-valued) function convexity (or concavity): {Bertsekas2003, 1.2.4}
Characterization of differentiable convex functions:
Operations that preserve quasi-convexity:
A quasilinear function has convex level sets; the converse does not hold.
A quasi-convex function \(f(x)\) can be represented by a family of convex functions \( \phi_t(x) \), where the \(t\)-sublevel set of \(f\) is the 0-sublevel set of \(\phi_t\). Such representation is not unique.
Relations:
Properties:
Transformation properties of conjugation:
A function is closed if its epigraph is a closed set; or equivalently, if all its sub-level sets are closed. The conjugate function of any function is closed and convex. Almost every convex function can be expressed as the conjugate of some function.
An extended real-valued function is lower semi-continuous at a point if the function values in the point's neighborhood are either close to or greater than the function value at that point: \( \liminf_{x \to x_0} f(x) \ge f(x_0) \). The conjugate of a function is always lower semi-continuous.
The convex closure of a function is the function whose epigraph is the closure of the convex hull of the original function. The conjugate of the conjugate (twice conjugate) of a function is its convex closure: \( \text{epi}f^{} = \text{cl}(\text{conv}(\text{epi}f)) \), noticing that \( f^{∗∗} \le f \). **Fenchel–Moreau theorem: If a function is convex and closed, its twice conjugate is itself: \( f^{∗∗} = f \).