Principle: Haifengl Smile Linear System Solving
Overview
Linear system solving is the process of finding a vector x that satisfies the equation Ax = b, where A is a known matrix and b is a known vector. This is one of the most fundamental problems in scientific computing, appearing in regression, optimization, physics simulation, signal processing, and machine learning.
Smile provides two classes of solvers:
- Direct solvers that first decompose A (LU, QR, Cholesky, SVD) and then solve via substitution. These are exact (up to floating-point precision) and have predictable cost.
- Iterative solvers (Biconjugate Gradient) that approximate the solution through successive refinement. These are efficient for large sparse systems where direct decomposition is too expensive.
Theoretical Basis
Direct Solving via Decomposition
Direct methods transform the original system into a sequence of easily solvable triangular systems.
LU Solve
Given PA = LU, solving Ax = b proceeds in three steps:
- Permute: compute Pb (reorder b according to the pivot vector)
- Forward substitution: solve Ly = Pb, where L is lower triangular
- Back substitution: solve Ux = y, where U is upper triangular
Complexity: O(n^2) for the solve phase (after factorization).
LAPACK routine: dgetrs / sgetrs
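The three steps above can be sketched in standalone Java. This is an illustrative toy (class and method names are hypothetical), not Smile's implementation, which delegates to LAPACK's dgetrs; the unit lower-triangular L, upper-triangular U, and pivot vector are assumed to come from a prior LU factorization with partial pivoting.

```java
// Sketch of the LU solve phase: permute b, forward-substitute with L,
// then back-substitute with U. Factors are hardcoded for a 2x2 example.
public class LuSolve {
    // Forward substitution: solve L y = b for unit lower-triangular L.
    static double[] forward(double[][] L, double[] b) {
        int n = b.length;
        double[] y = new double[n];
        for (int i = 0; i < n; i++) {
            double s = b[i];
            for (int j = 0; j < i; j++) s -= L[i][j] * y[j];
            y[i] = s;               // L[i][i] == 1 for Doolittle-style LU
        }
        return y;
    }

    // Back substitution: solve U x = y for upper-triangular U.
    static double[] back(double[][] U, double[] y) {
        int n = y.length;
        double[] x = new double[n];
        for (int i = n - 1; i >= 0; i--) {
            double s = y[i];
            for (int j = i + 1; j < n; j++) s -= U[i][j] * x[j];
            x[i] = s / U[i][i];
        }
        return x;
    }

    public static void main(String[] args) {
        // PA = LU for A = [[0,2],[1,1]]: the pivot swaps the rows,
        // so PA = [[1,1],[0,2]] = I * [[1,1],[0,2]].
        double[][] L = {{1, 0}, {0, 1}};
        double[][] U = {{1, 1}, {0, 2}};
        int[] pivot = {1, 0};
        double[] b = {2, 3};                     // solve A x = b

        double[] pb = new double[b.length];
        for (int i = 0; i < b.length; i++) pb[i] = b[pivot[i]];  // Pb
        double[] x = back(U, forward(L, pb));
        System.out.println(x[0] + " " + x[1]);   // prints "2.0 1.0"
    }
}
```

Note how the O(n^3) elimination work lives entirely in producing L and U; each solve afterwards is two O(n^2) sweeps.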
Cholesky Solve
Given A = LL^T (A symmetric positive definite), solving Ax = b:
- Forward substitution: solve Ly = b
- Back substitution: solve L^T x = y
Complexity: O(n^2) for the solve; about n^3/3 flops for the factorization.
Advantage over LU: approximately 2x faster because the Cholesky factor is half the size and no pivoting is required.
LAPACK routine: dpotrs / spotrs
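A minimal sketch of both phases, assuming a textbook Cholesky factorization (this is illustrative Java, not Smile's code, which calls LAPACK's dpotrs). The back-substitution sweep reads L^T implicitly via the transposed index L[j][i], so only the lower triangle is ever stored:

```java
// Cholesky factor A = L L^T for an SPD matrix, then solve A x = b
// with one forward and one back substitution.
public class CholeskySolve {
    // Returns lower-triangular L with A = L L^T (A must be SPD).
    static double[][] cholesky(double[][] A) {
        int n = A.length;
        double[][] L = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j <= i; j++) {
                double s = A[i][j];
                for (int k = 0; k < j; k++) s -= L[i][k] * L[j][k];
                L[i][j] = (i == j) ? Math.sqrt(s) : s / L[j][j];
            }
        }
        return L;
    }

    // Solve A x = b given L: forward L y = b, then back L^T x = y.
    static double[] solve(double[][] L, double[] b) {
        int n = b.length;
        double[] y = new double[n];
        for (int i = 0; i < n; i++) {
            double s = b[i];
            for (int j = 0; j < i; j++) s -= L[i][j] * y[j];
            y[i] = s / L[i][i];
        }
        double[] x = new double[n];
        for (int i = n - 1; i >= 0; i--) {
            double s = y[i];
            for (int j = i + 1; j < n; j++) s -= L[j][i] * x[j];  // (L^T)[i][j] = L[j][i]
            x[i] = s / L[i][i];
        }
        return x;
    }

    public static void main(String[] args) {
        double[][] A = {{4, 2}, {2, 3}};         // SPD
        double[] b = {10, 8};
        double[] x = solve(cholesky(A), b);
        System.out.println(x[0] + " " + x[1]);   // prints "1.75 1.5"
    }
}
```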
QR Solve (Least Squares)
For overdetermined systems (m > n), QR solves the least squares problem min_x ||Ax - b||_2.
Given A = QR:
- Apply Q^T: compute Q^T b using dormqr / sormqr
- Triangular solve: solve Rx = Q^T b using dtrtrs / strtrs
The solution minimizes the residual ||Ax - b||_2 and is numerically more stable than the normal equations approach A^T A x = A^T b.
LAPACK routines: dormqr + dtrtrs / sormqr + strtrs
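As a hedged sketch of the same two-step recipe, the snippet below builds a thin QR via modified Gram-Schmidt and then back-substitutes R x = Q^T b. This is for exposition only: LAPACK (and hence Smile) uses Householder reflections, which are more stable than Gram-Schmidt, and all names here are made up for the example.

```java
// Least squares via thin QR: A = QR with orthonormal columns in Q,
// then x = R^{-1} Q^T b.
public class QrLeastSquares {
    // Modified Gram-Schmidt thin QR: returns {Q (m x n), R (n x n)}.
    static double[][][] qr(double[][] A) {
        int m = A.length, n = A[0].length;
        double[][] Q = new double[m][n], R = new double[n][n];
        for (int i = 0; i < m; i++) System.arraycopy(A[i], 0, Q[i], 0, n);
        for (int k = 0; k < n; k++) {
            double norm = 0;
            for (int i = 0; i < m; i++) norm += Q[i][k] * Q[i][k];
            R[k][k] = Math.sqrt(norm);
            for (int i = 0; i < m; i++) Q[i][k] /= R[k][k];
            for (int j = k + 1; j < n; j++) {     // orthogonalize later columns
                double dot = 0;
                for (int i = 0; i < m; i++) dot += Q[i][k] * Q[i][j];
                R[k][j] = dot;
                for (int i = 0; i < m; i++) Q[i][j] -= dot * Q[i][k];
            }
        }
        return new double[][][]{Q, R};
    }

    static double[] solve(double[][] A, double[] b) {
        double[][][] f = qr(A);
        double[][] Q = f[0], R = f[1];
        int m = A.length, n = A[0].length;
        double[] qtb = new double[n];             // step 1: Q^T b
        for (int j = 0; j < n; j++)
            for (int i = 0; i < m; i++) qtb[j] += Q[i][j] * b[i];
        double[] x = new double[n];               // step 2: R x = Q^T b
        for (int i = n - 1; i >= 0; i--) {
            double s = qtb[i];
            for (int j = i + 1; j < n; j++) s -= R[i][j] * x[j];
            x[i] = s / R[i][i];
        }
        return x;
    }

    public static void main(String[] args) {
        // Fit y = c0 + c1*t to (0,1), (1,3), (2,5): the exact line 1 + 2t.
        double[][] A = {{1, 0}, {1, 1}, {1, 2}};
        double[] b = {1, 3, 5};
        double[] x = solve(A, b);
        System.out.println(x[0] + " " + x[1]);    // c0 = 1, c1 = 2
    }
}
```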
SVD Solve (Pseudoinverse)
For rank-deficient or ill-conditioned systems, SVD provides the most robust solution:
x = V S+ U^T b
where S+ is formed by inverting only the nonzero (above-threshold) singular values. Small singular values (s_i below the threshold) are treated as zero, providing a regularized solution.
- Compute U^T b (project onto the left singular vectors)
- Divide by the singular values: z_i = (U^T b)_i / s_i
- Compute x = Vz (expand in the right singular vector basis)
Advantages: Works for any matrix (including singular and rank-deficient); provides the minimum-norm solution when the system is underdetermined.
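The three-step application can be shown with a sketch that takes a precomputed SVD as input (computing the SVD itself is out of scope here). The factors below are hardcoded for a tiny singular matrix whose SVD is known by inspection; the class and method names are illustrative, not Smile's API.

```java
// SVD solve phase: x = V * diag(1/s_i if s_i > tol, else 0) * U^T b.
// Zeroing small singular values yields the regularized, minimum-norm
// least squares solution.
public class SvdSolve {
    static double[] solve(double[][] U, double[] s, double[][] V,
                          double[] b, double tol) {
        int n = s.length;
        double[] z = new double[n];
        // Steps 1 + 2: z_i = (U^T b)_i / s_i, truncating small s_i.
        for (int i = 0; i < n; i++) {
            if (s[i] > tol) {
                double dot = 0;
                for (int k = 0; k < b.length; k++) dot += U[k][i] * b[k];
                z[i] = dot / s[i];
            } // else z[i] stays 0: regularization by truncation
        }
        // Step 3: x = V z.
        double[] x = new double[V.length];
        for (int i = 0; i < x.length; i++)
            for (int j = 0; j < n; j++) x[i] += V[i][j] * z[j];
        return x;
    }

    public static void main(String[] args) {
        // A = diag(3, 0) is singular; its SVD is U = V = I, s = (3, 0).
        double[][] U = {{1, 0}, {0, 1}};
        double[][] V = {{1, 0}, {0, 1}};
        double[] s = {3, 0};
        double[] b = {6, 5};
        double[] x = solve(U, s, V, b, 1e-10);
        System.out.println(x[0] + " " + x[1]);   // minimum-norm solution (2, 0)
    }
}
```

Because the second singular value is truncated, the component of b outside A's range is simply ignored, and the returned x is the shortest vector minimizing the residual.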
Iterative Solving
Biconjugate Gradient Method (BiCG)
For large sparse systems where direct decomposition is impractical, the Biconjugate Gradient method iteratively refines an approximate solution. Given Ax = b:
- Start with an initial guess x_0
- Compute the residual r_0 = b - A x_0
- Iterate: update x_k using search directions derived from the residual
Key operations per iteration:
- One matrix-vector product A p
- One transpose matrix-vector product A^T p~
- One preconditioner solve
Convergence criteria (controlled by itol parameter):
| itol | Criterion | Description |
|---|---|---|
| 1 | Relative residual norm | Stop when norm(b - Ax) / norm(b) < tol |
| 2 | Preconditioned residual | Stop when norm(M^-1 (b - Ax)) / norm(M^-1 b) < tol |
| 3 | Solution change (L2) | Stop when the L2 norm of the update to x, relative to norm(x), is < tol |
| 4 | Solution change (Linf) | Stop when the largest component change in x, relative to the largest component of x, is < tol |
Preconditioning: The default preconditioner is Jacobi (diagonal scaling), which divides each equation by its diagonal element. Better preconditioners (ILU, SSOR) can be supplied via the Preconditioner interface.
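The method can be sketched as a compact preconditioned BiCG loop with the default Jacobi preconditioner and the itol = 1 stopping rule. This is a minimal standalone illustration (dense matrix, made-up names), not Smile's BiconjugateGradient implementation; note the one A*p and one A^T*p~ product per iteration called out above.

```java
// Preconditioned Biconjugate Gradient with a Jacobi (diagonal)
// preconditioner; stops when ||r|| / ||b|| < tol (itol = 1).
public class BiCG {
    static double[] matvec(double[][] A, double[] v, boolean transpose) {
        int n = v.length;
        double[] w = new double[n];
        for (int i = 0; i < n; i++)
            for (int j = 0; j < n; j++)
                w[i] += (transpose ? A[j][i] : A[i][j]) * v[j];
        return w;
    }

    static double dot(double[] a, double[] b) {
        double s = 0;
        for (int i = 0; i < a.length; i++) s += a[i] * b[i];
        return s;
    }

    static double[] solve(double[][] A, double[] b, double tol, int maxIter) {
        int n = b.length;
        double[] x = new double[n];                  // x_0 = 0
        double[] r = b.clone();                      // r_0 = b - A x_0 = b
        double[] rt = b.clone();                     // shadow residual r~
        double[] p = new double[n], pt = new double[n];
        double rho = 1;
        double bnorm = Math.sqrt(dot(b, b));
        for (int iter = 0; iter < maxIter; iter++) {
            // Jacobi preconditioner solve: z = D^{-1} r (same for shadow system).
            double[] z = new double[n], zt = new double[n];
            for (int i = 0; i < n; i++) { z[i] = r[i] / A[i][i]; zt[i] = rt[i] / A[i][i]; }
            double rhoNew = dot(rt, z);
            double beta = (iter == 0) ? 0 : rhoNew / rho;
            for (int i = 0; i < n; i++) { p[i] = z[i] + beta * p[i]; pt[i] = zt[i] + beta * pt[i]; }
            rho = rhoNew;
            double[] q = matvec(A, p, false);        // one A * p per iteration
            double[] qt = matvec(A, pt, true);       // one A^T * p~ per iteration
            double alpha = rho / dot(pt, q);
            for (int i = 0; i < n; i++) { x[i] += alpha * p[i]; r[i] -= alpha * q[i]; rt[i] -= alpha * qt[i]; }
            if (Math.sqrt(dot(r, r)) / bnorm < tol) break;   // itol = 1 criterion
        }
        return x;
    }

    public static void main(String[] args) {
        double[][] A = {{4, 1}, {1, 3}};
        double[] b = {1, 2};
        double[] x = solve(A, b, 1e-10, 100);
        System.out.println(x[0] + " " + x[1]);       // exact solution is (1/11, 7/11)
    }
}
```

On a symmetric system like this one BiCG reduces to preconditioned CG; the shadow vectors only earn their keep on nonsymmetric A.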
Choosing the Right Solver
| Solver | Matrix Type | System Type | When to Use |
|---|---|---|---|
| LU | Square, non-singular | Ax = b (exact) | General-purpose; multiple right-hand sides |
| Cholesky | SPD | Ax = b (exact) | Covariance systems, optimization; 2x faster than LU |
| QR | Overdetermined (m > n) | Least squares | Regression, fitting; more stable than normal equations |
| SVD | Any (including rank-deficient) | Least squares / min-norm | Ill-conditioned systems; need rank information |
| BiCG | Large, sparse | Ax = b (iterative) | When decomposition is too expensive; only needs A*v and A^T*v |
Multiple Right-Hand Sides
All direct solvers in Smile support solving AX = B, where B is a matrix (multiple right-hand sides simultaneously). The matrix B is overwritten in place with the solution X.
This is more efficient than solving individual systems because:
- The decomposition is computed once
- LAPACK can optimize memory access patterns for the block solve
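The factor-once, solve-many pattern can be sketched as below. For brevity this toy uses Doolittle LU without pivoting (safe for the diagonally dominant example; real solvers pivot) and, as in Smile, overwrites B in place with X; all names are illustrative.

```java
// Factor A once, then solve A X = B by running the O(n^2) substitution
// sweeps over each column of B.
public class MultiRhs {
    // In-place Doolittle LU without pivoting: returns both factors packed
    // in one matrix (unit lower triangle implicit, upper triangle explicit).
    static double[][] lu(double[][] A) {
        int n = A.length;
        double[][] F = new double[n][n];
        for (int i = 0; i < n; i++) System.arraycopy(A[i], 0, F[i], 0, n);
        for (int k = 0; k < n; k++)
            for (int i = k + 1; i < n; i++) {
                F[i][k] /= F[k][k];
                for (int j = k + 1; j < n; j++) F[i][j] -= F[i][k] * F[k][j];
            }
        return F;
    }

    // Overwrites B with X, column by column, reusing the factorization.
    static void solve(double[][] F, double[][] B) {
        int n = F.length, k = B[0].length;
        for (int c = 0; c < k; c++) {
            for (int i = 0; i < n; i++)              // forward: L y = b
                for (int j = 0; j < i; j++) B[i][c] -= F[i][j] * B[j][c];
            for (int i = n - 1; i >= 0; i--) {       // back: U x = y
                for (int j = i + 1; j < n; j++) B[i][c] -= F[i][j] * B[j][c];
                B[i][c] /= F[i][i];
            }
        }
    }

    public static void main(String[] args) {
        double[][] A = {{4, 1}, {1, 3}};
        double[][] B = {{1, 5}, {2, 4}};             // two right-hand sides
        solve(lu(A), B);
        // Column 0 holds (1/11, 7/11); column 1 holds (1, 1).
        System.out.println(B[0][0] + " " + B[1][0]);
    }
}
```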
Numerical Stability Considerations
- LU with pivoting: Partial pivoting ensures stability for most practical matrices, but can amplify errors for pathological cases.
- QR: Always backward stable; preferred when numerical accuracy is paramount.
- Cholesky: The most stable option for SPD matrices because it exploits positive definiteness.
- SVD: The most robust option overall; can handle singular and near-singular matrices gracefully through thresholding.
- BiCG: Convergence depends on the condition number and the quality of the preconditioner. May stagnate for highly ill-conditioned systems.
Relationship to the Pipeline
Construction --> Arithmetic --> Decomposition --> Solving --> Result Extraction
                                                     ^
                                                     |
                                             (this principle)
Linear system solving is the fourth stage of the pipeline. It consumes decomposition results (LU, QR, Cholesky, SVD records) produced by the decomposition stage and produces solution vectors or matrices.
Related
Knowledge Sources
Domains
Linear_Algebra, Numerical_Computing, Optimization