Stable Causal Graph ⇔ Complete Nonlinear ICA:
Even in Your Cyclic Graph / Dense Generation!
These are some of my computation drafts exploring the equivalence between stable causal graphs and complete nonlinear ICA (beyond the acyclic/lower-triangle case in Fu et al., 2025). The key insight is extending functional equivalence to the non-DAG setting, showing these two concepts remain equivalent even with cyclic structures.
Setup
Consider a structural causal model with exogenous variables $e = (e_1, e_2, e_3)$, latent sources $s = (s_1, s_2, s_3)$, and observed variables $x = (x_1, x_2, x_3)$, where each layer is generated through an invertible mechanism:
By the chain rule on the Jacobians:
Since all mappings are invertible, the Jacobian $J_x(e)$ is invertible. If the causal graphs over $s$ and $x$ are DAGs with adjacency matrices $B_1$ and $B_2$, we can write:
This gives us the functional equivalence between the mixing function's Jacobian and the causal structure.
Main Result: The Equivalence
Direction 1: Complete Nonlinear ICA $\Rightarrow$ Stable Causal Graph
Suppose we have a complete nonlinear ICA model $x = g(e)$ where $J_g(e)$ is invertible and square. Since $J_g(e)$ is an invertible matrix, it always admits a LUP decomposition:
where:
- $L$: lower triangular matrix (non-zero diagonal)
- $U$: upper triangular matrix (non-zero diagonal)
- $P$: permutation matrix
If we fix the scaling and the permutation $P$, the LUP decomposition is unique. Both $L$ and $U$ have non-zero diagonals, so we can express them as:
where $U'$ is strictly upper triangular and $L'$ is strictly lower triangular (both are DAG adjacency matrices). Since $J_s(e)$ and $J_x(s)$ are bounded, the resulting causal system is stable. Self-loops can be freely introduced following Lacerda et al., 2012.
Direction 2: Stable Causal Graph $\Rightarrow$ Complete Nonlinear ICA
Conversely, if we have a stable causal graph, the Jacobian of the full mapping $x = g(e)$ takes the form $(I - B_1)^{-1}(I - B_2)^{-1}$, which is invertible. This is precisely the structure of a complete nonlinear ICA model.
TL;DR
Two days of "multiplying matrices" led to:
Example: A Chain Graph
Consider a simple chain $x_1 \to x_2 \to x_3$ with adjacency matrix:
Its Markov equivalence class contains DAGs with the same set of conditional independencies. The corresponding "super matrix" (union of all edges in the equivalence class) is:
This illustrates that while the causal direction within a Markov equivalence class is ambiguous from observational data alone, the stable causal graph structure still guarantees complete nonlinear ICA identifiability.