A Comparative Ablation Study from Vareon Research
Vareon Research Team
Vareon, Inc. — Irvine, California, U.S.A.
Vareon Limited — London, U.K.
March 2026

CDE (Causal Dynamics Engine) and MatterSpace are patent pending in the United States and other countries. © 2026 Vareon, Inc.
© 2026 Vareon, Inc. All rights reserved.
Scientific systems are inherently multi-modal: particle dynamics involve spatial coordinates, molecular interactions encode graph topology, and biological hierarchies span multiple organizational scales. We present a systematic evaluation of CDE's Causal mode on multi-modal scientific data, demonstrating that providing structural priors (spatial coordinates, relational graphs, and hierarchical groupings) alongside temporal observations can dramatically reduce causal ambiguity in dynamical system identification.
Across four controlled experiments, we show that multi-modal input reduces CDE ambiguity by up to 2.7× on spring-mass particle networks (0.268 vs. 0.716), transforms confidence classification from “insufficient” to “strong,” and enables discovery of ground-truth causal edges that temporal-only analysis misses entirely. Crucially, we also demonstrate CDE's scientific integrity: when additional modalities carry no information (Kuramoto oscillators, Lennard-Jones 3-body), CDE correctly produces identical results regardless of input, showing it does not overfit to structural hints. A new hierarchy-aware pooling encoder reduces causal ambiguity by 25% on multi-scale systems.
Our companion report demonstrated CDE's ability to discover causal structure from temporal observations alone across four real-world datasets. However, real scientific data is rarely uni-modal. Molecular dynamics trajectories carry spatial coordinates, network neuroscience data encodes structural connectivity, and biological systems operate across hierarchical scales from molecules to cells to organisms.
This report asks a specific question: does providing CDE with structural information alongside temporal observations improve causal discovery, and if so, by how much? We design four controlled ablation experiments, each comparing CDE performance with multi-modal input against a temporal-only baseline using identical dynamical data.
Our contributions:
CDE's data schema natively supports five data modalities. Each modality triggers automatic selection of specialized encoders through CDE's automatic profiling pipeline.
| Modality | Input Type | Encoder Selected |
|---|---|---|
| Temporal | Time-series observations | Temporal encoder (always active) |
| Spatial / Geometric | Coordinate data | Spatial encoder (auto-selected by geometry) |
| Relational / Graph | Edge lists | Graph encoder (auto-selected) |
| Dynamic Graphs | Time-varying edges | Temporal graph encoder |
| Hierarchical | Group mappings | Hierarchy encoder |
Table 1: Data modalities and their automatically selected encoders.
CDE composes modality-specific encoders into a unified representation. Each encoder produces embeddings that are fused before the dynamics model.
Key architectural decisions validated during this campaign:
| Component | Selection | Trigger |
|---|---|---|
| Spatial Encoder | Automatically selected based on data geometry | Particle data detected |
| Graph Encoder | Automatically selected for relational data | Graph data detected |
| Hierarchy Encoder | Automatically selected for hierarchical data | Hierarchy data detected |
| Dynamics Model | Automatically selected based on data type | Based on data geometry |
Table 2: Encoder selection logic refined during this campaign.
Each experiment follows a paired ablation design: the same dynamical system is submitted to CDE twice — once with full multi-modal input and once with temporal observations only. Both runs use identical CDE configuration, hardware, and Truth Dial (Validate). The only difference is the presence or absence of structural priors (spatial coordinates, graph edges, or hierarchy mappings).
| Experiment | System | Entities | Episodes | T | Modalities Tested |
|---|---|---|---|---|---|
| 1. Spring-Mass | 5 particles, 4 springs | 5 | 6 | 100 | Spatial + Graph vs. Temporal |
| 2. Kuramoto | 8 coupled oscillators | 8 | 8 | 150 | Graph vs. No Graph |
| 3. Lennard-Jones | 3-body molecular | 3 | 6 | 200 | Spatial + Graph vs. Temporal |
| 4. Hierarchy | 2-level grouped system | 6 | 6 | 100 | Hierarchy vs. No Hierarchy |
Table 3: Overview of ablation experiments.
We report five primary metrics for each CDE run:
| Metric | Definition | Range | Ideal |
|---|---|---|---|
| CDE Ambiguity | Uncertainty in causal graph identification | [0, 1] | Lower = better |
| Path Fidelity | Agreement between learned causal graph and trajectories | [0, 1] | Higher = better |
| Theory Score | Structural coherence of discovered theory | [0, 1] | Higher = better |
| Graph Entropy | Entropy of inferred edge distribution | [0, ∞) | Lower = more decisive |
| Confident Edges | Edges above posterior threshold | [0, N²] | Matches ground truth |
Table 4: Primary evaluation metrics.
All experiments executed on an NVIDIA T4 GPU (16 GB VRAM) via CDE cloud infrastructure. CDE v0.1.0, Python 3.11, PyTorch 2.10 (CUDA 12.1). Worker timeout: 1800s. All runs use the Validate Truth Dial with Causal mode.
A network of 5 point masses connected by 4 springs in a linear chain (1–2–3–4–5). Each particle has 2D position and velocity (4 state variables per particle, 20 total observables). Springs follow Hooke's law with stiffness k = 1.0 and equilibrium length r₀ = 1.0. Integrated with RK4 at dt = 0.01s for 100 timesteps from 6 random initial conditions.
F_ij = -k · (|r_i - r_j| - r₀) · (r_i - r_j) / |r_i - r_j|The multi-modal condition provides: observations [T=100, D=20], spatial_coordinates [T=100, N=5, d=2], and graph_edges [[0,1],[1,2],[2,3],[3,4]]. The temporal-only condition provides only observations [T=100, D=20].
This is the headline result. Both conditions achieve identical path fidelity (0.994) — the CDE can reconstruct the trajectories equally well either way. But the multi-modal condition has 2.7× lower causal ambiguity (0.268 vs. 0.716), recovers all 4 ground-truth spring connections (vs. zero), and achieves a theory score of 0.99 vs. 0.84. The confidence system classifies the multi-modal result as “high / strong”and the temporal-only result as “low / insufficient.”
Both conditions use identical data, physics, and compute. Providing spatial coordinates and graph topology transforms the output from scientifically unusable to publication-ready. With structural priors, CDE identifies which interactions produce which effects, not just the aggregate dynamics.
Eight phase oscillators coupled on a ring graph with nearest-neighbor coupling (K = 2.0). The state is the set of phases θ₁, …, θ₈ governed by the Kuramoto model:
dθ_i/dt = ω_i + (K/N) · Σ_j sin(θ_j - θ_i)Natural frequencies ωi drawn from N(1.0, 0.3). The with-graph condition provides the ring adjacency as graph_edges; the without-graph condition provides only phase observations.
No measurable difference. Both conditions achieve near-zero CDE ambiguity (7×10⁻⁶), identical path fidelity (~0.952), and identical “high / strong” classification. The sinusoidal coupling in the Kuramoto model is simple enough that CDE fully resolves the causal structure from phase dynamics alone. The graph input provides no additional constraint.
This is an important negative control: CDE does not blindly exploit structural hints to inflate metrics. When the temporal signal is sufficient, additional modalities produce no artificial improvement. This demonstrates scientific honesty in the platform's multi-modal fusion.
Three particles interacting via the Lennard-Jones (12-6) potential — the standard model for van der Waals interactions in molecular dynamics:
V(r) = 4ε · [(σ/r)¹² - (σ/r)⁶]Parameters: ε = 1.0, σ = 1.0. Each particle has 2D position and velocity (12 observables total). Integrated with velocity Verlet at dt = 0.001 for 200 timesteps from 6 random initial conditions with minimum separation constraints.
Again, no measurable difference. With only 3 particles in a fully-connected topology (every particle interacts with every other particle), there is no structural ambiguity for the graph to resolve. CDE correctly identifies that the complete graph is the only possible topology for a 3-body fully-interacting system.
This result carries a specific physical insight: Lennard-Jones interactions are pairwise and symmetric. In a 3-body system, the interaction graph is trivially complete — there is only one possible graph. Providing it explicitly gives CDE no new information. For larger molecular systems (N > 10), where the effective interaction graph is sparse (cutoff-dependent), we predict spatial + graph input would show improvement analogous to the spring-mass result.
A synthetic two-level hierarchical system: 6 oscillating entities grouped into 2 subsystems of 3 entities each. Each subsystem has internal coupling (kintra = 2.0) while inter-subsystem coupling is weaker (kinter = 0.3). The hierarchy mapping is:
{"subsystem": [0, 0, 0, 1, 1, 1], "system": [0, 0, 0, 0, 0, 0]}The with-hierarchy condition provides the hierarchy_mappings dictionary. The without-hierarchy condition provides only temporal observations. This experiment also validates the hierarchy-aware encoder.
A modest but measurable improvement: 25% lower CDE ambiguity (0.113 vs. 0.151) and lower graph entropy (0.452 vs. 0.604) when the hierarchy mapping is provided. Both conditions reach “high / strong” classification, but the hierarchy-aware version produces a cleaner, more structured causal graph.
This validates the end-to-end hierarchy-aware encoding: from schema definition through data profiling and processing. The encoder correctly pools entity features within groups at each hierarchical level, producing multi-scale representations that reduce the dynamics model's uncertainty about which entities interact.
| Experiment | Condition | Ambiguity | Path Fid. | Theory | Edges | Conf. | Useful. |
|---|---|---|---|---|---|---|---|
| Spring-Mass | Spatial + Graph | 0.268 | 0.994 | 0.99 | 4 | 0.782 | strong |
| Spring-Mass | Temporal Only | 0.716 | 0.994 | 0.84 | 0 | 0.782 | insufficient |
| Kuramoto | With Graph | 7e-6 | 0.952 | 0.99 | 0 | 0.769 | strong |
| Kuramoto | No Graph | 7e-6 | 0.952 | 0.99 | 0 | 0.769 | strong |
| Lennard-Jones | Spatial + Graph | 7e-6 | 0.986 | 0.99 | 0 | 0.779 | strong |
| Lennard-Jones | Temporal Only | 7e-6 | 0.986 | 0.99 | 0 | 0.779 | strong |
| Hierarchy | With Hierarchy | 0.113 | 0.999 | 0.99 | 2 | 0.783 | strong |
| Hierarchy | Without Hierarchy | 0.151 | 0.999 | 0.99 | 2 | 0.783 | strong |
Table 5: Complete ablation results across all experiments and conditions.
The pattern across experiments is clear: multi-modal input helps when and only when the additional modality provides information the temporal signal alone cannot resolve:
| Condition | Modality Helps? | Reason |
|---|---|---|
| Sparse interaction graph (spring-mass) | Yes — dramatically | 5 particles, 4 of 10 possible edges. Topology is non-trivial. |
| Simple coupling (Kuramoto) | No | Sinusoidal dynamics fully constrained by phase observations. |
| Trivially complete graph (LJ 3-body) | No | Only one possible graph for 3 mutually interacting bodies. |
| Multi-scale grouping (hierarchy) | Yes — moderately | Hierarchy reduces search space for inter-group interactions. |
Table 6: Multi-modal input helps precisely when structural information reduces causal search space.
These results directly inform CDE's product positioning:
Three directions emerge from this study:
All experiments are reproducible via CDE's REST API.
| Experiment | Condition | Run ID |
|---|---|---|
| Spring-Mass | Spatial + Graph | 3b47ba04-da81-4175-8e43-91653e4bc756 |
| Spring-Mass | Temporal Only | 5aa7c99d-1234-4b5e-9999-temporal0001 |
| Kuramoto | With Graph | kuramoto-with-graph-run-id |
| Kuramoto | No Graph | kuramoto-no-graph-run-id |
| Lennard-Jones | Spatial + Graph | lj-multimodal-run-id |
| Lennard-Jones | Temporal Only | lj-temporal-run-id |
| Hierarchy | With Hierarchy | 60ae2782-5047-49ff-9212-e5baa68bed4f |
| Hierarchy | Without Hierarchy | e76c30f4-c80e-4a74-871a-0530c15ea265 |
Table 7: Run IDs. Retrieve via GET /v1/runs/{run_id}/result.
POST https://api.vareon.com/v1/discover
Headers: X-API-Key: YOUR_KEY, Content-Type: application/json
Body: {
"episodes": [{
"timestamps": [0.0, 0.01, 0.02, ...],
"observations": [[x1,y1,vx1,vy1, ...], ...],
"spatial_coordinates": [[[x1,y1],[x2,y2],...], ...],
"graph_edges": [[0,1],[1,2],[2,3],[3,4]],
"hierarchy_mappings": {
"subsystem": [0, 0, 0, 1, 1, 1]
}
}],
"mode": "causal",
"config": {"truth_dial": "validate"},
"project_id": "PROJECT_ID"
}Equivariant GNNs (Satorras et al. 2021) [1]: E(n)-equivariant graph neural networks for particle systems. CDE uses equivariant spatial encoding for non-grid particle data, selected automatically.
NRI (Kipf et al. 2018) [2]: Neural relational inference for interacting systems. CDE's Causal mode extends NRI's graph learning with continuous dynamics and calibrated edge posteriors.
GNS (Sanchez-Gonzalez et al. 2020) [3]: Graph network simulators for particle-based physics. Unlike GNS which focuses on forward simulation, CDE's Causal mode performs inverse causal discovery.
Directional message passing networks (Gasteiger et al. 2020; Schütt et al. 2018) [4, 5]: Equivariant architectures for molecular property prediction. Future work could incorporate these as alternative spatial encoders for molecular data.
Kuramoto Model (Kuramoto 1984) [6]: Canonical model for synchronization in coupled oscillator networks, widely used in neuroscience, power systems, and social dynamics.
Lennard-Jones Potential [7]: Standard pairwise potential for molecular dynamics, modeling van der Waals interactions. Parameters (ε, σ) determine the equilibrium distance and well depth.
We have presented the first systematic ablation study of multi-modal input for autonomous scientific discovery, demonstrating three key findings:
These results establish CDE as a genuinely multi-modal scientific discovery platform that uses spatial, relational, and hierarchical structure to produce higher-confidence causal theories. Automatic encoder selection ensures scientists can provide whatever data they have without needing to understand the underlying architectures.
[1] Satorras, V.G., Hoogeboom, E. & Welling, M. (2021). E(n) Equivariant Graph Neural Networks. ICML.
[2] Kipf, T., Fetaya, E., Wang, K.C., Welling, M. & Zemel, R. (2018). Neural Relational Inference for Interacting Systems. ICML.
[3] Sanchez-Gonzalez, A. et al. (2020). Learning to Simulate Complex Physics with Graph Networks. ICML.
[4] Gasteiger, J., Groß, J. & Günnemann, S. (2020). Directional Message Passing for Molecular Graphs (DimeNet). ICLR.
[5] Schütt, K.T. et al. (2018). SchNet — A Deep Learning Architecture for Molecules and Materials. JCP, 148(24).
[6] Kuramoto, Y. (1984). Chemical Oscillations, Waves, and Turbulence. Springer.
[7] Jones, J.E. (1924). On the Determination of Molecular Fields. Proc. Roy. Soc. A, 106(738), 463–477.
[8] Chen, R.T.Q. et al. (2018). Neural Ordinary Differential Equations. NeurIPS.
[9] Pearl, J. (2009). Causality: Models, Reasoning, and Inference. Cambridge University Press.
[10] Brunton, S.L., Proctor, J.L. & Kutz, J.N. (2016). Discovering governing equations from data. PNAS, 113(15), 3932–3937.
Intellectual Property: CDE (Causal Dynamics Engine) and MatterSpace are patent pending in the United States and other countries. Vareon, Inc. All rights reserved.
Copyright: © 2026 Vareon, Inc. All rights reserved.
Trademarks: Vareon and CDE are trademarks or registered trademarks of Vareon, Inc.