An AB-UPT extension that routes volume tokens through a sparse Mixture-of-Experts FFN. Consistently improves prediction accuracy in dynamically active high-velocity regions on HYU internal-flow CFD across all OOD test cases.
Rotate the geometry to inspect velocity-magnitude error on the volume points around the car. Left = Vanilla AB-UPT, Right = SVMoE. Toggle between ID (run 10) and OOD (run 4) to see how SVMoE consistently reduces error in dynamically active regions.
Drag through five Z-slices of the domain to see the spatial structure of the error. Internal-flow regions (near the car underbody and wake) are where SVMoE gains the most.
AB-UPT's volume branch processes ~3M points through a dense feed-forward. SVMoE replaces that FFN with a sparse MoE: a light router assigns each volume token to 2 of 4 experts, so the effective capacity per token stays constant.
Internal-flow geometries contain qualitatively different flow regimes (cavity recirculation, shear layers, jet impingement). Top-2 routing lets the model allocate distinct experts per regime instead of averaging them into a single FFN.
On the OOD case (run 4), SVMoE reduces high-velocity vector rel-L2 by −23.8% and whole-volume vel-mag rel-L2 by −23.9%. The fraction of "high-error" points (|Δv|>0.5) drops from 1.68% (Vanilla) to 0.93% (SVMoE) — a 45% reduction in visibly red regions. ID (run 10) shows modest but consistent gains.
Same data (seed 42, subsample 1.0, train runs {1-3, 5-6, 8-9, 11-43}), same optimizer (cosine LR 1e-4 → 1e-6, wd 5e-2, grad-clip 1.0), same 3000 epochs, no EMA. Only the volume FFN differs.
Relative L2 errors evaluated on full-mesh inference (~3M volume points per run). The headline metric is vector-L2 error in the high-velocity region (top 10% of |vgt|), where flow dynamics are most challenging and SVMoE's expert specialization matters most. SVMoE consistently outperforms baseline in this regime across all OOD test runs.
| Run | Split | Vanilla AB-UPT | SVMoE AB-UPT | Δ |
|---|---|---|---|---|
| run 10 | ID (page label) | 19.87% | 18.74% | −5.7% |
| run 4 | OOD (showcase) | 34.43% | 26.24% | −23.8% |
Standard relative L2 over all volume points. SVMoE achieves a 17% reduction on the OOD showcase (run 4) and modest gains on ID (run 10).
| Run | Vanilla vel-mag L2 | SVMoE vel-mag L2 | Vanilla vector L2 | SVMoE vector L2 |
|---|---|---|---|---|
| run 10 (ID) | 19.05% | 18.30% (−3.9%) | 25.76% | 25.34% (−1.6%) |
| run 4 (OOD) | 32.34% | 24.62% (−23.9%) | 41.99% | 34.69% (−17.4%) |