AIDE Chip Surrogates
This is a collection of physics-aware, monotonicity-constrained XGBoost models that replace expensive gem5 cache simulations during design-space exploration.
Each model predicts either IPC or L2 miss rate for a specific workload, using only cache configuration parameters as input. The models are interpretable via SHAP and enforce microarchitectural monotonicity where physically justified.
This model release accompanies the paper:
Udayshankar Ravikumar . Fast, Explainable Surrogate Models for gem5 Cache Design Space Exploration. Authorea. January 14, 2026. https://doi.org/10.22541/au.176843174.46109183/v1
Model Architecture
- Algorithm: XGBoost Regressor
- Targets:
- IPC
- L2 miss rate
- Features:
- Logβ cache sizes & associativities
- Set-count proxies
- Cache hierarchy ratios
- Constraints:
- Monotonic constraints encoding cache physics
- Selective relaxation for latency-sensitive workloads
Available Models
| Workload | IPC Model | L2 Miss Model |
|---|---|---|
| crc32 | β | β |
| dijkstra | β | β |
| fft | β | β |
| matrix_mul | β | β |
| qsort | β | β |
| sha | β | β |
Total models: 12
Performance
- Test set accuracy: RΒ² β 0.999
- OOD validation:
- 26 unseen cache configurations
- ~817Γ critical-path speedup
- Low absolute error even when RΒ² is unstable
Explainability
Each model is uploaded with its SHAP summary plot. They confirm:
- Cache sizes dominate IPC & miss behavior
- Associativity effects are workload-dependent
- Learned relationships align with microarchitectural intuition
Intended Use
- Architecture research
- Design-space exploration
- Educational use
- Explainable systems ML
Not for commercial deployment without separate licensing.
Limitations
- Single-core, single-thread models
- Cache hierarchy only (no pipeline, prefetcher, or multicore effects)
- Accuracy depends on training coverage; extreme OOD configs are flagged
Patent Notice
The models uploaded here implement techniques described in an accompanying research paper. The author has filed a pending patent application that may cover broader design-space exploration workflows beyond these specific model implementations.
The open-source license (CC BY-NC-SA 4.0) governs use of these models. This notice is informational only.