LennartPurucker commited on
Commit
28bec71
·
1 Parent(s): 4fda28d

maint: update to new lb design

Browse files
Files changed (47) hide show
  1. README.md +4 -2
  2. data/full-imputed-cls/figures/critical-diagram.png.zip +0 -3
  3. data/full-imputed-cls/leaderboard.tex +0 -53
  4. data/full-imputed-cls/tabarena_leaderboard.csv +0 -46
  5. data/full-imputed-cls/time_plot.png.zip +0 -3
  6. data/full-imputed-cls/tuning-impact-elo-horizontal.png.zip +0 -3
  7. data/full-imputed-cls/tuning-impact-elo.png.zip +0 -3
  8. data/full-imputed-reg/figures/critical-diagram.png.zip +0 -3
  9. data/full-imputed-reg/leaderboard.tex +0 -52
  10. data/full-imputed-reg/tabarena_leaderboard.csv +0 -45
  11. data/full-imputed-reg/time_plot.png.zip +0 -3
  12. data/full-imputed-reg/tuning-impact-elo-horizontal.png.zip +0 -3
  13. data/full-imputed-reg/tuning-impact-elo.png.zip +0 -3
  14. data/full-imputed/figures/critical-diagram.png.zip +0 -3
  15. data/full-imputed/leaderboard.tex +0 -53
  16. data/full-imputed/tabarena_leaderboard.csv +0 -46
  17. data/full-imputed/time_plot.png.zip +0 -3
  18. data/full-imputed/tuning-impact-elo-horizontal.png.zip +0 -3
  19. data/full-imputed/tuning-impact-elo.png.zip +0 -3
  20. data/lite/full-imputed/figures/critical-diagram.png.zip +0 -3
  21. data/lite/full-imputed/leaderboard.tex +0 -53
  22. data/lite/full-imputed/tabarena_leaderboard.csv +0 -46
  23. data/lite/full-imputed/time_plot.png.zip +0 -3
  24. data/lite/full-imputed/tuning-impact-elo-horizontal.png.zip +0 -3
  25. data/lite/full-imputed/tuning-impact-elo.png.zip +0 -3
  26. data/tabicl-imputed/figures/critical-diagram.png.zip +0 -3
  27. data/tabicl-imputed/leaderboard.tex +0 -53
  28. data/tabicl-imputed/tabarena_leaderboard.csv +0 -46
  29. data/tabicl-imputed/time_plot.png.zip +0 -3
  30. data/tabicl-imputed/tuning-impact-elo-horizontal.png.zip +0 -3
  31. data/tabicl-imputed/tuning-impact-elo.png.zip +0 -3
  32. data/tabpfn-imputed/figures/critical-diagram.png.zip +0 -3
  33. data/tabpfn-imputed/leaderboard.tex +0 -53
  34. data/tabpfn-imputed/tabarena_leaderboard.csv +0 -46
  35. data/tabpfn-imputed/time_plot.png.zip +0 -3
  36. data/tabpfn-imputed/tuning-impact-elo-horizontal.png.zip +0 -3
  37. data/tabpfn-imputed/tuning-impact-elo.png.zip +0 -3
  38. data/tabpfn-tabicl/figures/critical-diagram.png.zip +0 -3
  39. data/tabpfn-tabicl/leaderboard.tex +0 -53
  40. data/tabpfn-tabicl/tabarena_leaderboard.csv +0 -46
  41. data/tabpfn-tabicl/time_plot.png.zip +0 -3
  42. data/tabpfn-tabicl/tuning-impact-elo-horizontal.png.zip +0 -3
  43. data/tabpfn-tabicl/tuning-impact-elo.png.zip +0 -3
  44. main.py +242 -238
  45. old_data/v0_1_0/tabarena_leaderboard.csv.zip +0 -3
  46. pyproject.toml +2 -2
  47. website_texts.py +32 -11
README.md CHANGED
@@ -8,7 +8,7 @@ app_file: main.py
8
  pinned: true
9
  license: apache-2.0
10
  short_description: 'TabArena'
11
- sdk_version: 5.33.2
12
  ---
13
 
14
  # TabArena Leaderboard Code
@@ -18,11 +18,13 @@ The leaderboard is hosted on a HuggingFace space.
18
 
19
  Reference:
20
  * Website: https://tabarena.ai
21
- * Paper: TBA
22
  * TabArena Codebase: https://tabarena.ai/code
23
 
24
  # Install LB Code for Development
25
 
26
  ```bash
27
  pip install -e ".[dev]"
 
 
28
  ```
 
8
  pinned: true
9
  license: apache-2.0
10
  short_description: 'TabArena'
11
+ sdk_version: 5.49.1
12
  ---
13
 
14
  # TabArena Leaderboard Code
 
18
 
19
  Reference:
20
  * Website: https://tabarena.ai
21
+ * Paper: https://tabarena.ai/paper-tabular-ml-iid-study
22
  * TabArena Codebase: https://tabarena.ai/code
23
 
24
  # Install LB Code for Development
25
 
26
  ```bash
27
  pip install -e ".[dev]"
28
+ # Or
29
+ uv pip install -r pyproject.toml && uv pip install pdf2image
30
  ```
data/full-imputed-cls/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:5835744841004196426f3151baa0e22bdc7d9495f41f99bd11af42e8acef0c76
3
- size 338031
 
 
 
 
data/full-imputed-cls/leaderboard.tex DELETED
@@ -1,53 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- TabM (T+E) & \textcolor{gold}{\textbf{1574${}_{-28,+28}$}} & \textcolor{bronze}{\textbf{0.505}} & \textcolor{gold}{\textbf{9.6}} & 4.5 & 3 & \textcolor{bronze}{\textbf{8.2\%}} & 2466.21 & 1.50 \\
8
- AutoGluon 1.3 (4h) & \textcolor{silver}{\textbf{1573${}_{-34,+30}$}} & \textcolor{gold}{\textbf{0.577}} & \textcolor{silver}{\textbf{9.7}} & \textcolor{silver}{\textbf{3.3}} & \textcolor{silver}{\textbf{6}} & \textcolor{gold}{\textbf{6.9\%}} & 1322.72 & 2.36 \\
9
- RealMLP (T+E) & \textcolor{bronze}{\textbf{1552${}_{-30,+32}$}} & 0.470 & \textcolor{bronze}{\textbf{10.5}} & 7.4 & 0 & \textcolor{bronze}{\textbf{8.2\%}} & 6519.69 & 10.84 \\
10
- LightGBM (T+E) & 1537${}_{-30,+26}$ & 0.421 & 11.1 & 6.4 & 1 & 9.8\% & 382.05 & 1.49 \\
11
- TabICL (D) & 1530${}_{-31,+23}$ & \textcolor{silver}{\textbf{0.512}} & 11.4 & \textcolor{bronze}{\textbf{3.5}} & \textcolor{silver}{\textbf{6}} & \textcolor{silver}{\textbf{7.9\%}} & 8.68 & 1.74 \\
12
- TabM (T) & 1487${}_{-30,+29}$ & 0.420 & 13.2 & 6.4 & 1 & 9.3\% & 2466.21 & 0.18 \\
13
- CatBoost (T+E) & 1479${}_{-26,+24}$ & 0.395 & 13.6 & 8.7 & 0 & 9.1\% & 1372.94 & 0.56 \\
14
- CatBoost (T) & 1469${}_{-27,+25}$ & 0.379 & 13.9 & 7.0 & 1 & 9.3\% & 1372.94 & 0.07 \\
15
- LightGBM (T) & 1463${}_{-29,+30}$ & 0.336 & 14.3 & 12.1 & 0 & 10.6\% & 382.05 & 0.25 \\
16
- CatBoost (D) & 1454${}_{-28,+23}$ & 0.362 & 14.7 & 7.1 & 1 & 10.3\% & 5.72 & 0.08 \\
17
- TabPFNv2 (T+E) & 1454${}_{-31,+25}$ & 0.497 & 14.7 & \textcolor{gold}{\textbf{3.1}} & \textcolor{gold}{\textbf{8}} & 9.7\% & 3008.22 & 20.85 \\
18
- XGBoost (T+E) & 1452${}_{-28,+31}$ & 0.332 & 14.8 & 9.3 & 0 & 10.7\% & 685.87 & 1.45 \\
19
- ModernNCA (T) & 1416${}_{-34,+24}$ & 0.289 & 16.5 & 9.7 & 1 & 10.5\% & 4879.89 & 0.52 \\
20
- XGBoost (T) & 1412${}_{-26,+28}$ & 0.279 & 16.6 & 13.3 & 0 & 11.1\% & 685.87 & 0.21 \\
21
- ModernNCA (T+E) & 1410${}_{-28,+27}$ & 0.382 & 16.7 & 7.5 & 0 & 10.4\% & 4879.89 & 8.74 \\
22
- TabPFNv2 (T) & 1388${}_{-30,+33}$ & 0.385 & 17.8 & 5.2 & 1 & 12.1\% & 3008.22 & 0.51 \\
23
- TabM (D) & 1375${}_{-26,+30}$ & 0.280 & 18.3 & 11.8 & 0 & 12.6\% & 10.21 & 0.14 \\
24
- TabPFNv2 (D) & 1366${}_{-31,+25}$ & 0.354 & 18.8 & 4.8 & 4 & 13.0\% & 3.37 & 0.32 \\
25
- TorchMLP (T+E) & 1364${}_{-28,+29}$ & 0.233 & 19.1 & 14.8 & 0 & 11.6\% & 2389.22 & 2.16 \\
26
- RealMLP (T) & 1357${}_{-27,+29}$ & 0.188 & 19.4 & 16.7 & 0 & 12.0\% & 6519.69 & 0.53 \\
27
- EBM (T+E) & 1356${}_{-28,+26}$ & 0.188 & 19.3 & 13.5 & 0 & 14.9\% & 914.23 & 0.22 \\
28
- FastaiMLP (T+E) & 1323${}_{-29,+23}$ & 0.203 & 21.0 & 12.4 & 0 & 14.6\% & 618.90 & 4.77 \\
29
- ModernNCA (D) & 1306${}_{-28,+26}$ & 0.140 & 21.8 & 12.3 & 1 & 14.7\% & 14.78 & 0.35 \\
30
- EBM (T) & 1295${}_{-30,+26}$ & 0.130 & 22.4 & 17.9 & 0 & 15.6\% & 914.23 & 0.03 \\
31
- EBM (D) & 1265${}_{-36,+29}$ & 0.143 & 23.9 & 11.4 & 1 & 16.6\% & 4.31 & 0.05 \\
32
- RealMLP (D) & 1253${}_{-27,+23}$ & 0.088 & 24.5 & 21.0 & 0 & 14.6\% & 21.83 & 0.90 \\
33
- XGBoost (D) & 1253${}_{-31,+29}$ & 0.116 & 24.5 & 19.3 & 0 & 14.1\% & 1.77 & 0.12 \\
34
- ExtraTrees (T+E) & 1252${}_{-29,+24}$ & 0.110 & 24.6 & 16.8 & 0 & 15.8\% & 189.76 & 0.74 \\
35
- TabDPT (D) & 1239${}_{-35,+28}$ & 0.180 & 25.1 & 8.1 & 2 & 15.5\% & 22.61 & 8.55 \\
36
- TorchMLP (T) & 1237${}_{-26,+26}$ & 0.106 & 25.1 & 21.7 & 0 & 14.1\% & 2389.22 & 0.15 \\
37
- FastaiMLP (T) & 1220${}_{-29,+28}$ & 0.093 & 26.1 & 20.7 & 0 & 16.6\% & 618.90 & 0.30 \\
38
- RandomForest (T+E) & 1214${}_{-24,+25}$ & 0.105 & 26.3 & 14.4 & 0 & 16.6\% & 323.74 & 0.74 \\
39
- LightGBM (D) & 1198${}_{-28,+29}$ & 0.087 & 27.0 & 23.6 & 0 & 15.4\% & 1.79 & 0.12 \\
40
- ExtraTrees (T) & 1196${}_{-33,+24}$ & 0.080 & 27.1 & 17.2 & 0 & 17.2\% & 189.76 & 0.08 \\
41
- RandomForest (T) & 1159${}_{-33,+33}$ & 0.078 & 28.8 & 16.5 & 0 & 17.8\% & 323.74 & 0.08 \\
42
- TorchMLP (D) & 1082${}_{-30,+22}$ & 0.023 & 32.0 & 29.3 & 0 & 19.3\% & 6.83 & 0.15 \\
43
- FastaiMLP (D) & 1053${}_{-31,+29}$ & 0.031 & 33.1 & 29.9 & 0 & 22.1\% & 2.91 & 0.37 \\
44
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.013 & 34.8 & 32.9 & 0 & 24.1\% & 0.38 & 0.04 \\
45
- Linear (T+E) & 998${}_{-37,+26}$ & 0.042 & 34.9 & 25.5 & 0 & 29.2\% & 51.79 & 0.22 \\
46
- Linear (T) & 962${}_{-28,+24}$ & 0.027 & 36.1 & 30.6 & 0 & 30.1\% & 51.79 & 0.08 \\
47
- Linear (D) & 951${}_{-30,+28}$ & 0.019 & 36.3 & 27.8 & 0 & 31.1\% & 1.61 & 0.10 \\
48
- ExtraTrees (D) & 915${}_{-31,+32}$ & 0.010 & 37.3 & 34.3 & 0 & 26.7\% & 0.25 & 0.04 \\
49
- KNN (T+E) & 688${}_{-46,+39}$ & 0.000 & 41.7 & 41.4 & 0 & 48.5\% & 2.97 & 0.19 \\
50
- KNN (T) & 605${}_{-45,+48}$ & 0.000 & 42.7 & 42.5 & 0 & 50.3\% & 2.97 & 0.04 \\
51
- KNN (D) & 462${}_{-71,+70}$ & 0.000 & 43.9 & 43.7 & 0 & 58.7\% & 0.07 & 0.02 \\
52
- \bottomrule
53
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/full-imputed-cls/tabarena_leaderboard.csv DELETED
@@ -1,46 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- TABM_GPU (tuned + ensemble),34341.376551290836,9.60910518720136,4223.58680397795,3.4317121842487635,0.4949870968434646,0.5299923578456494,0.0,0.08239957044412496,0.05800528924519605,45864.929770826835,134.55326674312073,9.644736842105264,0.17047,8245.070318195554,2.6588550435172187,2466.2108648716858,1.5022465694891438,0.4807674997729695,0.5356016471758118,0.0,0.0374695276183053,0.02990990162448854,39661.48674821285,121.43882105497326,9.0,3,2,2,31,1574.1,27.7,27.1,0.8035287081339713,0.22390813053432343
3
- AutoGluon 1.3 (4h),7918.9967654708535,21.9962719316371,2834.6060407147975,3.0873029091200266,0.4234797507066772,0.42851773721350084,0.0,0.06891082208685434,0.03901042356832644,31011.170971980166,263.893057970457,9.710526315789474,0.159155,7086.632828738954,3.4718479580349393,1322.7174946761893,2.35500290690449,0.33567162503552783,0.4181224156574922,0.0,0.03226297629046343,0.018303111934730745,23372.022224152715,142.27612475951975,6.0,6,3,1,28,1573.4,30.0,33.4,0.8020334928229665,0.3009017525052929
4
- REALMLP (tuned + ensemble),88520.97457650169,55.83305884453288,9129.519936677754,18.43403910125304,0.5303101151258073,0.5509587531175434,0.0,0.08216963958005759,0.04917872511787768,141789.21603781544,866.2933837817693,10.460526315789474,0.16568,30350.410282479395,23.161008212301468,6519.687377737515,10.838853332215592,0.4880190327087218,0.5530787180349611,0.0,0.046844798741680516,0.023927651830585747,114204.24024987879,769.6999972028425,9.25,0,1,1,36,1552.0,31.2,29.1,0.784988038277512,0.1359044319221565
5
- GBM (tuned + ensemble),2957.8209862183407,11.98416593290909,759.1477152793035,2.5417865978850473,0.579228860288798,0.5992528140715053,0.0,0.09787395108553403,0.0587719228886129,8568.794143617388,184.93302208876983,11.131578947368421,0.16698000000000002,1552.8742877377404,3.4757837878333198,382.05361557599804,1.4876036641335277,0.6139242992830947,0.6320945623165324,0.0,0.050681941850522993,0.0219954442643497,7514.342598912193,103.35126359750211,10.0,1,1,2,34,1536.6,25.3,29.4,0.7697368421052632,0.15602996440606
6
- TABICL_GPU (default),107.96023476381747,19.280086713506464,9.625347263659664,2.230965763361112,0.4883393558984766,0.5463468792496827,0.05263157894736842,0.0794222885606876,0.05664734932761992,168.76491644336255,230.09998232516136,11.394736842105264,0.17207,25.732762111557854,3.449363695250617,8.684246340890724,1.7433667301105085,0.49313937962268345,0.5681798960380293,0.0,0.03858333399786745,0.018532235328146802,137.81489160855577,127.65482709424812,10.0,6,4,1,27,1529.5,22.7,30.1,0.763755980861244,0.2848797584062183
7
- TABM_GPU (tuned),34341.376551290836,1.0683778852050068,4223.58680397795,0.37888409513825755,0.580460857439686,0.5850736457317778,0.0,0.0927595492056652,0.07202518163358282,45864.929770826835,13.890136033418576,13.223684210526315,0.172585,8245.070318195554,0.2723346948623657,2466.2108648716858,0.17557376557934842,0.5594681994427791,0.6408535524589001,0.0,0.04800849593484757,0.03209545871697756,39661.48674821285,10.546435553124088,14.0,1,3,2,32,1487.1,28.6,29.5,0.722188995215311,0.15718600055013962
8
- CAT (tuned + ensemble),16504.07579820414,2.581779239609925,3201.0265707590415,0.8915752484646539,0.6052324431308997,0.6079016429921736,0.0,0.09115569260254941,0.05171280995480023,27193.868813721343,44.08651163500922,13.552631578947368,0.16040500000000002,5046.900271190538,1.2630292971928916,1372.9411122807264,0.5562989902961428,0.6047135908900858,0.6538224316024461,0.0,0.05787378892595674,0.023358345198772973,20541.46486167592,37.76547447057217,13.0,0,1,3,34,1479.0,23.4,25.1,0.7147129186602871,0.11519163508706867
9
- CAT (tuned),16504.07579820414,0.42638187025025576,3201.0265707590415,0.12024042770987867,0.6205816792872123,0.6195401164987226,0.0,0.09330164070819684,0.05138375883564522,27193.868813721343,6.229177098348047,13.881578947368421,0.161705,5046.900271190538,0.12877851062350804,1372.9411122807264,0.07385371803542701,0.6259023413364113,0.6684527369967126,0.0,0.05645008601114826,0.027822328565894415,20541.46486167592,4.9725379405984995,14.0,1,2,1,34,1469.3,24.3,26.2,0.7072368421052632,0.14306281114130998
10
- GBM (tuned),2957.8209862183407,1.875632255676894,759.1477152793035,0.5049369881097493,0.664414327281209,0.6568405029860219,0.0,0.10551185264608158,0.06848969063218771,8568.794143617388,32.37091249654915,14.302631578947368,0.16877999999999999,1552.8742877377404,0.5045170254177518,382.05361557599804,0.2538713244201605,0.7037360500796572,0.6730961643801268,0.0,0.05207128139882822,0.031112468640158004,7514.342598912193,15.175591971121808,12.5,0,0,0,38,1463.4,29.5,28.3,0.6976674641148325,0.08286433502159408
11
- TABPFNV2_GPU (tuned + ensemble),10002.759252123178,101.4355180454533,2653.4255131981217,44.749282859570734,0.5025912169508552,0.5552745352373752,0.3157894736842105,0.09695040281680861,0.08124534064551475,49565.00539849654,3444.4876051657507,14.671052631578947,0.17261500000000002,2077.023049354553,12.760541562239329,3008.2157047151595,20.848616639963154,0.508383993403513,0.5777919166792829,0.0,0.04061436251124223,0.031106953519191773,28624.579287895787,825.5109643363919,8.5,8,3,2,25,1453.8,24.8,30.9,0.6892942583732058,0.3228293379761294
12
- CAT (default),220.2627424415789,0.2710267032099049,109.9732905896213,0.13662992734425208,0.6377213723455996,0.653981819624179,0.0,0.10281875743582664,0.05525161599592243,404.9759898295662,6.476751807794606,14.68421052631579,0.16373,18.264583627382912,0.17750852637820774,5.723546572951673,0.0761539571798428,0.6679727149434583,0.668034897235229,0.0,0.05481736123368691,0.025874497523709752,103.69107151573829,5.120402547941957,16.0,1,3,1,33,1454.1,22.2,27.3,0.6889952153110048,0.14085028107005457
13
- XGB (tuned + ensemble),5957.374200563333,6.456417350253166,1167.526219278486,2.7907218103162132,0.6678696607918362,0.667447365405449,0.0,0.1067022658124541,0.0693895201734448,10832.944520166355,137.58836217438702,14.763157894736842,0.165745,1680.0658507664998,2.2809726662105985,685.86510540535,1.4547593315263065,0.7199373388598738,0.7259505715046674,0.0,0.0603611320440825,0.027448205694685396,8251.297786111467,74.47019952683146,13.0,0,1,1,36,1452.0,30.3,27.2,0.687200956937799,0.1077066830171761
14
- MNCA_GPU (tuned),57356.284085895306,20.170870465214488,5990.817791505437,2.0485901061394993,0.7110105277591534,0.6595940582554153,0.0,0.1050159299928894,0.0724200510289198,80456.91211763977,129.4389431731015,16.526315789473685,0.17054999999999998,14186.536935488384,0.6020842525694106,4879.890404506269,0.5247194359730172,0.7789380151687756,0.6741481297678236,0.0,0.06755655006003625,0.04344719805270737,66956.02864547497,27.735396922747526,16.5,1,0,0,37,1415.9,23.5,33.3,0.6471291866028708,0.10295238153020833
15
- XGB (tuned),5957.374200563333,1.301870340352867,1167.526219278486,0.6678561539347552,0.7214356698637235,0.7066756243519319,0.0,0.11080653156459744,0.07384263953337422,10832.944520166355,28.47499330573137,16.63157894736842,0.16848000000000002,1680.0658507664998,0.3827125522825453,685.86510540535,0.2050912539994952,0.7533869940641466,0.7531012522750911,0.0,0.0694652487234973,0.03410738014755424,8251.297786111467,11.461364611937853,15.5,0,0,0,38,1412.1,27.2,25.8,0.6447368421052632,0.07527320270823155
16
- MNCA_GPU (tuned + ensemble),57356.284085895306,531.7593351699455,5990.817791505437,50.07821547982156,0.6178778787330054,0.6105886884096879,0.0,0.10389701155615068,0.07981152020070442,80456.91211763977,3301.6213661597712,16.68421052631579,0.183035,14186.536935488384,14.17156207561493,4879.890404506269,8.743516387788919,0.6004419642795275,0.5800963441226324,0.0,0.06497462948315735,0.039747452446492706,66956.02864547497,548.1975258046007,12.5,0,2,4,32,1410.4,26.7,27.6,0.6435406698564593,0.13300821802463408
17
- TABPFNV2_GPU (tuned),10002.759252123178,3.410806728176206,2653.4255131981217,1.555769686886014,0.6152128339751168,0.6368249005546318,0.3157894736842105,0.12130685228554555,0.0956967324044558,49565.00539849654,114.00749994252256,17.842105263157894,0.1868,2077.023049354553,0.5060818235079447,3008.2157047151595,0.5144277113236544,0.6861393226426085,0.651413564581804,0.0,0.08561151707807774,0.042273687584978896,28624.579287895787,25.570859321617846,13.0,1,8,1,28,1387.8,32.2,29.9,0.6172248803827751,0.19384505458173865
18
- TABM_GPU (default),150.0762373017289,1.2507152029645372,19.886819500646006,0.46487430096802723,0.7200913651969231,0.7318128476710409,0.0,0.1257035538283091,0.09205633408961793,189.61145131944258,14.042337078320946,18.32894736842105,0.17246,31.126562476158142,0.20260944763819377,10.213381764059356,0.1381032773929915,0.8315398727681542,0.7950654471852272,0.0,0.06154427249600947,0.03281090889681732,144.96049255349743,10.968725907290855,18.0,0,0,1,37,1375.0,29.4,25.1,0.6061602870813397,0.08448030842994195
19
- TABPFNV2_GPU (default),11.400364582092442,0.8958970822786029,4.227300497658887,0.4575723059305235,0.646401200043383,0.6913643236884485,0.3157894736842105,0.13032850434642715,0.10434014574898658,53.73712299744585,29.471362460513575,18.842105263157894,0.1886,7.994279013739691,0.2908047080039978,3.368600991426515,0.3152861168047789,0.777395370386663,0.7325492470622512,0.0,0.07929605277196083,0.04313507441518306,41.34560285308973,17.75894630310718,17.0,4,1,4,29,1366.1,24.9,30.7,0.5944976076555024,0.21051957029694982
20
- NN_TORCH (tuned + ensemble),24331.947126566527,16.31368946276213,3050.9102763481856,4.325404104362444,0.7674791714524483,0.751965702834341,0.0,0.11637811814114107,0.08020239923609776,56038.718047349466,227.24142067178357,19.05263157894737,0.17071999999999998,9097.789536105262,5.056680162747702,2389.2199648500327,2.157502904371376,0.8880043848397714,0.8073663594549582,0.0,0.06782479524801643,0.04484831234958845,44480.90343297912,173.17543399472862,19.5,0,0,0,38,1364.3,29.0,27.1,0.5897129186602871,0.06737424255513398
21
- EBM (tuned + ensemble),36729.440274559965,1.3371900389766136,6141.104384884199,0.5036823041953499,0.811594312476448,0.8073262691422789,0.0,0.14906209202998957,0.10957472519228896,25271.27145534202,19.518642702178116,19.342105263157894,0.17122500000000002,2366.879786974854,0.4240463972091675,914.2329798556116,0.21634762578811195,0.9034396658321221,0.8509428943624828,0.0,0.0823298574360965,0.040258063765572304,15273.913117491418,11.254406005139426,19.0,0,0,1,37,1356.1,25.2,27.9,0.5831339712918661,0.07392904855395822
22
- REALMLP (tuned),88520.97457650169,2.4448067613512454,9129.519936677754,0.9659323148202512,0.8116906333552832,0.7529096930561373,0.0,0.11967014747000536,0.08502935951276701,141789.21603781544,39.671058690327705,19.355263157894736,0.17194500000000001,30350.410282479395,0.9911958641476102,6519.687377737515,0.5341468926001405,0.8954815082143113,0.7906338583190218,0.0,0.076222073390521,0.04561771439992753,114204.24024987879,35.28258688191838,18.75,0,0,0,38,1356.9,28.2,26.5,0.5828349282296651,0.059857762488610834
23
- FASTAI (tuned + ensemble),7309.51755415473,18.692008190266574,1376.1098802486467,8.473426897342513,0.7967114176056229,0.7935375376704039,0.0,0.14576367767903883,0.08697133573600783,18964.092381812123,455.5897128961598,21.013157894736842,0.178125,3087.37076303694,11.787012616793314,618.8953909329178,4.7655686359255345,0.9993616733332265,0.8596726175256504,0.0,0.08731644523471122,0.054465543302125864,15284.817189242676,443.8905026950689,22.75,0,1,0,37,1323.2,22.6,28.2,0.5451555023923444,0.08034444085096946
24
- MNCA_GPU (default),304.22019695963775,10.484658345144394,17.60828380729721,1.3061868722894623,0.8600476423581978,0.8133618407225838,0.0,0.14684370064110588,0.09623286753523605,254.62493914649235,74.63976386946672,21.789473684210527,0.18519,31.500762327512106,0.5732622504234314,14.777266169000828,0.34634581634079226,1.0,0.893695279008777,0.0,0.09230605494417649,0.051103226670839136,209.09978531409226,23.91921150117286,23.0,1,0,0,37,1306.1,25.7,27.4,0.527511961722488,0.08099601234817519
25
- EBM (tuned),36729.440274559965,0.18373215324000308,6141.104384884199,0.08212434505580339,0.8701293455239967,0.8449855604868557,0.0,0.15617886229027844,0.11711713769734972,25271.27145534202,2.5019643980329747,22.42105263157895,0.17203000000000002,2366.879786974854,0.0449512971772088,914.2329798556116,0.02528246646038782,1.0,0.8943570495572675,0.0,0.08835084225135731,0.04420635043937865,15273.913117491418,1.252412448907763,23.5,0,0,0,38,1295.1,25.2,29.9,0.5131578947368421,0.05576760659451273
26
- EBM (default),119.68166406294058,0.19009479611937763,11.429209024387047,0.10185762188607354,0.8572475476046651,0.8539911315322448,0.0,0.16604276757956396,0.12380006768435853,109.6134925105603,3.4110931424674527,23.894736842105264,0.17447000000000001,9.92618230978648,0.06371633741590713,4.31382445805991,0.0475851422516083,1.0,0.9282824912353611,0.0,0.09554880640873625,0.03755874966541475,60.70178540099199,2.7834768100426253,24.0,1,0,2,35,1264.9,28.3,35.7,0.4796650717703349,0.08753724330105657
27
- REALMLP (default),302.86446626785903,3.0018129969200893,26.277581916467653,2.856467141391241,0.9122374081508348,0.85760098469969,0.0,0.14572889637854225,0.10212034328423684,472.82883048215837,115.85189921010567,24.5,0.17367,103.08245442973242,2.582352219687568,21.83278809042843,0.8957992336206358,1.0,0.8896440117833417,0.0,0.11770994919600164,0.06845697585035651,372.59928806765174,52.85349774079455,25.5,0,0,0,38,1253.4,23.0,26.4,0.4659090909090909,0.04751011701630104
28
- XGB (default),13.116732126927516,0.5742131232518202,3.202512268619222,0.2981186068489448,0.8843514041095932,0.8510762104833869,0.0,0.14066462097580287,0.11671766984062094,31.46292861336276,13.4756369742995,24.5,0.17317,5.653352538744608,0.30113152662913,1.771208861779989,0.11707781619763814,1.0,0.9410060237993583,0.0,0.09869044613168698,0.06054796407572259,28.26053761224749,9.127662964815336,24.0,0,0,0,38,1253.3,28.1,30.7,0.4659090909090909,0.05191659220532364
29
- XT (tuned + ensemble),1317.4209560674533,2.9519454171085915,472.65138083581655,1.3657584923642982,0.889559570547466,0.8556261516301992,0.0,0.1579271337121436,0.11679174830217959,4571.615940832833,75.60244638381447,24.57894736842105,0.17667500000000003,756.8230986197789,1.8136235740449693,189.76252609436062,0.7431041876698922,1.0,0.93225990863037,0.0,0.09299076955407681,0.0685642717298475,2805.66154207989,66.39288968996527,27.5,0,0,1,37,1251.6,23.8,28.6,0.46411483253588515,0.05948664377868326
30
- TABDPT_GPU (default),171.71139350780967,66.09824930987163,27.724576482795502,22.626481529214185,0.8200159004748209,0.8098229820925923,0.0,0.1548400388440821,0.1161870388203104,481.10896045076544,1338.9171702872993,25.105263157894736,0.190385,97.80311637454562,28.07416233751509,22.609050986069803,8.552450841932743,1.0,0.9583506424916117,0.0,0.1092338236572048,0.046097964205681456,400.67828468381333,1123.8959746745188,30.0,2,0,3,33,1238.6,27.7,34.6,0.45215311004784686,0.12327697073991685
31
- NN_TORCH (tuned),24331.947126566527,0.8716282456241854,3050.9102763481856,0.24396545036982029,0.8935160561232315,0.8422311335793125,0.0,0.14120278296028227,0.10508427239733291,56038.718047349466,12.138164397560297,25.144736842105264,0.17446499999999998,9097.789536105262,0.29952494303385413,2389.2199648500327,0.15177921475257505,1.0,0.9030726729865997,0.0,0.10379368194908523,0.06014400941853005,44480.90343297912,9.196372470124006,25.5,0,0,0,38,1237.2,25.5,25.3,0.451255980861244,0.046101906732064955
32
- FASTAI (tuned),7309.51755415473,1.0496307073977955,1376.1098802486467,0.623937267000547,0.9066743837597047,0.8621224292232853,0.0,0.16578900249006634,0.10852452399743645,18964.092381812123,32.00078545496663,26.06578947368421,0.18023499999999998,3087.37076303694,0.8054822285970051,618.8953909329178,0.2978802219128553,1.0,0.9088593404454739,0.0,0.09789639604232409,0.0649858160039401,15284.817189242676,26.536834640421894,26.0,0,0,0,38,1220.2,27.5,28.1,0.43032296650717705,0.04823155314220717
33
- RF (tuned + ensemble),2309.3465478268977,2.3587986137434753,541.3031953907538,1.2662572218585502,0.8949088487058612,0.8653921072161802,0.0,0.16581458532483917,0.12803665978914544,5371.163113535875,67.79453318661523,26.31578947368421,0.177925,871.1966819789675,1.9029027620951335,323.74369638605225,0.7428875097152683,1.0,0.976443274983263,0.0,0.10359622790389916,0.07479078376008455,4278.677975908691,61.67862848692378,28.5,0,1,1,36,1213.5,24.8,23.2,0.4246411483253589,0.06923937840225046
34
- GBM (default),7.9087888545460165,0.5753568479889317,2.951996847158713,0.17011749256975625,0.9125347540697407,0.8867856543564054,0.0,0.15377361970482045,0.11510537200611927,31.85523047906877,10.81395814590769,27.026315789473685,0.172975,5.532836645179325,0.2585195038053725,1.7913477923414471,0.12049981156984965,1.0,0.953002789600119,0.0,0.11068698010353623,0.0625734900521432,25.045220341015206,6.549914554959342,28.0,0,0,0,38,1197.8,28.8,27.4,0.4084928229665072,0.04242553702308852
35
- XT (tuned),1317.4209560674533,0.3043035832762021,472.65138083581655,0.1722314796858175,0.9201818360376706,0.887453690669917,0.0,0.1716073337665082,0.12727956433250776,4571.615940832833,8.349379059762423,27.11842105263158,0.17796,756.8230986197789,0.18769407272338867,189.76252609436062,0.07878183958882805,1.0,0.9660073423997149,0.0,0.10587649481110034,0.07275368947647169,2805.66154207989,8.013491457037578,30.0,0,1,0,37,1195.6,24.0,32.1,0.4063995215311005,0.05828850461349728
36
- RF (tuned),2309.3465478268977,0.23789871352457861,541.3031953907538,0.15647241398955916,0.9224816586070888,0.8914900089794889,0.0,0.17793267481613345,0.14008678334835084,5371.163113535875,7.241889916329925,28.763157894736842,0.178915,871.1966819789675,0.17230602105458576,323.74369638605225,0.07643497412773152,1.0,0.9949804738019044,0.0,0.11788865245982966,0.07710393358859187,4278.677975908691,6.216263662191708,31.5,0,1,1,36,1159.2,32.8,32.1,0.36901913875598086,0.060732505395188326
37
- NN_TORCH (default),48.66164081361559,0.6504810580733227,11.536659167937149,0.237589986723434,0.9768277299582552,0.9516276223313249,0.0,0.19345935921063773,0.14725824727620437,155.18773864824473,10.878820110101595,31.986842105263158,0.180355,26.916632894674937,0.2675716214709811,6.83469910157457,0.14703020953097523,1.0,0.9983912483912485,0.0,0.142508632176782,0.09058242754644441,137.05069981706868,8.619821917518276,33.0,0,0,0,38,1081.6,21.3,29.3,0.29575358851674644,0.0341695156824269
38
- FASTAI (default),31.12571889106293,1.103970385504048,5.0919225272079345,0.5139922583887925,0.9692956872023926,0.9418353795989344,0.0,0.2205946249171078,0.17081144202844967,74.47424680202033,27.97257070113272,33.078947368421055,0.19183499999999998,12.713354892200893,0.7982388072543674,2.9120182447539116,0.36810695156439827,1.0,1.0,0.0,0.16584224786083723,0.11423976777713259,60.1261932941261,24.301698162325565,36.0,0,0,0,38,1052.8,28.1,30.9,0.27093301435406697,0.03346355889150736
39
- RF (default),3.928520040972191,0.1435067986187182,0.8021093004373405,0.07125612411976835,0.9873793306417248,0.9675347593198196,0.0,0.24146620137116087,0.23100094778843142,6.102631035159317,3.7416299644570024,34.80263157894737,0.21025,1.1966572999954224,0.08589340580834282,0.3813090053938437,0.03721195658349352,1.0,1.0,0.0,0.17586669886220047,0.11692775501298655,5.480722222590385,3.4579154094346745,36.5,0,0,0,38,1000.0,0.0,0.0,0.23175837320574164,0.03044058622007152
40
- LR (tuned + ensemble),310.9206490230839,1.856038474618343,112.43523004573552,0.6244822981818133,0.9583388751756515,0.9525027008333006,0.0,0.2917619246531289,0.2639449201736458,1090.6543235756506,24.154836057241496,34.94736842105263,0.20569500000000002,172.99803659651013,0.33067578077316284,51.78500762114817,0.22385943240534312,1.0,1.0,0.0,0.23092004636837282,0.1418450851884247,696.2608974821628,13.25530093456624,38.0,0,0,1,37,998.1,25.5,37.0,0.2284688995215311,0.039205008884686546
41
- LR (tuned),310.9206490230839,0.5155129476597434,112.43523004573552,0.17331679222553462,0.9727265210610188,0.9599648989348959,0.0,0.30105734113761295,0.27391129557139166,1090.6543235756506,6.903044838007482,36.05263157894737,0.20751,172.99803659651013,0.12874411212073433,51.78500762114817,0.07805190196778514,1.0,1.0,0.0,0.23702586233900474,0.14460038903245942,696.2608974821628,4.422167155622043,38.5,0,0,0,38,961.6,23.9,27.7,0.20334928229665072,0.0327143779465347
42
- LR (default),7.59960079534709,0.5285763780973111,2.68894957171299,0.18897855365485863,0.9811233759976523,0.9657118387068965,0.0,0.3109312407898901,0.298004116613438,27.747857838408066,7.876763349331487,36.3421052631579,0.21230500000000002,5.359276652336121,0.13779839674631755,1.6116061178401027,0.09774404154712064,1.0,1.0,0.0,0.23702577885376547,0.1541261802297627,18.023625361427616,4.735441569338942,40.0,0,0,1,37,950.6,27.1,29.7,0.19677033492822968,0.03600631416753817
43
- XT (default),2.7359117016457675,0.18087529669031066,0.7550887156747417,0.07422230753863952,0.9896376739572407,0.9751584322493536,0.0,0.26658722117284267,0.26095559827613696,5.173741811459425,4.243951698357253,37.26315789473684,0.21292,1.01790091726515,0.09051434199015299,0.24605929188859293,0.04072451222400395,1.0,1.0,0.0,0.18424442460299273,0.13780186094528035,4.456264068369281,3.7424721126192138,39.0,0,0,0,38,914.9,32.0,30.5,0.17583732057416268,0.0291505121116296
44
- KNN (tuned + ensemble),167.0455302492917,11.66904854404996,12.260943976874097,0.77695539036087,1.0,0.9962934098751769,0.13157894736842105,0.48462823969745605,0.5970228537293082,72.48328913141673,77.9475913060204,41.671052631578945,0.318405,9.821025305324131,0.2367298404375712,2.969713103492417,0.18997109296167614,1.0,1.0,0.0,0.42625485653324524,0.6596342220080722,56.9019152818522,12.487074071232104,43.0,0,0,0,38,688.4,38.7,45.5,0.0756578947368421,0.024178278159192945
45
- KNN (tuned),167.0455302492917,1.8054817143936601,12.260943976874097,0.13106223823409818,1.0,0.9976590626673897,0.13157894736842105,0.5033620273790689,0.6430012930336472,72.48328913141673,12.511864612142434,42.671052631578945,0.322975,9.821025305324131,0.0851174063152737,2.969713103492417,0.040417757021976156,1.0,1.0,0.0,0.4549174904387682,0.7006933550921264,56.9019152818522,2.342964973116946,44.0,0,0,0,38,605.3,48.0,44.4,0.05293062200956938,0.02351625016097909
46
- KNN (default),1.7449495283483762,0.22627578220869365,0.489346568018936,0.038714559202156204,1.0,1.0,0.13157894736842105,0.5871637410556744,0.939313827722613,1.0055419249185589,2.3649007838074403,43.921052631578945,0.382765,0.27595198154449463,0.036337282922532826,0.07126887487893994,0.021006283652748647,1.0,1.0,0.0,0.5463314318406584,1.0,1.0,1.2165713596834893,45.0,0,0,0,38,462.1,69.7,70.6,0.02452153110047847,0.022868529201343697
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/full-imputed-cls/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:bbc2ab1a3ea54968a3a62048f7eb8aca8a04c6fde89ea8d5ada95afcede70782
3
- size 460639
 
 
 
 
data/full-imputed-cls/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:2a2a620c17504a88fecf451859711c855ea2229c399a54cd2762da04e25a5af2
3
- size 243316
 
 
 
 
data/full-imputed-cls/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:ae99128553de5b41518ecdf1611e1cb39792ba074adcc18311e32dea5bf3be5b
3
- size 231751
 
 
 
 
data/full-imputed-reg/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:7bb27a5f27783da8e6e9d2f72080c15fdd982b77663cc61fb042962fa00d3ac5
3
- size 333644
 
 
 
 
data/full-imputed-reg/leaderboard.tex DELETED
@@ -1,52 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- AutoGluon 1.3 (4h) & \textcolor{gold}{\textbf{1795${}_{-55,+67}$}} & \textcolor{gold}{\textbf{0.740}} & \textcolor{gold}{\textbf{4.6}} & \textcolor{silver}{\textbf{2.5}} & 2 & \textcolor{silver}{\textbf{2.7\%}} & 1625.74 & 6.76 \\
8
- RealMLP (T+E) & \textcolor{silver}{\textbf{1766${}_{-64,+61}$}} & \textcolor{silver}{\textbf{0.732}} & \textcolor{silver}{\textbf{5.3}} & 3.4 & 0 & \textcolor{gold}{\textbf{2.0\%}} & 7141.94 & 9.66 \\
9
- ModernNCA (T+E) & \textcolor{bronze}{\textbf{1632${}_{-52,+51}$}} & 0.625 & \textcolor{bronze}{\textbf{8.3}} & \textcolor{bronze}{\textbf{2.8}} & \textcolor{silver}{\textbf{3}} & 3.8\% & 3779.52 & 7.69 \\
10
- TabDPT (D) & 1620${}_{-62,+53}$ & \textcolor{bronze}{\textbf{0.651}} & 8.8 & \textcolor{gold}{\textbf{2.3}} & \textcolor{gold}{\textbf{5}} & \textcolor{bronze}{\textbf{2.9\%}} & 22.53 & 8.55 \\
11
- CatBoost (T+E) & 1616${}_{-51,+59}$ & 0.547 & 8.9 & 7.2 & 0 & 4.5\% & 3552.96 & 0.97 \\
12
- LightGBM (T+E) & 1609${}_{-53,+58}$ & 0.542 & 9.2 & 7.1 & 0 & 5.0\% & 700.15 & 9.32 \\
13
- CatBoost (T) & 1568${}_{-46,+64}$ & 0.524 & 10.3 & 6.9 & 0 & 4.6\% & 3552.96 & 0.10 \\
14
- TabM (T+E) & 1562${}_{-64,+54}$ & 0.494 & 10.5 & 6.5 & 0 & 3.3\% & 4158.29 & 1.41 \\
15
- XGBoost (T+E) & 1496${}_{-51,+43}$ & 0.447 & 12.8 & 12.3 & 0 & 5.5\% & 834.93 & 2.61 \\
16
- LightGBM (T) & 1490${}_{-50,+55}$ & 0.445 & 12.9 & 10.8 & 0 & 5.6\% & 700.15 & 0.97 \\
17
- XGBoost (T) & 1474${}_{-46,+46}$ & 0.414 & 13.7 & 13.1 & 0 & 5.6\% & 834.93 & 0.39 \\
18
- ModernNCA (T) & 1446${}_{-49,+52}$ & 0.360 & 14.6 & 7.2 & 0 & 5.9\% & 3779.52 & 0.40 \\
19
- TabM (T) & 1440${}_{-54,+39}$ & 0.392 & 15.0 & 11.6 & 0 & 4.3\% & 4158.29 & 0.17 \\
20
- CatBoost (D) & 1439${}_{-57,+48}$ & 0.400 & 15.0 & 11.5 & 0 & 6.2\% & 10.89 & 0.09 \\
21
- RealMLP (T) & 1405${}_{-53,+53}$ & 0.350 & 16.4 & 13.9 & 0 & 4.6\% & 7141.94 & 0.39 \\
22
- TabPFNv2 (T+E) & 1381${}_{-47,+55}$ & 0.414 & 17.2 & 3.1 & \textcolor{silver}{\textbf{3}} & 5.1\% & 4223.87 & 27.54 \\
23
- ModernNCA (D) & 1340${}_{-54,+50}$ & 0.216 & 18.8 & 14.1 & 0 & 7.4\% & 15.50 & 0.30 \\
24
- TabM (D) & 1329${}_{-48,+53}$ & 0.300 & 19.2 & 15.6 & 0 & 6.0\% & 13.32 & 0.13 \\
25
- TorchMLP (T+E) & 1306${}_{-57,+46}$ & 0.182 & 20.2 & 14.6 & 0 & 7.6\% & 4608.59 & 1.23 \\
26
- TabPFNv2 (T) & 1300${}_{-55,+53}$ & 0.287 & 20.6 & 8.4 & 0 & 6.2\% & 4223.87 & 0.45 \\
27
- RealMLP (D) & 1284${}_{-48,+47}$ & 0.152 & 21.3 & 18.4 & 0 & 7.0\% & 21.86 & 0.84 \\
28
- ExtraTrees (T+E) & 1272${}_{-60,+53}$ & 0.162 & 21.5 & 13.6 & 0 & 10.0\% & 158.22 & 0.84 \\
29
- LightGBM (D) & 1264${}_{-54,+53}$ & 0.091 & 21.9 & 21.3 & 0 & 8.1\% & 2.11 & 0.27 \\
30
- ExtraTrees (T) & 1255${}_{-53,+42}$ & 0.136 & 22.3 & 16.9 & 0 & 10.3\% & 158.22 & 0.15 \\
31
- TabPFNv2 (D) & 1231${}_{-61,+59}$ & 0.238 & 23.3 & 11.5 & 0 & 7.6\% & 2.80 & 0.31 \\
32
- TorchMLP (T) & 1230${}_{-47,+46}$ & 0.131 & 23.3 & 20.1 & 0 & 8.4\% & 4608.59 & 0.10 \\
33
- XGBoost (D) & 1215${}_{-46,+42}$ & 0.114 & 24.1 & 21.5 & 0 & 8.8\% & 2.24 & 0.24 \\
34
- RandomForest (T+E) & 1203${}_{-48,+46}$ & 0.076 & 24.5 & 22.4 & 0 & 10.9\% & 515.73 & 0.77 \\
35
- RandomForest (T) & 1153${}_{-58,+49}$ & 0.055 & 26.2 & 24.5 & 0 & 11.4\% & 515.73 & 0.12 \\
36
- EBM (T+E) & 1136${}_{-61,+46}$ & 0.171 & 27.0 & 13.4 & 0 & 13.7\% & 1890.68 & 0.13 \\
37
- ExtraTrees (D) & 1100${}_{-63,+56}$ & 0.069 & 28.1 & 25.0 & 0 & 12.2\% & 0.47 & 0.06 \\
38
- EBM (T) & 1092${}_{-57,+50}$ & 0.150 & 28.5 & 17.0 & 0 & 14.2\% & 1890.68 & 0.01 \\
39
- FastaiMLP (T+E) & 1040${}_{-50,+62}$ & 0.024 & 30.0 & 28.0 & 0 & 12.2\% & 540.06 & 2.67 \\
40
- TorchMLP (D) & 1038${}_{-63,+58}$ & 0.017 & 30.2 & 28.2 & 0 & 11.9\% & 20.48 & 0.08 \\
41
- EBM (D) & 1034${}_{-67,+44}$ & 0.109 & 30.6 & 27.7 & 0 & 15.1\% & 6.33 & 0.04 \\
42
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.000 & 31.3 & 30.7 & 0 & 12.9\% & 0.53 & 0.06 \\
43
- FastaiMLP (T) & 992${}_{-65,+49}$ & 0.014 & 31.7 & 30.4 & 0 & 12.7\% & 540.06 & 0.32 \\
44
- FastaiMLP (D) & 858${}_{-70,+53}$ & 0.000 & 34.9 & 34.3 & 0 & 17.1\% & 2.60 & 0.39 \\
45
- KNN (T+E) & 532${}_{-96,+78}$ & 0.000 & 39.8 & 39.6 & 0 & 36.1\% & 2.43 & 0.14 \\
46
- Linear (T+E) & 489${}_{-95,+53}$ & 0.000 & 40.3 & 40.2 & 0 & 35.4\% & 45.74 & 0.11 \\
47
- KNN (T) & 442${}_{-135,+68}$ & 0.000 & 40.8 & 40.6 & 0 & 36.8\% & 2.43 & 0.03 \\
48
- Linear (T) & 421${}_{-95,+82}$ & 0.000 & 40.9 & 40.8 & 0 & 35.6\% & 45.74 & 0.05 \\
49
- Linear (D) & 290${}_{-113,+87}$ & 0.000 & 42.2 & 42.1 & 0 & 38.1\% & 1.19 & 0.09 \\
50
- KNN (D) & 239${}_{-146,+87}$ & 0.000 & 42.6 & 42.4 & 0 & 40.8\% & 0.04 & 0.02 \\
51
- \bottomrule
52
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/full-imputed-reg/tabarena_leaderboard.csv DELETED
@@ -1,45 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- AutoGluon 1.3 (4h),10053.09853225386,65.70725388180496,3138.716102354025,8.201870690838728,0.2604881581536672,0.2971194348020468,0.0,0.026995880876839205,0.03762065178367739,79227.35072832079,1344.7158520024777,4.615384615384615,4.16484,12938.261539538702,10.17258334159851,1625.738447135909,6.759745919437983,0.23036881663552905,0.30366624015393634,0.0,0.015883109174547605,0.016643068102528722,85166.92529682687,517.5224332581762,4.0,2,3,1,7,1795.1,67.0,54.8,0.9159212880143113,0.3928904428904429
3
- REALMLP (tuned + ensemble),66543.44573790843,35.91296550086421,7325.429690273165,14.32081689268905,0.2681319612182075,0.3168526714039812,0.0,0.019775498611648384,0.029999838203468132,290863.52338623884,790.182850123545,5.3076923076923075,4.13253,35612.93265602324,25.25520912806193,7141.940159399529,9.664030993978182,0.21463277228840488,0.3314660386375113,0.0,0.015498759976369625,0.017118693739991864,262171.8658686582,808.3809230744637,3.0,0,4,3,6,1765.6,60.7,63.8,0.8998211091234347,0.2966880341880342
4
- MNCA_GPU (tuned + ensemble),31759.101002019288,48.99773550746787,6166.048475961011,17.643273328787647,0.37486519261010337,0.3801843857676789,0.0,0.038093035353910405,0.03729025388197337,186080.1526868885,677.6872983460363,8.307692307692308,4.48697,16310.556293937894,10.227302259869045,3779.5248398651206,7.690422738079533,0.33564273912263787,0.35100871313945403,0.0,0.019057076597787104,0.02404483039332963,198038.2195884601,507.5335424076248,7.0,3,1,1,8,1632.5,51.0,51.8,0.8300536672629696,0.35811118705855544
5
- TABDPT_GPU (default),150.02111775732448,54.38835432835114,28.728808967170515,25.042434382939593,0.34889596641909676,0.3868397999855826,0.0,0.0290119218252697,0.03451422020936026,859.9949957462201,1916.4746059233426,8.846153846153847,4.26747,137.4807067182329,29.239719518025716,22.52690614988171,8.547073882575031,0.2832974905649298,0.3551487188815256,0.0,0.024988993973347418,0.022011234994563023,904.9754515353668,1821.0928942426492,8.0,5,0,0,8,1620.5,52.9,61.7,0.817531305903399,0.44424741924741923
6
- CAT (tuned + ensemble),30302.612278386874,4.803950041991014,6846.603031359736,1.23549623448159,0.4533481852287783,0.4926936982252365,0.0,0.04475906131711523,0.05586568278238831,150000.99966111325,116.73562230823504,8.923076923076923,4.20864,22090.5574801498,2.685193909539117,3552.958864906998,0.9657383741190037,0.4197753110205541,0.5095036442102125,0.0,0.0214910315460225,0.0360010253274791,164350.82166574517,80.04598787510639,9.0,0,0,1,12,1615.9,58.2,51.0,0.815742397137746,0.13981452645420256
7
- GBM (tuned + ensemble),3469.6354441180188,51.50608464469258,806.792675338677,8.57889161766087,0.4578704418351117,0.4915767713552377,0.0,0.049628721858481745,0.06009436123338137,20529.34803212188,1135.938430886033,9.153846153846153,4.21165,3055.419878217909,17.84196005927192,700.1537746143185,9.321818212785747,0.41469312312474516,0.4239687696189567,0.0,0.026696516466175324,0.03428340330572247,21786.570035668443,556.5446710812895,7.0,0,0,0,13,1608.9,57.6,52.5,0.8103756708407871,0.14022351917088757
8
- CAT (tuned),30302.612278386874,0.5550543483505901,6846.603031359736,0.16909210735792113,0.47626653515508005,0.5114892609915386,0.0,0.04596154469633722,0.059493084445231575,150000.99966111325,13.40589862804556,10.26923076923077,4.23064,22090.5574801498,0.3789627022213406,3552.958864906998,0.10450043094654878,0.5098664416231588,0.5042459184199164,0.0,0.026006470182449948,0.03741223775007456,164350.82166574517,10.277501137463553,10.0,0,1,1,11,1568.5,63.6,45.8,0.7844364937388193,0.14579928618390156
9
- TABM_GPU (tuned + ensemble),42663.03407314721,6.401903104782104,7337.2216133773,2.4450322937588758,0.5064842509501634,0.5097062474762206,0.0,0.03285461585754071,0.07923986245243698,180683.6213219068,120.23686774247648,10.538461538461538,4.1458,20820.912871148852,3.0152715841929116,4158.291053548661,1.4096720886484275,0.41808698686448353,0.44811012371201403,0.0,0.031023728849073895,0.02730307401826167,166291.5239595113,107.26839105814781,6.0,0,0,1,12,1562.3,53.2,63.1,0.778175313059034,0.15363693425074498
10
- XGB (tuned + ensemble),6384.208144447945,12.513759183476113,1409.0815132938305,4.086239880001324,0.5528907467847085,0.5842672042727117,0.0,0.05501928538394466,0.06767162618009932,26398.72664659111,353.2622484982949,12.846153846153847,4.22219,2596.930354913076,6.618133616447449,834.9300717202715,2.614265349176195,0.6061614351314211,0.632068655632026,0.0,0.03390767360912417,0.047976586381100285,26370.57574541831,164.97865636299682,13.0,0,0,0,13,1495.9,43.0,50.2,0.7245080500894454,0.0815581414563315
11
- GBM (tuned),3469.6354441180188,7.385669826034806,806.792675338677,1.273710690539978,0.5553229125473071,0.5751848987218714,0.0,0.0557293790854645,0.07273694432227554,20529.34803212188,170.85098129275468,12.923076923076923,4.23482,3055.419878217909,2.690451833936903,700.1537746143185,0.9682498776019389,0.5552965190223861,0.5297118411020986,0.0,0.027972044011269404,0.03922298758674819,21786.570035668443,130.77892053638826,12.0,0,0,0,13,1490.1,54.7,49.3,0.7227191413237924,0.09231764967059085
12
- XGB (tuned),6384.208144447945,2.5463058025409016,1409.0815132938305,0.7958717570786976,0.5861202672596529,0.6108958009331424,0.0,0.056497224579359025,0.07175664097406265,26398.72664659111,74.35901041305158,13.73076923076923,4.23559,2596.930354913076,1.7797584003872342,834.9300717202715,0.3883258596259137,0.6620816436761664,0.6284024475165967,0.0,0.03317270939304562,0.04927060675426248,26370.57574541831,37.05707269936294,12.0,0,0,0,13,1474.5,45.4,45.1,0.7039355992844365,0.07642316833493304
13
- MNCA_GPU (tuned),31759.101002019288,2.180037997319148,6166.048475961011,0.7699041848491119,0.640014197538291,0.6006804484966409,0.0,0.05893528628182787,0.07225555979752735,186080.1526868885,32.66971509741366,14.615384615384615,4.74895,16310.556293937894,0.48738079600863987,3779.5248398651206,0.3958729871859153,0.6568741279318248,0.634131414678689,0.0,0.027829209062692706,0.05013010527584701,198038.2195884601,22.639832589759934,16.0,0,1,2,10,1446.4,51.4,48.7,0.6833631484794276,0.13858206746510898
14
- TABM_GPU (tuned),42663.03407314721,0.6790118826760185,7337.2216133773,0.2826346699151617,0.6078346969283082,0.6089177873032084,0.0,0.04291459744083105,0.1000977395685804,180683.6213219068,13.142332198971314,15.0,4.26932,20820.912871148852,0.3543446593814426,4158.291053548661,0.16724776809008424,0.5483643325463751,0.5720139858507244,0.0,0.039608740417227284,0.035365730806074164,166291.5239595113,12.524825096296247,13.0,0,0,0,13,1440.3,38.9,53.5,0.6744186046511628,0.08607273216026531
15
- CAT (default),101.69209800418625,0.27084102752881173,23.09400681118272,0.13269262448628016,0.5995482740998302,0.6304472090224245,0.0,0.06246023871361243,0.07708318676535167,496.3567653671193,9.26273127760114,15.038461538461538,4.21395,62.699359814325966,0.2736650307973226,10.889876924735699,0.09199146861016888,0.5823029883647647,0.640700383892448,0.0,0.030438989353365198,0.03712451320809096,412.72250349454265,8.72239577968297,13.0,0,0,0,13,1438.7,47.7,56.4,0.6735241502683363,0.08685217342752263
16
- REALMLP (tuned),66543.44573790843,1.7411092819311682,7325.429690273165,0.8126671252939139,0.6503917944927049,0.6335765314526475,0.0,0.04638057342567833,0.08360703671071543,290863.52338623884,43.982520909683025,16.384615384615383,4.34383,35612.93265602324,1.0202796989017062,7141.940159399529,0.394499832888234,0.6897823031948702,0.6773094419226776,0.0,0.042257970846457216,0.05482988743609004,262171.8658686582,39.332347707117826,17.0,0,0,0,13,1404.9,52.4,52.4,0.6422182468694096,0.07190526635498172
17
- TABPFNV2_GPU (tuned + ensemble),7396.527001981857,78.03043969614893,3418.770967200307,56.25472027093109,0.5864638170450713,0.6417803537075669,0.46153846153846156,0.05066211692688948,0.09308967259297021,101895.60372403836,4530.894138806997,17.192307692307693,4.11819,3945.7989165067675,13.798940539360046,4223.8673583405725,27.542795487744606,0.7230870148989033,0.7944604017471419,0.0,0.03408450486195502,0.03467029575770307,99491.49864758716,2157.1863969834576,19.0,3,1,1,8,1380.7,54.4,46.1,0.6234347048300537,0.31937392494449274
18
- MNCA_GPU (default),99.58075828980176,2.0463109099966847,18.34482305470451,0.7039566701063034,0.7844574247003018,0.7513144031861464,0.0,0.07422761878566567,0.09797180456330291,547.7971597202185,27.012858819060174,18.76923076923077,4.94033,49.107323222690155,0.41290783882141113,15.50085128260229,0.29868905742963153,0.9390070942659281,0.8904304199346637,0.0,0.03934956674595991,0.09658469357099869,536.5510136544884,15.897990578305649,18.0,0,0,0,13,1340.3,49.6,53.2,0.5867620751341681,0.07094113282642321
19
- TABM_GPU (default),109.08048485323914,0.5565733836247371,30.11061011517182,0.2424694191265854,0.6999854926260701,0.720056022702389,0.0,0.060357587291533545,0.131331635834194,580.6229196238922,9.300928058329742,19.23076923076923,4.27071,65.0270922978719,0.1736939483218723,13.315938751381204,0.13061717308512005,0.6917071760987948,0.6600858527247827,0.0,0.05131334419415079,0.05222355107697133,354.8698900002396,7.016702417594013,17.0,0,0,0,13,1329.0,52.5,47.2,0.5760286225402504,0.06404901956439267
20
- NN_TORCH (tuned + ensemble),44960.17932087947,4.03415755573501,5714.412959426929,1.3298489659738517,0.8184873120659005,0.8128616776400956,0.0,0.07610055505274997,0.12013496922800454,220430.49479235077,97.5953017172201,20.23076923076923,4.65351,15497.21247045199,2.8239229255252414,4608.594420268999,1.2325370779691025,0.9269135314055867,0.8896068156264655,0.0,0.05961664423383828,0.08340398612928321,177697.9133637009,96.53385472639289,21.0,0,0,0,13,1306.3,46.0,56.5,0.552772808586762,0.06851055964209914
21
- TABPFNV2_GPU (tuned),7396.527001981857,3.794137261464045,3418.770967200307,1.977808999126841,0.7134155719592534,0.7418050587973032,0.46153846153846156,0.06180041784216738,0.11851504520949105,101895.60372403836,193.9605838613745,20.576923076923077,4.17059,3945.7989165067675,0.8012150287628174,4223.8673583405725,0.45159866325593434,1.0,1.0,0.0,0.05688219535982175,0.06558436878663254,99491.49864758716,19.472884837488856,28.0,0,1,1,11,1299.8,52.5,54.5,0.5447227191413238,0.11949963987283921
22
- REALMLP (default),188.56736958373307,2.8279882822281275,22.387772498457988,3.1534644940370367,0.8482043138660138,0.8308100952567204,0.0,0.07049246793231062,0.12882128240536025,856.0818546821283,166.2876065138886,21.307692307692307,4.68787,113.99487728542752,2.6188460985819497,21.860990319181454,0.8382976743420607,0.985771610248003,0.9237062317944604,0.0,0.056187515565281676,0.10042346984448043,594.293460461532,57.629925208930345,22.0,0,0,0,13,1283.5,46.1,47.1,0.5277280858676208,0.054416123092593686
23
- XT (tuned + ensemble),1206.5194327423715,3.367576224375994,442.36567464742467,1.069316176291616,0.8375900418733949,0.8513977817862853,0.0,0.10048529388089425,0.120775550104976,8220.916376158188,88.72398178178156,21.53846153846154,5.04794,766.4287914435068,3.576531834072537,158.22496863160976,0.8436571643025992,1.0,0.9970859984141835,0.0,0.04988373521321621,0.12269401261109883,5379.1926370125775,93.77443700336585,25.0,0,0,0,13,1272.5,52.4,59.1,0.5223613595706619,0.07374831493706267
24
- GBM (default),10.696644908750159,2.245801542559241,3.3219193518156818,0.4939224454467883,0.9087857894621156,0.8834968815958729,0.0,0.08060606825955649,0.11976658053442996,80.53396018410795,58.38494586351267,21.923076923076923,4.4838,7.6057972113291425,0.9335102770063612,2.1107352135741744,0.2745991643955073,0.9712364739383097,0.9121375216694869,0.0,0.04937573951369445,0.10636020418552539,91.35379089849596,24.071571166022103,21.0,0,0,0,13,1263.8,53.0,53.2,0.5134168157423972,0.04684267482014358
25
- XT (tuned),1206.5194327423715,0.45855456478575357,442.36567464742467,0.18106308268317445,0.8639666647767655,0.8723069971625098,0.0,0.10266445862489976,0.12783943156365074,8220.916376158188,14.733646953552363,22.346153846153847,5.07281,766.4287914435068,0.34996385044521755,158.22496863160976,0.15116311924152026,1.0,0.9903008544022194,0.0,0.050434912392149145,0.12806468799723258,5379.1926370125775,14.85253877204164,24.0,0,0,0,13,1254.9,41.8,52.3,0.5035778175313059,0.05933488091441373
26
- TABPFNV2_GPU (default),13.930354648573786,0.6212240172247602,5.213536664770012,0.4399683707018391,0.7617874026169423,0.7938464861305419,0.46153846153846156,0.07567956002207855,0.13811994843145634,115.71517682132266,30.565681397453538,23.26923076923077,4.25916,9.373317972819011,0.4424108028411865,2.803336175829957,0.3130195506083406,1.0,1.0,0.0,0.06281203604096552,0.07601826824112161,120.93804022420875,33.21203399192692,28.5,0,0,0,13,1231.2,58.6,60.3,0.4821109123434705,0.08671869218804537
27
- NN_TORCH (tuned),44960.17932087947,0.2904610134597517,5714.412959426929,0.10578639607158029,0.8688481869538994,0.8668830387716895,0.0,0.0842812846349075,0.1388441315022607,220430.49479235077,7.008617614539204,23.307692307692307,4.7148,15497.21247045199,0.18749599986606175,4608.594420268999,0.09690652350320204,1.0,0.9449347108759663,0.0,0.0730528979706524,0.0987961550603162,177697.9133637009,6.6162587674624795,23.0,0,0,0,13,1230.2,46.0,47.0,0.481216457960644,0.0496801955102473
28
- XGB (default),12.109628748486186,0.9366908014330091,3.1242101566662273,0.3058622971724495,0.8862480764613536,0.8643492470670666,0.0,0.08752819812522358,0.12572826983269936,67.76669386312491,28.981192639888032,24.115384615384617,4.75458,7.79438853263855,0.5301833947499593,2.2441525977447814,0.24247013095584213,1.0,0.9587276133688395,0.0,0.053268364359151055,0.105372572651016,75.78126190262087,16.70456778360137,24.0,0,0,0,13,1214.6,41.8,46.0,0.462432915921288,0.046556052745046386
29
- RF (tuned + ensemble),2056.9684221919783,3.432567118171953,500.6828145780351,1.0841887922659657,0.9244962574709313,0.9210959637362763,0.0,0.10851512257257524,0.13041932100317657,10863.776799780928,90.24173062908258,24.46153846153846,5.10626,1088.11842862765,1.444333102968004,515.7302180242054,0.7709478321252661,1.0,0.9973408733427616,0.0,0.06747831829737316,0.13942046203027075,11909.387526812274,91.13733276245972,25.0,0,0,0,13,1202.6,45.8,47.1,0.4543828264758497,0.044566348818597695
30
- RF (tuned),2056.9684221919783,0.3944415463341607,500.6828145780351,0.1540690468576434,0.9446007082199963,0.9408578430892015,0.0,0.11360740147863871,0.13890562008885568,10863.776799780928,12.055960583687282,26.23076923076923,5.18858,1088.11842862765,0.3136819733513726,515.7302180242054,0.12356183047078627,1.0,1.0,0.0,0.07199386349783587,0.14203697859305436,11909.387526812274,11.406268840053338,28.0,0,0,0,13,1152.7,48.4,57.1,0.41323792486583183,0.040812401020999874
31
- EBM (tuned + ensemble),26125.740811508127,1.1628193525167612,3543.4601988059676,0.44619286429284205,0.8289294744561572,0.8515101676633261,0.0,0.1367713941762849,0.19454116603903474,102743.72801916403,13.991542249573758,27.0,4.37076,6865.337549757957,0.3269915845659044,1890.6770917738593,0.13287644820933492,1.0,1.0,0.0,0.1248421766855935,0.1374516591403558,100996.51953892264,10.735277156769923,33.0,0,1,0,12,1136.0,45.1,60.4,0.3953488372093023,0.07486975066160587
32
- XT (default),4.038539225015885,0.28929371426248146,0.7743502790637721,0.08386602222003156,0.9314402600373994,0.9226381137036738,0.0,0.12238733986993501,0.17918453000563647,12.035409694284047,7.5346994725559195,28.115384615384617,5.13889,2.058894846174452,0.22108591927422416,0.46766450701756074,0.055025941487738636,1.0,1.0,0.0,0.08970042626125052,0.15512507020900368,11.279879616649893,6.868103658105229,31.0,0,0,0,13,1100.4,55.5,62.8,0.3694096601073345,0.04000377617175779
33
- EBM (tuned),26125.740811508127,0.09807351307991222,3543.4601988059676,0.04003457596727012,0.8498896422587312,0.8690004985226352,0.0,0.14218261386610534,0.20871356711851735,102743.72801916403,1.2806753432569877,28.53846153846154,4.42885,6865.337549757957,0.03976681497361925,1890.6770917738593,0.012767925630469206,1.0,1.0,0.0,0.1292889595673019,0.16421526338237233,100996.51953892264,1.0,34.0,0,0,1,12,1091.5,49.5,56.7,0.3595706618962433,0.05876238598299918
34
- FASTAI (tuned + ensemble),4642.340904501768,8.77000401672135,1248.4159649024482,5.455805855994076,0.9763387226193126,0.966122715762154,0.0,0.12202126425715551,0.19741406156575694,32660.757986061028,336.5557366220283,30.0,5.27232,3620.151651991738,7.070411682128906,540.0550122797715,2.672383567926809,1.0,1.0,0.0,0.1022000155980195,0.1756471698456369,33252.590479353305,324.2617231497696,31.0,0,0,0,13,1040.0,61.4,49.3,0.32558139534883723,0.0357112186773915
35
- NN_TORCH (default),151.43337847534409,0.22200924249795764,23.930992166336175,0.08705592871966639,0.9827422496757343,0.9678685866022871,0.0,0.1193977421999948,0.19547728530185762,831.2534637345914,6.296769612194,30.23076923076923,4.81722,65.27966655625238,0.14955372280544704,20.47535029226034,0.07966387341594139,1.0,1.0,0.0,0.14050249681884142,0.13614151881375058,546.0088939544379,6.398331846186531,32.0,0,0,0,13,1038.3,57.6,62.3,0.32021466905187834,0.03550445228984282
36
- EBM (default),61.24195487458481,0.11774356426336827,10.057660454061297,0.07098698960857609,0.8908598695337496,0.8984005515464777,0.0,0.1505943663504528,0.22072991542366915,260.56038663650384,2.782386904835081,30.615384615384617,4.44221,19.539602756500244,0.04366175333658854,6.327684720357259,0.039228283502797535,1.0,1.0,0.0,0.13598656576261503,0.17428198310764084,214.4635432482436,1.7877726532004268,35.0,0,0,0,13,1034.2,43.3,66.2,0.3112701252236136,0.036159887762040425
37
- RF (default),9.267833071284823,0.31600932585887426,1.1524694810969494,0.08818027662752802,1.0,0.9870278022164788,0.0,0.12896216114203193,0.1901035125951938,27.793203837891944,7.952067403429296,31.346153846153847,5.26146,5.924832847383287,0.26005201869540745,0.5271703851240811,0.062155184511682476,1.0,1.0,0.0,0.08142906388988314,0.17217694118713336,20.351375021697393,6.812325017295935,31.5,0,0,0,13,1000.0,0.0,0.0,0.29427549194991054,0.03259951679594785
38
- FASTAI (tuned),4642.340904501768,0.9693611326380673,1248.4159649024482,0.5658202849858516,0.9859539042371194,0.9812210227156647,0.0,0.1268703134805104,0.21643571616372462,32660.757986061028,38.043706128121705,31.692307692307693,5.18701,3620.151651991738,1.0039918687608507,540.0550122797715,0.32495714500374384,1.0,1.0,0.0,0.11787324661401322,0.18136473255372101,33252.590479353305,36.96325797937462,33.0,0,0,0,13,991.5,48.2,64.3,0.28622540250447226,0.03287029132617368
39
- FASTAI (default),18.837261917856004,0.970951892168094,4.540025666114834,0.4814105255967321,1.0,0.9959412543360979,0.0,0.1713566965946914,0.27575692774796806,123.96432879696866,34.056295479841545,34.92307692307692,6.36878,15.734567880630493,1.036571078830295,2.6035035917474465,0.39168673048638547,1.0,1.0,0.0,0.11912950408368406,0.27762932440609267,138.38888737159849,36.35774230804583,36.0,0,0,0,13,858.2,52.8,69.4,0.2110912343470483,0.029119709509087932
40
- KNN (tuned + ensemble),20.494465350493407,2.240403470422468,2.7608828307729607,0.3822937952790146,1.0,0.9953083923448788,0.07692307692307693,0.3613185363694843,0.6538216817153736,74.30220244876604,25.830539833366643,39.76923076923077,8.1617,12.562016169230143,0.2777775393591987,2.427842858706766,0.14225338857865483,1.0,1.0,0.0,0.41352430110915517,0.7463425706140449,61.12257192545374,15.449077396683121,41.0,0,0,0,13,532.3,77.3,95.3,0.09838998211091235,0.025256773577856426
41
- LR (tuned + ensemble),260.53517347665934,0.6450919024964683,90.82426667518703,0.24319307436598073,1.0,1.0,0.0,0.35422421997100256,0.6560326250854102,2016.752248654558,10.333316070921164,40.30769230769231,8.15038,155.9093332555559,0.2263851695590549,45.737119844257606,0.10677772758861735,1.0,1.0,0.0,0.3837007856022804,0.7329121008806585,1735.7243863195374,7.455644136363483,40.0,0,0,0,13,488.7,52.5,94.2,0.08586762075134168,0.024850461752159034
42
- KNN (tuned),20.494465350493407,0.2574708763350788,2.7608828307729607,0.05410989827741383,1.0,0.9988376690148505,0.07692307692307693,0.36756573770309675,0.6864360755861012,74.30220244876604,3.3744578525576268,40.76923076923077,8.25638,12.562016169230143,0.09626038869222005,2.427842858706766,0.028318042556444805,1.0,1.0,0.0,0.42023780750084094,0.7514125894823619,61.12257192545374,2.1032822899149863,42.0,0,0,0,13,441.9,67.3,134.2,0.07513416815742398,0.024635899785160127
43
- LR (tuned),260.53517347665934,0.13893676680377404,90.82426667518703,0.0633906225067962,1.0,1.0,0.0,0.3558634149624985,0.6630785837896581,2016.752248654558,3.076869056672641,40.92307692307692,8.15927,155.9093332555559,0.05223793453640408,45.737119844257606,0.05009355954825878,1.0,1.0,0.0,0.38692302721983185,0.7430806596590566,1735.7243863195374,2.8910146707817423,41.0,0,0,0,13,420.6,81.6,94.5,0.07155635062611806,0.02449625775032643
44
- LR (default),4.903454369968838,0.19863318235446245,2.033074367074023,0.07563726553282422,1.0,1.0,0.0,0.3808091655367071,0.7702257899846487,45.803264898726965,4.508173919929611,42.15384615384615,8.22758,5.286295996771918,0.09712121221754286,1.191311693685635,0.0859815087197251,1.0,1.0,0.0,0.38694683448707623,0.8770019722050186,34.79737790723628,4.100500531123858,42.0,0,0,0,13,290.1,86.6,112.6,0.04293381037567084,0.023757512031433232
45
- KNN (default),2.0401845610039864,0.07292802802517882,0.5185761600539422,0.029646415007926124,1.0,0.999847626563869,0.07692307692307693,0.4076491058824559,0.8492569035691879,1.1215157323318734,1.57971988992164,42.61538461538461,8.55152,0.15191650390625,0.04789047771030002,0.03752343922831045,0.021942545900661376,1.0,1.0,0.0,0.4573857803481276,1.0,1.0,1.5317182357480712,44.0,0,0,0,13,238.6,87.0,145.9,0.03220035778175313,0.023578620171993768
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/full-imputed-reg/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:da71d8796986e598f595ef583a35349dd78b721b6d2ad86658f6df61f9e4e254
3
- size 84316
 
 
 
 
data/full-imputed-reg/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:968b8479bc66540750d1aa46e52693671b0d70021fe64bf2dc19f372a70141a1
3
- size 233696
 
 
 
 
data/full-imputed-reg/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:a588d59f9a719375d59bd42052e31c2a66093f268e635e796c600edf5a50472a
3
- size 214856
 
 
 
 
data/full-imputed/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:75cafd963414fd0e3c846b10958b9ccab2099954c699ac664e46a0da24ee2f1b
3
- size 335244
 
 
 
 
data/full-imputed/leaderboard.tex DELETED
@@ -1,53 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- AutoGluon 1.3 (4h) & \textcolor{gold}{\textbf{1590${}_{-28,+24}$}} & \textcolor{gold}{\textbf{0.618}} & \textcolor{gold}{\textbf{8.4}} & \textcolor{gold}{\textbf{3.1}} & \textcolor{silver}{\textbf{8}} & \textcolor{gold}{\textbf{5.8\%}} & 1408.78 & 3.34 \\
8
- RealMLP (T+E) & \textcolor{silver}{\textbf{1565${}_{-23,+28}$}} & \textcolor{silver}{\textbf{0.537}} & \textcolor{silver}{\textbf{9.1}} & 5.7 & 0 & \textcolor{silver}{\textbf{6.6\%}} & 6564.71 & 10.26 \\
9
- TabM (T+E) & \textcolor{bronze}{\textbf{1544${}_{-22,+27}$}} & \textcolor{bronze}{\textbf{0.502}} & \textcolor{bronze}{\textbf{9.9}} & 4.9 & 3 & \textcolor{bronze}{\textbf{7.0\%}} & 3285.87 & 1.47 \\
10
- LightGBM (T+E) & 1526${}_{-24,+24}$ & 0.452 & 10.6 & 6.6 & 1 & 8.6\% & 416.98 & 2.64 \\
11
- CatBoost (T+E) & 1484${}_{-26,+19}$ & 0.433 & 12.4 & 8.2 & 0 & 7.9\% & 1658.41 & 0.65 \\
12
- CatBoost (T) & 1470${}_{-23,+18}$ & 0.416 & 13.0 & 7.0 & 1 & 8.1\% & 1658.41 & 0.08 \\
13
- TabM (T) & 1453${}_{-27,+20}$ & 0.413 & 13.7 & 7.2 & 1 & 8.0\% & 3285.87 & 0.17 \\
14
- LightGBM (T) & 1448${}_{-27,+23}$ & 0.363 & 14.0 & 11.7 & 0 & 9.3\% & 416.98 & 0.33 \\
15
- XGBoost (T+E) & 1439${}_{-20,+21}$ & 0.361 & 14.3 & 9.9 & 0 & 9.4\% & 693.49 & 1.69 \\
16
- ModernNCA (T+E) & 1434${}_{-27,+24}$ & 0.444 & 14.5 & 5.3 & 3 & 8.7\% & 4621.67 & 8.15 \\
17
- CatBoost (D) & 1428${}_{-20,+27}$ & 0.372 & 14.8 & 7.9 & 1 & 9.3\% & 6.83 & 0.08 \\
18
- TabPFNv2 (T+E) & 1415${}_{-26,+27}$ & 0.476 & 15.4 & \textcolor{gold}{\textbf{3.1}} & \textcolor{gold}{\textbf{11}} & 8.5\% & 3030.15 & 21.44 \\
19
- XGBoost (T) & 1405${}_{-23,+22}$ & 0.313 & 15.9 & 13.2 & 0 & 9.7\% & 693.49 & 0.31 \\
20
- ModernNCA (T) & 1402${}_{-21,+18}$ & 0.307 & 16.0 & 8.9 & 1 & 9.3\% & 4621.67 & 0.47 \\
21
- TabICL (D) & 1390${}_{-24,+24}$ & 0.381 & 16.6 & \textcolor{bronze}{\textbf{4.5}} & 6 & 9.2\% & 6.63 & 1.48 \\
22
- TabPFNv2 (T) & 1348${}_{-24,+24}$ & 0.360 & 18.6 & 5.7 & 1 & 10.6\% & 3030.15 & 0.46 \\
23
- RealMLP (T) & 1348${}_{-21,+20}$ & 0.229 & 18.6 & 15.9 & 0 & 10.1\% & 6564.71 & 0.49 \\
24
- TabM (D) & 1347${}_{-19,+20}$ & 0.285 & 18.6 & 12.6 & 0 & 10.9\% & 10.49 & 0.13 \\
25
- TorchMLP (T+E) & 1332${}_{-21,+25}$ & 0.220 & 19.4 & 14.8 & 0 & 10.6\% & 2874.67 & 1.95 \\
26
- TabPFNv2 (D) & 1321${}_{-23,+21}$ & 0.324 & 20.0 & 5.6 & 4 & 11.6\% & 3.36 & 0.31 \\
27
- TabDPT (D) & 1300${}_{-28,+25}$ & 0.300 & 21.0 & 4.9 & \textcolor{bronze}{\textbf{7}} & 12.3\% & 22.53 & 8.55 \\
28
- ModernNCA (D) & 1299${}_{-23,+25}$ & 0.159 & 21.0 & 12.8 & 1 & 12.8\% & 14.87 & 0.31 \\
29
- EBM (T+E) & 1292${}_{-25,+24}$ & 0.184 & 21.4 & 13.5 & 0 & 14.6\% & 1331.68 & 0.20 \\
30
- FastaiMLP (T+E) & 1250${}_{-19,+24}$ & 0.158 & 23.5 & 14.5 & 0 & 14.0\% & 593.24 & 4.47 \\
31
- RealMLP (D) & 1244${}_{-25,+21}$ & 0.104 & 23.7 & 20.3 & 0 & 12.7\% & 21.86 & 0.84 \\
32
- ExtraTrees (T+E) & 1243${}_{-21,+22}$ & 0.124 & 23.8 & 15.8 & 0 & 14.3\% & 183.02 & 0.76 \\
33
- EBM (T) & 1236${}_{-21,+18}$ & 0.135 & 24.1 & 17.7 & 0 & 15.3\% & 1331.68 & 0.02 \\
34
- XGBoost (D) & 1232${}_{-24,+19}$ & 0.115 & 24.4 & 19.8 & 0 & 12.7\% & 1.94 & 0.12 \\
35
- TorchMLP (T) & 1223${}_{-21,+25}$ & 0.113 & 24.8 & 21.3 & 0 & 12.7\% & 2874.67 & 0.13 \\
36
- EBM (D) & 1202${}_{-29,+22}$ & 0.134 & 25.7 & 13.5 & 1 & 16.2\% & 4.67 & 0.04 \\
37
- LightGBM (D) & 1202${}_{-23,+23}$ & 0.088 & 25.7 & 23.0 & 0 & 13.5\% & 1.96 & 0.14 \\
38
- RandomForest (T+E) & 1201${}_{-24,+21}$ & 0.098 & 25.8 & 15.9 & 0 & 15.1\% & 373.18 & 0.77 \\
39
- ExtraTrees (T) & 1200${}_{-28,+18}$ & 0.094 & 25.9 & 17.1 & 0 & 15.4\% & 183.02 & 0.09 \\
40
- FastaiMLP (T) & 1161${}_{-19,+24}$ & 0.073 & 27.7 & 22.6 & 0 & 15.6\% & 593.24 & 0.31 \\
41
- RandomForest (T) & 1152${}_{-25,+19}$ & 0.072 & 28.1 & 18.0 & 0 & 16.2\% & 373.18 & 0.09 \\
42
- TorchMLP (D) & 1066${}_{-21,+24}$ & 0.022 & 31.6 & 29.0 & 0 & 17.5\% & 9.99 & 0.13 \\
43
- FastaiMLP (D) & 1011${}_{-23,+23}$ & 0.023 & 33.7 & 31.0 & 0 & 20.8\% & 2.86 & 0.37 \\
44
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.009 & 34.0 & 32.4 & 0 & 21.3\% & 0.43 & 0.05 \\
45
- ExtraTrees (D) & 972${}_{-28,+21}$ & 0.025 & 35.0 & 31.4 & 0 & 23.0\% & 0.25 & 0.05 \\
46
- Linear (T+E) & 917${}_{-21,+31}$ & 0.031 & 36.6 & 28.3 & 0 & 30.8\% & 47.49 & 0.17 \\
47
- Linear (T) & 882${}_{-30,+32}$ & 0.020 & 37.5 & 32.8 & 0 & 31.5\% & 47.49 & 0.07 \\
48
- Linear (D) & 862${}_{-30,+23}$ & 0.014 & 38.1 & 30.5 & 0 & 32.9\% & 1.52 & 0.09 \\
49
- KNN (T+E) & 685${}_{-32,+27}$ & 0.000 & 41.4 & 41.1 & 0 & 45.3\% & 2.74 & 0.18 \\
50
- KNN (T) & 608${}_{-48,+32}$ & 0.000 & 42.4 & 42.3 & 0 & 46.9\% & 2.74 & 0.04 \\
51
- KNN (D) & 459${}_{-45,+42}$ & 0.000 & 43.8 & 43.6 & 0 & 54.1\% & 0.05 & 0.02 \\
52
- \bottomrule
53
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/full-imputed/tabarena_leaderboard.csv DELETED
@@ -1,46 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- AutoGluon 1.3 (4h),8462.98349033711,33.138286938542635,2912.1242917208747,4.391016265244401,0.381932874173557,0.3938004850803849,0.0,0.05822662138626224,0.038656168015376684,43301.56973340033,539.3969074295995,8.411764705882353,0.2055,7367.614226023356,3.9490213659074573,1408.7828331379249,3.337414261487274,0.32575782629688715,0.34354944358388295,0.0,0.022224886416280176,0.016643068102528722,33121.993698706065,185.27028732280235,5.0,8,6,2,35,1589.7,23.3,27.4,0.8315508021390374,0.32434985005444883
3
- REALMLP (tuned + ensemble),82918.85938235045,50.75538799222518,8669.653795437369,17.385570695148495,0.4634803896199485,0.48980490794993603,0.0,0.0662652507057572,0.04428998923773407,179788.549283492,846.8926595159473,9.147058823529411,0.20859,30826.14291326205,23.329282654656303,6564.713231881598,10.263652729721208,0.4233707478718706,0.4394625063276673,0.0,0.03156540515236528,0.021508501844188344,136595.8709190495,808.3809230744637,7.0,0,5,4,42,1564.6,27.5,23.0,0.8148395721925134,0.17688848740169394
4
- TABM_GPU (tuned + ensemble),36462.58337058756,8.791583087761158,5017.258422060137,3.1802055454964386,0.49791774396870153,0.5234045251267629,0.0,0.06977046437303486,0.06341802359213981,80230.4785975727,130.9039885664859,9.872549019607844,0.20535,8420.993191123009,2.7141049438052707,3285.8688373170553,1.4723486146222118,0.4651893659350262,0.5081840173827434,0.0,0.03250086419089859,0.027567784172117953,47614.825535817596,111.53030159160512,8.0,3,2,3,43,1543.9,26.8,21.9,0.7983511586452763,0.20599586481497986
5
- GBM (tuned + ensemble),3088.2834950947295,22.058380506500956,771.2925090199282,4.080656504886727,0.5482943614672702,0.5697587102764388,0.0,0.08557614755706973,0.059109015015710746,11617.562781863631,427.3461655076801,10.627450980392156,0.2112,1667.667911251386,3.7860276963975696,416.9832926671224,2.6387318875879693,0.5661942244497019,0.5831585230842022,0.0,0.04023364809387353,0.02358455413295459,9501.568897853978,118.5632719261967,10.0,1,1,2,47,1526.2,23.7,23.4,0.7811942959001783,0.1520008705225847
6
- CAT (tuned + ensemble),20021.3498029566,3.148214934334516,4130.291158755297,0.9792413821552454,0.5665168479793786,0.5762538702782276,0.0,0.07932910031410541,0.05277138538144033,58497.64726501732,62.60491239485109,12.372549019607844,0.21079,7127.110804247856,1.3727677133348253,1658.412974283854,0.65278892715772,0.5695869531891277,0.5907800269434179,0.0,0.04606752490069621,0.02774139238223889,22349.13952253535,41.43902270876451,11.0,0,1,4,46,1484.5,18.5,25.4,0.7415329768270945,0.12146805837673025
7
- CAT (tuned),20021.3498029566,0.45918073721700764,4130.291158755297,0.13269281663977184,0.5837954660770609,0.5897551898062613,0.0,0.08123455741105615,0.05345084183416724,58497.64726501732,8.058537488270943,12.96078431372549,0.21123,7127.110804247856,0.16668947537740073,1658.412974283854,0.08101450844552308,0.5845247966032667,0.6225038615712537,0.0,0.05045951030687079,0.030933776248240515,22349.13952253535,5.582269457200397,12.5,1,3,2,45,1470.5,17.3,22.3,0.7281639928698752,0.1437603439953039
8
- TABM_GPU (tuned),36462.58337058756,0.9691277276976176,5017.258422060137,0.3543499279245272,0.5874385027995309,0.5895097011198668,0.0,0.08005397326560945,0.07918093169544495,80230.4785975727,13.69951936973594,13.696078431372548,0.21042,8420.993191123009,0.2791590425703261,3285.8688373170553,0.1728818165133111,0.559085035183601,0.6013449948327668,0.0,0.041022605221466724,0.035365730806074164,47614.825535817596,11.564361688735561,14.0,1,3,2,45,1452.9,20.0,26.7,0.7114527629233511,0.1390380052974814
9
- GBM (tuned),3088.2834950947295,3.280151636356362,771.2925090199282,0.7008989122586311,0.6366067117608026,0.6336828329141213,0.0,0.09282220252278703,0.0695723239257395,11617.562781863631,67.66975356224859,13.950980392156863,0.21181,1667.667911251386,0.5984517203436958,416.9832926671224,0.33384935590955944,0.6296869528283883,0.6590584882044693,0.0,0.05048061255226499,0.03622173132751954,9501.568897853978,17.561755180687065,12.0,0,0,0,51,1447.5,22.7,26.1,0.7056595365418895,0.08527400346153444
10
- XGB (tuned + ensemble),6066.175009788823,8.000445660682546,1229.0991373608288,3.120951906510457,0.6385613101625683,0.6434959772946812,0.0,0.09352817276204972,0.06895162562612145,14800.692905333448,192.56405868832437,14.27450980392157,0.21458,2256.9276883072325,3.0598979949951173,693.4907982506384,1.6904262577061315,0.671301017368657,0.6581535714975986,0.0,0.05552089944600691,0.03200108485206017,11769.825378159274,82.2877578838671,13.0,0,1,1,49,1438.7,20.3,19.4,0.698306595365419,0.1010413685016667
11
- MNCA_GPU (tuned + ensemble),50831.51192726024,408.7024568246081,6035.4844365627405,41.810485127597225,0.5559334685448146,0.5505192494459418,0.0,0.08712344899479532,0.06897276604102788,107380.48324313454,2632.7754273052897,14.549019607843137,0.21294,14486.050127214856,13.240725604693095,4621.665633563503,8.148513113458952,0.5384753932635177,0.5315179526728905,0.0,0.035276378418488075,0.031199094919591976,89084.88026764008,537.8274695033359,11.0,3,3,5,40,1433.8,23.6,26.7,0.6920677361853832,0.1903874062097513
12
- CAT (default),190.03885268302784,0.2709793741147243,87.82759080296049,0.13562630112555335,0.6279909747535407,0.646125708691512,0.0,0.0925312918791838,0.06081652619205146,428.26912869207973,7.186903437353133,14.794117647058824,0.21142,28.268621895048355,0.1854714552561442,6.827020885706225,0.08026752106160412,0.6534673037587888,0.6439696146756235,0.0,0.04582469430261593,0.029179417473348953,113.47556087857672,6.139565341592893,16.0,1,3,1,46,1428.5,26.2,19.6,0.6864973262032086,0.12705337781476866
13
- TABPFNV2_GPU (tuned + ensemble),9338.42554130284,95.46951768190513,2848.5135701006398,47.682041415407696,0.5239705071709494,0.5769312699511674,0.35294117647058826,0.08515142798212334,0.08426448408310144,62904.17752069347,3721.414760799794,15.372549019607844,0.22709,2098.550690642993,13.798940539360046,3030.145221648074,21.443889733771147,0.5915448073500406,0.6069332103645363,0.0,0.03866296841083827,0.03250100481138207,29840.56658077224,861.272449686595,11.0,11,4,3,33,1414.7,26.3,25.1,0.6733511586452763,0.3218844858197347
14
- XGB (tuned),6066.175009788823,1.6190793797341305,1229.0991373608288,0.7004875821871327,0.6869435084156271,0.6795624628612148,0.0,0.09696298272522294,0.07331091441041244,14800.692905333448,40.17091923504828,15.892156862745098,0.21499,2256.9276883072325,0.41774741808573407,693.4907982506384,0.3083513292273418,0.7193303230280006,0.7149553857561783,0.0,0.06100086263844462,0.03462012868860662,11769.825378159274,12.136706817415504,15.0,0,0,0,51,1404.9,21.4,22.2,0.6615418894830659,0.07556633120131231
15
- MNCA_GPU (tuned),50831.51192726024,15.584971993005873,6035.4844365627405,1.7226505575752833,0.692913423977365,0.6430754651403978,0.0,0.09326988355673647,0.07237812189150603,107380.48324313454,104.77227719302421,16.03921568627451,0.21328,14486.050127214856,0.585451708899604,4621.665633563503,0.4747242314576746,0.7553848478357736,0.6366009339265687,0.0,0.06113391902932641,0.0493534926522164,89084.88026764008,26.6476237825924,16.0,1,1,2,47,1402.5,17.6,20.6,0.6581996434937611,0.1120344583371438
16
- TABICL_GPU (default),82.80334805787777,14.446106202929627,7.4655941033985815,1.6847655412525513,0.6187626573361198,0.6590285685589051,0.29411764705882354,0.09205009921867735,0.09066558702327601,132.83095048510535,173.47404322746496,16.607843137254903,0.21659,20.0619904200236,1.316457470258077,6.625686376434735,1.479660415649414,0.6419318555001297,0.737430199336512,0.0,0.05183434289459943,0.0307061823959099,110.37274820175764,91.25701004379168,14.0,6,4,1,40,1389.8,23.7,23.2,0.6452762923351159,0.22043663312484155
17
- REALMLP (tuned),82918.85938235045,2.265432894048088,8669.653795437369,0.9268647174900084,0.770575243056979,0.7198426366318216,0.0,0.10098849134105925,0.08466680664165582,179788.549283492,40.77005886388886,18.61764705882353,0.21508,30826.14291326205,1.0006839964124892,6564.713231881598,0.487512478277248,0.866614722551564,0.7557285234401128,0.0,0.06823335132903552,0.05320262249246514,136595.8709190495,36.173930978956705,18.0,0,0,0,51,1347.8,19.9,20.3,0.5995989304812834,0.06290892887580088
18
- TABPFNV2_GPU (tuned),9338.42554130284,3.508518432739773,2848.5135701006398,1.6633483351042642,0.6402449044416614,0.6630497140643348,0.35294117647058826,0.10613854546664524,0.10151316508024909,62904.17752069347,134.3876978041907,18.61764705882353,0.22751,2098.550690642993,0.5097733656565349,3030.145221648074,0.46198440414964803,0.7603547647708915,0.7104310097848833,0.0,0.07667977096051137,0.04654872982993791,29840.56658077224,22.48627830890164,15.5,1,9,2,39,1348.2,23.9,23.8,0.5995989304812834,0.1748060544077359
19
- TABM_GPU (default),139.62633961878052,1.0737770921524312,22.4928837749369,0.40818286049864,0.7149663388553331,0.7265875725216725,0.0,0.10904673882873887,0.10206768551470595,289.28104127940037,12.833742622244758,18.61764705882353,0.21188,40.332407061258955,0.18133597903781468,10.492230631952996,0.13221126395693922,0.8197432069948619,0.7908796299405498,0.0,0.05780353607735633,0.03944837820705307,186.77830789084513,10.724711035853641,17.0,0,0,1,50,1346.8,19.5,18.3,0.5995989304812834,0.07920722957248505
20
- NN_TORCH (tuned + ensemble),29590.123960411,13.183612702147375,3729.842332819238,3.5618312259496654,0.7804812465107793,0.7650139226387963,0.0,0.1061112883342963,0.09038128962619164,97942.50427646744,194.194370742189,19.392156862745097,0.21479,10848.660208092795,3.946354971991645,2874.6743506773596,1.9516254299583915,0.910350140134773,0.8406524459297455,0.0,0.06624137678637454,0.057854463852261596,61441.834728019065,144.9992483814283,20.0,0,0,0,51,1332.3,24.5,20.5,0.5819964349376114,0.06762812359455618
21
- TABPFNV2_GPU (default),12.045264010803372,0.825882379421741,4.478694030451919,0.4530850283232118,0.6758133693268394,0.7167075967625335,0.35294117647058826,0.11639838128335792,0.11295068368765535,69.53545044274777,29.75030650326298,20.04901960784314,0.22887,9.117408725950453,0.42119165261586505,3.3572004182270923,0.3130195506083406,0.956340758739766,0.8514943913344442,0.0,0.07592598037026033,0.052795487060598015,52.331266670190345,18.512009051191257,21.0,4,1,4,42,1320.6,20.6,22.2,0.5670677361853832,0.1788864671189465
22
- TABDPT_GPU (default),166.18249968905855,63.11337411850366,27.980557312145997,23.24231264879125,0.6999265055194402,0.699617586999111,0.0,0.12276620509418873,0.09536847721359762,577.6877537613715,1486.1376930964866,20.96078431372549,0.22562,99.10453534126282,28.39870807859633,22.52690614988171,8.550738306685618,0.9903591350487536,0.8691048403402053,0.0,0.05029899916003,0.0433680924463882,528.528670151851,1255.434427440434,22.0,7,0,3,41,1300.5,24.7,27.1,0.5463458110516934,0.2050929674182998
23
- MNCA_GPU (default),252.05720278889527,8.333707038145958,17.796029105655933,1.1526772129094414,0.840779547661087,0.7959093866791253,0.0,0.12833371899167995,0.09667612599337075,329.3551130182264,62.49957238602975,21.03921568627451,0.21885,36.0034454398685,0.5316168732113309,14.869495839530392,0.30768591310277943,1.0,0.8857694762766525,0.0,0.07424254289688792,0.06351531911154724,236.99495784925895,20.43419277465042,23.0,1,0,0,50,1299.1,24.9,22.8,0.5445632798573975,0.07840885625256006
24
- EBM (tuned + ensemble),34026.536489860475,1.2927426090946903,5478.959788432886,0.48902813323980865,0.8160130792555895,0.8175846459536137,0.0,0.1459291690476727,0.13123283756498888,45019.152540237825,18.109773959357398,21.431372549019606,0.21615,2925.6548613442314,0.400875727335612,1331.6775166450918,0.19908260374566636,0.958308811651104,0.8761716756909942,0.0,0.08565806627599937,0.05590445639895017,17751.99098903195,11.13922354039944,21.0,0,1,1,49,1292.5,23.9,24.9,0.535650623885918,0.07406174849391879
25
- FASTAI (tuned + ensemble),6629.648996400055,16.16286987151975,1343.5604508466745,7.704229376998794,0.8424987698640145,0.8369589067092069,0.0,0.13971168994404898,0.11512340310437526,22455.399300542234,425.24771894393024,23.46078431372549,0.21679,3182.895098288854,10.795994228786892,593.237788402893,4.466873745216533,1.0,0.9552259937574975,0.0,0.0932055660765102,0.07243348401502496,19851.125229101002,409.94392325616803,25.0,0,1,0,50,1249.7,24.0,19.0,0.48952762923351156,0.06883106383498735
26
- REALMLP (default),273.729912211121,2.9575047363123343,25.2860618687397,2.9321723489284044,0.8959152468625471,0.8483293899440195,0.0,0.1265509832451891,0.10892646502099379,570.5207778272487,128.70805989538366,23.745098039215687,0.21516,107.08700556225247,2.6188460985819497,21.860990319181454,0.8382976743420607,1.0,0.8905283644114734,0.0,0.09824845633171009,0.07444845646092337,497.27886689729087,54.5386788723178,24.0,0,0,0,51,1244.4,20.7,24.5,0.483065953654189,0.049206352618688116
27
- XT (tuned + ensemble),1289.1519403179227,3.0578905248434194,464.9314949446578,1.2901947647379284,0.8763124357874087,0.8533692619097849,0.0,0.14328509610809964,0.11780722719308848,5501.829777288316,78.94715148525705,23.823529411764707,0.21814,763.5855970117781,1.8364481396145291,183.01944048073585,0.761281055543471,1.0,0.9521925032545069,0.0,0.08603990026311015,0.07137952858629182,3537.1622158448404,68.20231667792302,27.0,0,0,1,50,1243.2,21.4,20.4,0.48128342245989303,0.06310725111805937
28
- EBM (tuned),34026.536489860475,0.16189759790507796,5478.959788432886,0.07139558038617727,0.8649702054759879,0.8501629933938847,0.0,0.15261119112333235,0.14046524715764735,45019.152540237825,2.1906554232861546,24.11764705882353,0.21772,2925.6548613442314,0.0440410852432251,1331.6775166450918,0.022730636596679687,1.0,0.9362089365721089,0.0,0.09714090003058773,0.06554952824985644,17751.99098903195,1.2223545537568268,24.0,0,0,1,50,1236.3,17.4,20.5,0.47459893048128343,0.056429653896796474
29
- XGB (default),12.860019501050314,0.6666093941607507,3.1825529067488505,0.3000924886961127,0.8848348696110224,0.853007192078055,0.0,0.12712004260212578,0.11901448944644484,40.716829559380564,17.42803351650834,24.441176470588236,0.21667,5.82192047437032,0.3283502260843913,1.9409389396340444,0.12262328807300621,1.0,0.9508124068469325,0.0,0.09769803191981652,0.0669685923578734,34.48133232859512,10.22021081972956,24.0,0,0,0,51,1232.4,18.8,23.5,0.4672459893048128,0.05049212098034601
30
- NN_TORCH (tuned),29590.123960411,0.7234875786018787,3729.842332819238,0.20874333848987678,0.8872281679035978,0.8461065690351314,0.0,0.1266933814263632,0.11368972667898117,97942.50427646744,10.830632864633353,24.754901960784313,0.21784,10848.660208092795,0.2631075382232666,2874.6743506773596,0.131112832826372,1.0,0.9115838155711167,0.0,0.1030878662833985,0.06896829387646289,61441.834728019065,8.236695976463633,25.0,0,0,0,51,1223.2,25.0,20.4,0.46011586452762926,0.04694203272118342
31
- EBM (default),104.78526760316363,0.17165232525412033,11.079598604500092,0.09398863718788793,0.8658153943709024,0.8643480238945523,0.0,0.1621049398152807,0.1485076759316338,148.0901517975655,3.2508346897376326,25.745098039215687,0.21677,11.465454594294231,0.05977429548899333,4.6738599788414605,0.03961948198354664,1.0,0.9628721496233147,0.0,0.10271933543623024,0.06419214710978799,75.98904610132907,2.400460311914132,26.0,1,0,2,48,1202.5,21.1,28.4,0.4376114081996435,0.07434503258700696
32
- GBM (default),8.619418829147072,1.0011564760166576,3.0462908189340183,0.25265600996978405,0.9115791356403461,0.8832381096058513,0.0,0.13512306737563554,0.11629352319961023,44.26353412937288,22.939896191571705,25.745098039215687,0.21706,5.971681065029568,0.2858244842953152,1.9600326879612946,0.14173548842042513,1.0,0.9463131041198098,0.0,0.09851412306889507,0.07460814115278391,32.72231234173676,8.612355874421967,26.0,0,0,0,51,1202.3,22.3,22.7,0.4376114081996435,0.04352354270813194
33
- RF (tuned + ensemble),2245.014868743487,2.632504310950734,530.9489806737863,1.2198476221584795,0.9024507372146047,0.8782930186177115,0.0,0.15120883991739936,0.12864400480448673,6771.2411119904955,73.51636782881279,25.84313725490196,0.21966,886.9249708387587,1.8479143513573542,373.17861356387994,0.7709478321252661,1.0,0.9768978814786328,0.0,0.0916564450840821,0.08516072052322858,5833.628398157948,63.708677480395814,28.0,0,1,1,49,1200.7,20.1,23.2,0.4353832442067736,0.06295017478288799
34
- XT (tuned),1289.1519403179227,0.3436224609158917,464.9314949446578,0.1744826726067124,0.9058524786574399,0.8823009390085037,0.0,0.15403365971080407,0.127422275587505,5501.829777288316,9.97674146406182,25.92156862745098,0.21915,763.5855970117781,0.19129647148980033,183.01944048073585,0.09120693679998855,1.0,0.970701256454361,0.0,0.10343347628405763,0.07994728986866971,3537.1622158448404,8.867381424714122,29.0,0,1,0,50,1199.9,17.5,28.0,0.4336007130124777,0.05853966620405146
35
- FASTAI (tuned),6629.648996400055,1.0291698354002177,1343.5604508466745,0.6091231343301344,0.9268828889794378,0.8920727938696232,0.0,0.1558685523503756,0.1360312984711962,22455.399300542234,33.54113778341792,27.65686274509804,0.2173,3182.895098288854,0.8112125396728516,593.237788402893,0.306391541190021,1.0,0.9746050760858544,0.0,0.0988046136347035,0.08330206169020851,19851.125229101002,31.29559841059079,28.0,0,0,0,51,1160.6,23.5,18.9,0.39416221033868093,0.04418776652877106
36
- RF (tuned),2245.014868743487,0.2778017885544721,530.9489806737863,0.15585979099514927,0.9281198477241044,0.9030193429154391,0.0,0.1615360365144191,0.13978570251749914,6771.2411119904955,8.469005968793565,28.137254901960784,0.22016,886.9249708387587,0.17402595943874782,373.17861356387994,0.08526202343425164,1.0,0.9971929103870871,0.0,0.10456209496862212,0.09283342230804408,5833.628398157948,7.94881742200239,31.0,0,1,1,49,1151.9,18.9,24.1,0.38324420677361853,0.055631086241924914
37
- NN_TORCH (default),74.85835825680128,0.5412627521423472,14.695998951842784,0.19921856017345402,0.978335352631338,0.9550533922310487,0.0,0.17458090781576796,0.15954937461607677,327.51821759182326,9.71084645377221,31.637254901960784,0.22759,34.192402362823486,0.22645958264668783,9.990997226772679,0.1258046787872722,1.0,1.0,0.0,0.14050249681884142,0.10372091439545034,204.15320945265722,7.844934898940183,33.0,0,0,0,51,1065.7,23.7,20.8,0.303698752228164,0.03443490177911753
38
- FASTAI (default),27.993367113578813,1.070063710732138,4.951242935164595,0.5056871108143457,0.9771222767390375,0.9555114040073619,0.0,0.2080437804427664,0.19756225211381712,87.08936574190912,29.523324076097715,33.745098039215684,0.24127,12.973222759034899,0.8480017715030246,2.8561126039251374,0.37317813888351864,1.0,1.0,0.0,0.15801035292114685,0.14150143443329027,77.36570184770021,28.70383139894662,36.0,0,0,0,51,1010.9,22.5,22.1,0.2557932263814617,0.03221137350683143
39
- RF (default),5.289521401640116,0.18747803105248345,0.8914167974682212,0.07557012377860906,0.9905963640075596,0.9728548321405758,0.0,0.2127887009205986,0.22057611136662575,11.63160057311077,4.814878723410724,34.049019607843135,0.2484,1.2628253036075168,0.08775801128811306,0.43425701731008426,0.05354175385774865,1.0,1.0,0.0,0.15453679020731148,0.13517433236964935,7.010079512208331,3.9076874559468244,35.5,0,0,0,51,1000.0,0.0,0.0,0.24888591800356505,0.03085450482928122
40
- XT (default),3.067954011524425,0.20851136391458946,0.7599985259503769,0.07668050932017083,0.9748030390364969,0.9607261977937244,0.0,0.22983038868386624,0.2401119926385584,6.922794409042171,5.082769758447109,34.990196078431374,0.24428,1.029017792807685,0.0932659043206109,0.2473449527431567,0.04980480471851431,1.0,1.0,0.0,0.17836788960429084,0.15384123117868195,5.707021421143385,4.456198846093199,38.0,0,0,0,51,972.1,20.7,27.8,0.22749554367201427,0.03187662324031601
41
- LR (tuned + ensemble),298.0772925112502,1.5473658189794337,106.92655310814473,0.5272909274052285,0.968958377581858,0.9646098555228514,0.0,0.3076836862047438,0.36388884495507595,1326.7185005565484,20.631703511708864,36.568627450980394,0.25172,171.24826147821216,0.28876688745286727,47.49214683366935,0.1676693138737952,1.0,1.0,0.0,0.24584842785173022,0.2585604322609332,1064.131515124883,12.175213508812917,40.0,0,0,1,50,917.3,30.7,20.1,0.19162210338680927,0.03539216425693013
42
- LR (tuned),298.0772925112502,0.41952294077488855,106.92655310814473,0.14529639602271893,0.9796785843199748,0.970169924696589,0.0,0.31502751681846614,0.3731108004113419,1326.7185005565484,5.927745129039777,37.549019607843135,0.25255,171.24826147821216,0.10993631680806477,47.49214683366935,0.0665447885938415,1.0,1.0,0.0,0.2610498572359117,0.2575122541077485,1064.131515124883,4.067640041721783,41.0,0,0,0,51,881.6,31.6,29.4,0.16934046345811052,0.03046989425942536
43
- LR (default),6.912347784956555,0.4444732105550163,2.521765696020704,0.1600876370747322,0.9859350644688389,0.9744519582521974,0.0,0.3287432608233925,0.4183743470806094,32.35021650084229,7.018103298699637,38.07843137254902,0.2546,5.298751910527547,0.12218634287516277,1.5162047581506444,0.0887949061495271,1.0,1.0,0.0,0.26947668808304803,0.29385559917332205,22.991652687961256,4.725399415979996,41.5,0,0,1,50,862.4,22.1,29.5,0.1573083778966132,0.03274333914572857
44
- KNN (tuned + ensemble),129.6893764515588,9.265668427242952,9.839359763162042,0.6763553759282402,1.0,0.9960130371949129,0.11764705882352941,0.45319635453542406,0.6115009863532073,72.94693370250576,64.66285269534394,41.431372549019606,0.34658,10.182021194034153,0.27287014325459796,2.7415773839237785,0.17676389939136214,1.0,1.0,0.0,0.41352430110915517,0.6662116649574654,57.25701094840839,13.90517184260326,43.0,0,0,0,51,685.3,26.5,31.2,0.08110516934046345,0.02430200505969366
45
- KNN (tuned),129.6893764515588,1.4108907164571591,9.839359763162042,0.11144693589219826,1.0,0.9979197340731149,0.11764705882352941,0.4687472868734289,0.6540729042725081,72.94693370250576,10.182721712640424,42.431372549019606,0.35012,10.182021194034153,0.08784447775946723,2.7415773839237785,0.03624739765555662,1.0,1.0,0.0,0.4456786012073394,0.740394718900499,57.25701094840839,2.2791618782574123,44.0,0,0,0,51,607.8,31.1,47.9,0.05837789661319073,0.023658093377554222
46
- KNN (default),1.8202055170645122,0.18718713898544476,0.4967972483415847,0.03640307146637206,1.0,0.9999597757876026,0.11764705882352941,0.5414051085605402,0.9163581411737007,1.035103875827835,2.1647566343855695,43.833333333333336,0.40859,0.22567404641045463,0.036888705359564886,0.050692503527237594,0.021942545900661376,1.0,1.0,0.0,0.5109534123207524,1.0,1.0,1.2564234767690339,45.0,0,0,0,51,459.1,41.7,44.9,0.026515151515151516,0.022918450740172708
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/full-imputed/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:613f30371e61ed5f71cd5cf8b5685d3b0f7703794b48bb20aa121dac593ece71
3
- size 460639
 
 
 
 
data/full-imputed/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:93269a1a183c73727bb39b44ad54d60b2f8a5115a1e1045fb1c9e76c64b48002
3
- size 242032
 
 
 
 
data/full-imputed/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:067403096c8c20f1ba659da5990f9e0bac207da4f46c52cd8a8c2047346083c4
3
- size 226642
 
 
 
 
data/lite/full-imputed/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f32a80e8cbf7007a3cbb744fc1c47c1ce2e1d62b1214d3d8f6c3fff5240b0f7
3
- size 341376
 
 
 
 
data/lite/full-imputed/leaderboard.tex DELETED
@@ -1,53 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- AutoGluon 1.3 (4h) & \textcolor{gold}{\textbf{1542${}_{-33,+32}$}} & \textcolor{gold}{\textbf{0.618}} & \textcolor{gold}{\textbf{7.6}} & \textcolor{gold}{\textbf{3.2}} & \textcolor{silver}{\textbf{7}} & \textcolor{gold}{\textbf{6.4\%}} & 1453.27 & 3.15 \\
8
- RealMLP (T+E) & \textcolor{silver}{\textbf{1469${}_{-20,+29}$}} & \textcolor{silver}{\textbf{0.537}} & \textcolor{silver}{\textbf{10.0}} & 4.9 & 3 & \textcolor{silver}{\textbf{8.3\%}} & 6559.12 & 8.60 \\
9
- LightGBM (T+E) & \textcolor{bronze}{\textbf{1425${}_{-25,+27}$}} & 0.452 & \textcolor{bronze}{\textbf{11.9}} & 8.3 & 0 & 10.2\% & 416.56 & 2.24 \\
10
- TabM (T+E) & 1417${}_{-26,+32}$ & \textcolor{bronze}{\textbf{0.502}} & 12.2 & 6.2 & 2 & \textcolor{bronze}{\textbf{8.9\%}} & 3133.91 & 1.27 \\
11
- CatBoost (T+E) & 1398${}_{-23,+29}$ & 0.433 & 13.1 & 8.5 & 0 & 9.5\% & 1665.53 & 0.56 \\
12
- ModernNCA (T+E) & 1382${}_{-21,+26}$ & 0.444 & 13.7 & \textcolor{bronze}{\textbf{4.5}} & \textcolor{bronze}{\textbf{5}} & 9.8\% & 4618.50 & 7.74 \\
13
- CatBoost (T) & 1376${}_{-26,+22}$ & 0.416 & 14.1 & 8.6 & 0 & 10.0\% & 1665.53 & 0.07 \\
14
- XGBoost (T+E) & 1373${}_{-23,+22}$ & 0.361 & 14.2 & 8.1 & 1 & 10.8\% & 700.96 & 1.44 \\
15
- LightGBM (T) & 1355${}_{-21,+29}$ & 0.363 & 14.9 & 10.4 & 0 & 11.1\% & 416.56 & 0.38 \\
16
- CatBoost (D) & 1354${}_{-26,+24}$ & 0.372 & 15.2 & 9.3 & 1 & 10.8\% & 6.70 & 0.09 \\
17
- TabM (T) & 1354${}_{-27,+27}$ & 0.413 & 15.1 & 8.3 & 0 & 9.8\% & 3133.91 & 0.13 \\
18
- XGBoost (T) & 1348${}_{-24,+25}$ & 0.313 & 15.4 & 8.9 & 1 & 11.0\% & 700.96 & 0.21 \\
19
- ModernNCA (T) & 1344${}_{-21,+25}$ & 0.307 & 15.5 & 6.8 & 2 & 10.5\% & 4618.50 & 0.47 \\
20
- TabPFNv2 (T+E) & 1324${}_{-26,+26}$ & 0.476 & 16.6 & \textcolor{silver}{\textbf{3.4}} & \textcolor{gold}{\textbf{11}} & 10.4\% & 2942.08 & 17.37 \\
21
- TabM (D) & 1284${}_{-19,+23}$ & 0.285 & 18.5 & 11.6 & 0 & 12.7\% & 11.56 & 0.13 \\
22
- RealMLP (T) & 1282${}_{-26,+24}$ & 0.229 & 18.7 & 12.9 & 0 & 12.3\% & 6559.12 & 0.35 \\
23
- TabICL (D) & 1279${}_{-22,+27}$ & 0.381 & 18.8 & 6.2 & 4 & 11.5\% & 6.86 & 1.52 \\
24
- TabPFNv2 (T) & 1269${}_{-18,+26}$ & 0.360 & 19.3 & 6.4 & 1 & 12.1\% & 2942.08 & 0.26 \\
25
- TorchMLP (T+E) & 1258${}_{-22,+25}$ & 0.220 & 19.8 & 14.0 & 0 & 12.4\% & 2832.80 & 1.80 \\
26
- EBM (T+E) & 1230${}_{-23,+23}$ & 0.184 & 21.4 & 11.5 & 0 & 15.8\% & 1323.39 & 0.18 \\
27
- TabPFNv2 (D) & 1228${}_{-21,+24}$ & 0.324 & 21.4 & 6.5 & 3 & 13.1\% & 3.27 & 0.32 \\
28
- ModernNCA (D) & 1224${}_{-24,+24}$ & 0.159 & 21.7 & 11.5 & 1 & 15.4\% & 13.74 & 0.32 \\
29
- TabDPT (D) & 1215${}_{-27,+27}$ & 0.300 & 22.1 & 6.3 & 4 & 14.7\% & 20.56 & 8.62 \\
30
- RealMLP (D) & 1207${}_{-21,+26}$ & 0.104 & 22.4 & 16.2 & 0 & 14.0\% & 21.59 & 1.49 \\
31
- ExtraTrees (T+E) & 1204${}_{-24,+23}$ & 0.124 & 22.7 & 13.7 & 0 & 15.8\% & 191.44 & 0.76 \\
32
- EBM (T) & 1187${}_{-27,+23}$ & 0.135 & 23.5 & 11.7 & 1 & 16.5\% & 1323.39 & 0.02 \\
33
- TorchMLP (T) & 1186${}_{-19,+25}$ & 0.113 & 23.6 & 17.7 & 0 & 14.3\% & 2832.80 & 0.11 \\
34
- XGBoost (D) & 1185${}_{-24,+21}$ & 0.115 & 23.7 & 14.1 & 1 & 14.3\% & 2.06 & 0.12 \\
35
- ExtraTrees (T) & 1169${}_{-25,+25}$ & 0.094 & 24.4 & 12.8 & 0 & 16.7\% & 191.44 & 0.10 \\
36
- FastaiMLP (T+E) & 1164${}_{-22,+27}$ & 0.158 & 24.6 & 16.1 & 0 & 16.5\% & 594.95 & 4.65 \\
37
- RandomForest (T+E) & 1158${}_{-21,+23}$ & 0.098 & 25.0 & 15.1 & 0 & 16.4\% & 377.08 & 0.75 \\
38
- EBM (D) & 1150${}_{-26,+27}$ & 0.134 & 25.3 & 13.4 & 1 & 17.4\% & 5.48 & 0.06 \\
39
- LightGBM (D) & 1148${}_{-20,+28}$ & 0.088 & 25.4 & 21.6 & 0 & 15.2\% & 2.20 & 0.17 \\
40
- RandomForest (T) & 1115${}_{-16,+22}$ & 0.072 & 27.1 & 21.5 & 0 & 17.3\% & 377.08 & 0.09 \\
41
- FastaiMLP (T) & 1098${}_{-20,+23}$ & 0.073 & 27.8 & 19.5 & 0 & 18.1\% & 594.95 & 0.34 \\
42
- TorchMLP (D) & 1027${}_{-29,+20}$ & 0.022 & 31.0 & 27.5 & 0 & 19.8\% & 8.96 & 0.13 \\
43
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.009 & 32.2 & 27.3 & 0 & 22.6\% & 0.43 & 0.05 \\
44
- FastaiMLP (D) & 974${}_{-23,+24}$ & 0.023 & 33.2 & 29.9 & 0 & 22.2\% & 3.12 & 0.31 \\
45
- ExtraTrees (D) & 967${}_{-26,+26}$ & 0.025 & 33.5 & 29.6 & 0 & 24.4\% & 0.26 & 0.05 \\
46
- Linear (T+E) & 870${}_{-28,+26}$ & 0.031 & 36.7 & 21.4 & 1 & 32.7\% & 47.11 & 0.16 \\
47
- Linear (T) & 831${}_{-22,+28}$ & 0.020 & 37.7 & 30.6 & 0 & 33.3\% & 47.11 & 0.06 \\
48
- Linear (D) & 798${}_{-31,+26}$ & 0.014 & 38.5 & 36.4 & 0 & 34.6\% & 1.53 & 0.09 \\
49
- KNN (T+E) & 715${}_{-36,+28}$ & 0.000 & 40.4 & 36.3 & 0 & 45.8\% & 2.61 & 0.16 \\
50
- KNN (T) & 650${}_{-36,+35}$ & 0.000 & 41.5 & 33.4 & 0 & 47.5\% & 2.61 & 0.03 \\
51
- KNN (D) & 466${}_{-48,+39}$ & 0.000 & 43.7 & 43.2 & 0 & 54.7\% & 0.05 & 0.02 \\
52
- \bottomrule
53
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/lite/full-imputed/tabarena_leaderboard.csv DELETED
@@ -1,46 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- AutoGluon 1.3 (4h),8355.364120885437,32.82596967734543,2898.240203433117,4.500812758268476,0.3819328741735569,0.3770688966591946,0.0,0.06390622359293244,0.041217519495144274,41965.2048341603,616.5290381859439,7.598039215686274,0.20731,7964.962425470352,4.336968660354614,1453.2679445956664,3.1473262800156863,0.32575782629688715,0.3195055666666006,0.0,0.036220438279793044,0.016044582675404963,27194.492186888157,175.7696866507318,6.0,7,7,3,34,1542.2,32.0,33.0,0.8500445632798574,0.31160926120777893
3
- REALMLP (tuned + ensemble),82067.04150357434,46.620822546528835,8632.9301863134,17.370629146496764,0.4634803896199485,0.45924468792998446,0.0,0.08266241385217407,0.05360332482663363,176783.27502952865,866.0711810995012,10.029411764705882,0.20693,31275.486653327942,20.346505880355835,6559.124623720848,8.595013673531266,0.4233707478718706,0.4266057238194146,0.0,0.041676367869615816,0.02894053041859053,137543.59089717592,754.0990720578875,8.0,3,3,4,41,1468.6,28.5,19.5,0.7947860962566845,0.2059245232356837
4
- GBM (tuned + ensemble),3051.8140357896395,23.19597311113395,765.6312795719994,3.893554958066526,0.5482943614672702,0.5802324566147745,0.0,0.10195425970004891,0.07582209751094779,11175.64311228686,518.7842315699711,11.92156862745098,0.2094,1670.2684843540192,3.933397054672241,416.5640684155986,2.2357814013957977,0.5661942244497019,0.5969263542300586,0.0,0.056825533415868046,0.03541199658807983,9062.280137688022,129.84563236620332,12.0,0,0,5,46,1425.1,26.5,24.7,0.7517825311942959,0.12037470444670732
5
- TABM_GPU (tuned + ensemble),36745.76943855192,8.411393100140142,5149.522818881001,3.299204733305397,0.49791774396870153,0.5366251533007463,0.0,0.08910568804791694,0.08145841683955529,79103.63593427226,131.94770411319155,12.176470588235293,0.20702,8818.62617468834,3.054008960723877,3133.9115732644937,1.2727714166408632,0.4651893659350262,0.5236377793041334,0.0,0.048678737735947,0.03776447060427699,49987.707956643724,112.26280083003438,10.0,2,0,4,45,1417.3,31.4,25.3,0.7459893048128342,0.16063806759656885
6
- CAT (tuned + ensemble),19506.186602929058,3.204344211840162,4009.454268521621,1.0312463789689519,0.5665168479793786,0.5765483867503225,0.0,0.0952275764012927,0.06667748555681205,53079.26276755286,76.04544084479011,13.068627450980392,0.20994,7158.277770042419,1.5355591773986816,1665.5315644683697,0.5594151152585728,0.5695869531891277,0.5656478386606927,0.0,0.04848406576376174,0.03278210845362601,22918.764355268395,47.71552750484114,11.0,0,0,5,46,1398.0,28.5,22.7,0.7257130124777184,0.11706898907023527
7
- MNCA_GPU (tuned + ensemble),50606.46933756155,274.8289722134085,6200.95174646742,33.492227920869496,0.5559334685448146,0.5359099538014397,0.0,0.09840471071491197,0.08190173524814136,106030.16699977385,1982.0660894780217,13.72549019607843,0.2069,14787.927460432053,13.568246603012085,4618.502341732761,7.737494511886756,0.5384753932635177,0.49725904468662846,0.0,0.05042203805715495,0.026551744085150493,92410.46095689958,607.7040280210158,12.0,5,3,2,41,1382.0,25.1,20.9,0.7107843137254902,0.2206650694868955
8
- CAT (tuned),19506.186602929058,0.4801775754666796,4009.454268521621,0.12298834764615349,0.5837954660770609,0.6105658884215094,0.0,0.09997676304978957,0.07166766279017629,53079.26276755286,9.224590724231657,14.107843137254902,0.2099,7158.277770042419,0.14083647727966309,1665.5315644683697,0.06528969211261772,0.5845247966032667,0.6121904865449591,0.0,0.052825231831448494,0.039234174928719376,22918.764355268395,4.080719588963731,14.0,0,3,0,48,1375.8,21.9,25.2,0.7020944741532977,0.11684766359499667
9
- XGB (tuned + ensemble),5938.998393348619,7.841493718764362,1213.0777679819278,3.1002279660820715,0.6385613101625683,0.6367323927592745,0.0,0.10771210097547056,0.08419601774905479,13936.971832341356,215.6442792684636,14.186274509803921,0.20933,2063.2708826065063,2.946662425994873,700.9607998746845,1.4386570273653,0.671301017368657,0.6269820382919887,0.0,0.06457358707058436,0.04050001929889269,11430.712273277264,98.30128646762262,13.0,1,2,1,47,1372.9,21.9,22.3,0.7003119429590018,0.12270752448323713
10
- GBM (tuned),3051.8140357896395,3.3178744736839745,765.6312795719994,0.6787985114862476,0.6366067117608026,0.6570773231970207,0.0,0.11117221268454557,0.09077971360559027,11175.64311228686,83.45556234970765,14.941176470588236,0.20979,1670.2684843540192,0.7316880226135254,416.5640684155986,0.3811195826851199,0.6296869528283883,0.6894469650860278,0.0,0.05983857821112848,0.04176567096628159,9062.280137688022,21.854692683376335,13.0,0,1,0,50,1355.3,28.8,20.5,0.6831550802139037,0.09660409682118858
11
- TABM_GPU (tuned),36745.76943855192,0.9327322455013499,5149.522818881001,0.3522030333025916,0.5874385027995309,0.6143195596137613,0.0,0.09841965478146567,0.09572048915752591,79103.63593427226,14.22083187601248,15.068627450980392,0.20774,8818.62617468834,0.25295114517211914,3133.9115732644937,0.13006170213303378,0.559085035183601,0.6325459317585411,0.0,0.04950878682353699,0.05438833715883013,49987.707956643724,9.039602142310635,14.0,0,2,4,45,1353.7,27.0,26.1,0.6802584670231729,0.12038975799158216
12
- CAT (default),186.54889291408014,0.4368261870215921,85.73305725415607,0.1697456044717195,0.6279909747535407,0.661300443911613,0.0,0.10765800375861413,0.07724638069569476,425.6698111187737,11.747370317227338,15.176470588235293,0.20726,25.447056531906128,0.4332277774810791,6.700347814449044,0.08827101352602937,0.6534673037587888,0.6999391357273155,0.0,0.06252446837171821,0.03385856499128612,120.27803545982526,7.036160265837067,15.0,1,1,0,49,1354.2,23.8,25.1,0.6778074866310161,0.10725471626676596
13
- XGB (tuned),5938.998393348619,1.6265712625840132,1213.0777679819278,0.6984321316309747,0.6869435084156271,0.6676202920248872,0.0,0.10974734030407483,0.08824361033497355,13936.971832341356,46.56690686937765,15.362745098039216,0.20927,2063.2708826065063,0.5078930854797363,700.9607998746845,0.21305428930075773,0.7193303230280006,0.7184922523909244,0.0,0.06599393337115189,0.04761154585613394,11430.712273277264,15.141274536165636,14.0,1,2,0,48,1348.1,24.2,23.5,0.6735739750445633,0.11261030495027546
14
- MNCA_GPU (tuned),50606.46933756155,14.969161491768032,6200.95174646742,1.6696617082037302,0.692913423977365,0.6230114933852047,0.0,0.10481212169907277,0.09343196398343544,106030.16699977385,104.17477306753874,15.53921568627451,0.20447,14787.927460432053,0.6251811981201172,4618.502341732761,0.46952933073043823,0.7553848478357736,0.6315515001589479,0.0,0.06144927536231881,0.05466681384890431,92410.46095689958,31.933384765725453,13.0,2,2,1,46,1344.5,24.7,21.0,0.6695632798573975,0.14651187196898943
15
- TABPFNV2_GPU (tuned + ensemble),9280.937638614692,101.0832102392234,2836.0220285499013,48.21038294992157,0.5239705071709494,0.5730376960557855,0.35294117647058826,0.10366348703715766,0.0977540660157534,61984.06966966662,4163.892043743008,16.558823529411764,0.21666,1926.0209152698517,7.814479351043701,2942.0790635597914,17.37173979844504,0.5915448073500406,0.6153846153846155,0.0,0.08318481827952429,0.03857004834586989,26428.31699976118,914.2037538981667,15.0,11,1,2,37,1324.0,26.0,25.5,0.6463903743315508,0.2928543983886399
16
- TABM_GPU (default),142.54126416468154,1.0708042359819598,23.88140219729834,0.4005178443372889,0.7149663388553331,0.7152499495938582,0.0,0.1268778920544805,0.12106183448201867,305.2739016434669,13.641685471262758,18.54901960784314,0.2124,38.16493201255798,0.17726969718933105,11.555005609392405,0.12732556548192603,0.8197432069948619,0.7811252019030427,0.0,0.07455117151840984,0.05717855281816143,191.5780246731349,9.78619298588067,17.0,0,1,1,49,1284.0,23.0,18.3,0.6011586452762924,0.08586090377844187
17
- REALMLP (tuned),82067.04150357434,2.0937890398736094,8632.9301863134,0.8676478578817012,0.770575243056979,0.738222159537682,0.0,0.12250815024892807,0.10245511989368125,176783.27502952865,34.039924606828805,18.666666666666668,0.21531,31275.486653327942,0.5825295448303223,6559.124623720848,0.349657589934167,0.866614722551564,0.7710071255853564,0.0,0.0698656429942418,0.05347283246893191,137543.59089717592,25.842995839112344,17.0,0,1,0,50,1282.5,23.5,25.8,0.5984848484848485,0.07744336875028272
18
- TABICL_GPU (default),89.99167278701184,15.994201370314055,7.671787946091385,1.7323065455173496,0.6187626573361198,0.6945869568436454,0.29411764705882354,0.11541084719646724,0.10728998455140165,134.32668884285462,203.85595038983618,18.784313725490197,0.20899,19.30116367340088,1.4308764934539795,6.856698312130071,1.5204305848178132,0.6419318555001297,0.8203500230278312,0.0,0.06412908564335906,0.048797809240279176,96.03934388429276,99.16403825333815,18.0,4,2,2,43,1278.8,26.8,21.6,0.5958110516934046,0.16207391452431377
19
- TABPFNV2_GPU (tuned),9280.937638614692,4.082395156224568,2836.0220285499013,1.9953713679087344,0.6402449044416614,0.6909552209814358,0.35294117647058826,0.12100773771158657,0.11491559646058619,61984.06966966662,159.46255672145648,19.264705882352942,0.22795,1926.0209152698517,0.3721911907196045,2942.0790635597914,0.2619314172957417,0.7603547647708915,0.9374660731733828,0.0,0.09148856392950888,0.05918640451838397,26428.31699976118,18.72396576888414,21.0,1,6,4,40,1269.0,25.7,17.4,0.5848930481283422,0.1557082671662397
20
- NN_TORCH (tuned + ensemble),29357.971773493522,13.246924213334626,3694.4132392234865,3.45030206949641,0.7804812465107793,0.7942281910602976,0.0,0.12369156459717033,0.10097362308285791,94556.98181855923,212.38450473432735,19.823529411764707,0.21375,10620.000384569168,3.6842262744903564,2832.7960851387297,1.800873875617981,0.9103501401347731,0.9282133174880375,0.0,0.07922111196515602,0.06654974418641565,57294.450258986595,164.18272158737435,21.0,0,0,1,50,1258.4,24.8,21.8,0.5721925133689839,0.0711803947184648
21
- EBM (tuned + ensemble),33815.09879914452,1.2886618586147534,6555.968895975794,0.4924074534971456,0.8160130792555895,0.7922116536748646,0.0,0.15830919949651442,0.14484597408084915,41186.37921180388,19.776872049406894,21.401960784313726,0.21048,2774.86767077446,0.33999180793762207,1323.3940540554784,0.18426990509033203,0.958308811651104,0.966938614382516,0.0,0.11314432082836512,0.06365614798694313,17200.38772644666,11.089686715499157,21.0,0,2,1,48,1230.3,22.2,22.9,0.536319073083779,0.08663578954059327
22
- TABPFNV2_GPU (default),12.262318129632988,0.840339740117391,4.40142189361026,0.44785573341984464,0.6758133693268394,0.7298195241925087,0.35294117647058826,0.1312587810821333,0.12700343736871175,68.41353371526327,34.453290769213694,21.41176470588235,0.22782,9.203422546386719,0.3870081901550293,3.265341403173851,0.3151157987035174,0.956340758739766,1.0,0.0,0.08973027820339718,0.08167997085223988,43.42507231078893,20.360866660523406,24.5,3,2,6,40,1227.9,23.6,20.7,0.536096256684492,0.15413057435361202
23
- MNCA_GPU (default),245.15467273020278,8.083605813045128,18.524091854412294,1.1280259623754931,0.840779547661087,0.8081572743289607,0.0,0.15404946603042724,0.11095777674889704,315.39790519179047,62.504831773745536,21.705882352941178,0.21297,40.294944524765015,0.5322227478027344,13.739717303881024,0.31576028319217697,1.0,0.9641203703703644,0.0,0.07931711197987557,0.08713555135923144,220.32243700895648,24.025524470708156,21.0,1,1,0,49,1224.1,23.8,23.1,0.5294117647058824,0.08712998268466692
24
- TABDPT_GPU (default),165.2279131459255,62.46656582402248,26.956914471170485,21.63883475001047,0.6999265055194402,0.7216180431261457,0.0,0.14742503114565889,0.11679298135501644,555.3035574597842,1632.683869591928,22.068627450980394,0.21757,99.63741898536682,27.656949520111084,20.559618245438234,8.61670085798515,0.9903591350487535,1.0,0.0,0.09511484956324834,0.05489681185866076,445.8805538455051,1247.4102886322382,25.0,4,3,0,44,1214.7,26.4,26.9,0.5211675579322638,0.15809881346420043
25
- REALMLP (default),265.33954032262164,3.8989146456998935,24.880994813527074,3.185210885583871,0.8959152468625471,0.852274326929426,0.0,0.14042432469169164,0.11551037868997248,556.4275782424967,165.1494501601144,22.392156862745097,0.21633,100.95172715187073,3.2151546478271484,21.58663305185609,1.4943033874536162,1.0,0.9992770772609277,0.0,0.09708065314200909,0.08128250140770318,474.8079401992076,108.65711984315752,22.0,0,1,0,50,1207.3,25.5,20.9,0.5138146167557932,0.06168773598671817
26
- XT (tuned + ensemble),1293.0170739772273,3.137296905704573,465.97356682221573,1.4220614441065493,0.8763124357874087,0.8301977318774126,0.0,0.15758629775903474,0.13448385543428582,5360.97691412289,92.16814260531523,22.705882352941178,0.21292,775.4602122306824,2.0695900917053223,191.43562446750065,0.7604422990939085,1.0,1.0,0.0,0.10874752469158522,0.0792857142857151,3382.254534218774,83.98689534392173,26.0,0,1,0,50,1204.3,22.1,23.2,0.5066844919786097,0.07296872869442048
27
- EBM (tuned),33815.09879914452,0.15010464892667882,6555.968895975794,0.06527171144492658,0.8649702054759879,0.8288322534431326,0.0,0.1645737449972851,0.15598296035475784,41186.37921180388,2.297509957092046,23.480392156862745,0.21528,2774.86767077446,0.03941011428833008,1323.3940540554784,0.01867789003336745,1.0,1.0,0.0,0.11378284728865762,0.0787717680247385,17200.38772644666,1.1641602772099668,24.0,1,1,1,48,1186.8,22.2,26.4,0.4890819964349376,0.08578632571936438
28
- NN_TORCH (tuned),29357.971773493522,0.60710202478895,3694.4132392234865,0.18506171262100615,0.8872281679035978,0.8398169698453672,0.0,0.14294716948269776,0.12285127642924888,94556.98181855923,10.572861046995973,23.647058823529413,0.21109,10620.000384569168,0.22349023818969727,2832.7960851387297,0.11152923107147217,1.0,1.0,0.0,0.09826070825346578,0.07345521618835778,57294.450258986595,8.23840338809871,23.0,0,0,0,51,1185.5,24.3,18.9,0.4852941176470588,0.05642920128249442
29
- XGB (default),13.003299437317194,0.6573358236574659,3.2763217873435058,0.3062962827156932,0.8848348696110224,0.8451108258000312,0.0,0.1428411707448398,0.1398063099437038,39.750102662431935,19.332008402042312,23.745098039215687,0.21746,6.575343370437622,0.39388179779052734,2.0589472404367264,0.12183857766160212,1.0,1.0,0.0,0.09723220704529156,0.0632886818093575,32.87694416794321,9.490148793740056,24.0,1,0,0,50,1184.7,20.7,23.7,0.483065953654189,0.07091975928291895
30
- XT (tuned),1293.0170739772273,0.34508594344643984,465.97356682221573,0.17473237760756105,0.9058524786574399,0.8395888691154477,0.0,0.16724894940491772,0.14049436138011492,5360.97691412289,10.510664591668581,24.441176470588236,0.21281,775.4602122306824,0.17961668968200684,191.43562446750065,0.10098353169828128,1.0,1.0,0.0,0.12289592760181012,0.08661653630785078,3382.254534218774,9.053845119655822,29.0,0,2,0,49,1168.6,24.3,24.1,0.4672459893048128,0.07797298351915956
31
- FASTAI (tuned + ensemble),6591.795950641819,16.245619264303468,1342.3410902155488,7.489539632560419,0.8424987698640145,0.8625844225885649,0.0,0.1645508356317889,0.13121098785431634,21804.052352671944,459.2322661590378,24.58823529411765,0.22665,3142.961499929428,10.290910243988037,594.9528585230638,4.650812904660155,1.0,1.0,0.0,0.10209375579025382,0.0868129401727155,17514.935036133815,369.5983579935041,27.0,0,0,0,51,1164.2,26.2,21.9,0.46390374331550804,0.062026986093925165
32
- RF (tuned + ensemble),2227.1437476008546,2.46178646181144,528.9202336737816,1.2212931064922863,0.9024507372146047,0.8804069247412386,0.0,0.1641730645746029,0.14796534274167691,6570.203557563642,82.81791093485884,24.96078431372549,0.21651,876.37784075737,1.7210140228271484,377.08301133934634,0.7469802601991967,1.0,1.0,0.0,0.08806938150372812,0.0845151199165799,5196.472002023166,66.34473572972412,27.0,0,2,0,49,1158.1,22.7,20.9,0.4554367201426025,0.06636587025579366
33
- EBM (default),83.37771165604686,0.18329166431053012,10.41915882393762,0.09845206874115144,0.8658153943709024,0.850811039552388,0.0,0.17382418641801253,0.16315261858436833,128.33936987969741,3.956299459273312,25.34313725490196,0.21098,11.009112119674683,0.06519913673400879,5.48102419993599,0.05915899401478989,1.0,1.0,0.0,0.1212644465105922,0.07181719260065285,66.69318849626488,3.0422808378588053,25.0,1,0,0,50,1150.4,26.4,25.5,0.44674688057041,0.07488201637851248
34
- GBM (default),9.322186521455354,1.1440442029167623,3.199181885596564,0.3147197775938453,0.9115791356403461,0.8847842293676881,0.0,0.15247368372829376,0.14137233583288317,46.93362314737936,31.99287107427668,25.38235294117647,0.21883,6.97411584854126,0.6232860088348389,2.2021467310523017,0.17114277689525664,1.0,1.0,0.0,0.10076080905548301,0.07712613158180785,26.977037743860404,12.72044936701149,26.0,0,0,0,51,1147.7,27.2,20.0,0.44585561497326204,0.046209500110957015
35
- RF (tuned),2227.1437476008546,0.2597549288880591,528.9202336737816,0.14418858473403337,0.9281198477241044,0.9248817449766008,0.0,0.1727900363849873,0.15908060362975035,6570.203557563642,8.720931698010416,27.137254901960784,0.21762,876.37784075737,0.15849757194519043,377.08301133934634,0.09141294871228287,1.0,1.0,0.0,0.11332046274795926,0.10944640481286505,5196.472002023166,7.735832448019426,28.0,0,0,0,51,1114.6,21.9,15.8,0.40597147950089124,0.046476658714260786
36
- FASTAI (tuned),6591.795950641819,1.031626776152966,1342.3410902155488,0.6256477184380718,0.9268828889794378,0.9090286238924099,0.0,0.18088316750549646,0.15962504324274882,21804.052352671944,39.15043952663921,27.823529411764707,0.23007,3142.961499929428,0.8062961101531982,594.9528585230638,0.33651872811068495,1.0,1.0,0.0,0.13059040884116846,0.09939086593924433,17514.935036133815,29.379474918801876,29.0,0,0,1,50,1098.5,22.5,19.5,0.39037433155080214,0.051224691983945664
37
- NN_TORCH (default),73.82550601865731,0.5703827493331012,14.544520367121898,0.24243997995483998,0.978335352631338,0.9586040127970135,0.0,0.19812515552294435,0.1801377474036213,302.60238317827014,12.920116170611008,31.0,0.21872,30.76021695137024,0.2736239433288574,8.95763915486452,0.12885630130767822,1.0,1.0,0.0,0.12796036111893505,0.12220480897522226,159.50005812712556,8.385666110468058,33.0,0,0,0,51,1027.2,19.6,28.9,0.3181818181818182,0.03632288625440765
38
- RF (default),5.493931228039312,0.18195290658988206,0.9395711378622816,0.07441601332167096,0.9905963640075596,0.9798381459527805,0.0,0.225775704832811,0.2287570644353416,12.413845548911803,5.3715309875205834,32.166666666666664,0.23701,1.763962745666504,0.09250092506408691,0.4316611380184793,0.05251745318464858,1.0,1.0,0.0,0.16975626521036435,0.15585831773749825,7.440479734614894,4.454035040702154,34.0,0,0,0,51,1000.0,0.0,0.0,0.2916666666666667,0.036570885765985314
39
- FASTAI (default),27.62558492492227,1.0500701642503925,4.982714620465621,0.48138517234406897,0.9771222767390375,0.958723139710105,0.0,0.22174105570586602,0.2081137632674128,85.33850461343444,31.816885619382305,33.15686274509804,0.23658,14.524733304977417,0.6894059181213379,3.1153293528650394,0.31195542895805195,1.0,1.0,0.0,0.15646970534797477,0.132224835215793,80.02261780548187,26.24796931235529,35.0,0,0,0,51,973.6,23.4,22.8,0.26916221033868093,0.033449943264817414
40
- XT (default),3.237064623365215,0.20349772771199545,0.7705107029297205,0.07802443108906316,0.9748030390364969,0.9594558445418607,0.0,0.24396773959995877,0.2511012609659614,6.714510413684847,5.481544853676871,33.470588235294116,0.24149,0.9253263473510742,0.08609175682067871,0.25976606260770435,0.0542029349444295,1.0,1.0,0.0,0.174332553882657,0.1548442777574803,5.482206453649765,4.486837961877324,37.0,0,0,0,51,966.8,25.3,25.8,0.2620320855614973,0.03383207547893805
41
- LR (tuned + ensemble),297.7768230718725,1.372081803340538,106.57523410916248,0.517828397925699,0.968958377581858,0.9783679131342927,0.0,0.3267575967555976,0.3734069213226812,1284.9569849451952,23.77736863105658,36.65686274509804,0.24535,167.935129404068,0.30171775817871094,47.11049555587997,0.15632276260139144,1.0,1.0,0.0,0.2711660718812774,0.2552683896620274,878.5162567525001,9.250054132082282,40.0,1,0,0,50,869.8,25.6,27.7,0.18961675579322637,0.04682116614524594
42
- LR (tuned),297.7768230718725,0.41669238314909096,106.57523410916248,0.1447260883613805,0.9796785843199748,0.9851104149746683,0.0,0.33297402038592483,0.3859918587868161,1284.9569849451952,6.250094556288328,37.68627450980392,0.25329,167.935129404068,0.11377191543579102,47.11049555587997,0.0626428060263515,1.0,1.0,0.0,0.274940797173291,0.2552683896620274,878.5162567525001,4.008480462038967,40.5,0,0,1,50,830.7,27.6,21.8,0.1662210338680927,0.03268777023075063
43
- LR (default),7.020498261732214,0.4395605956806856,2.5089988631155826,0.17274864462745904,0.9859350644688389,0.9900235669288296,0.0,0.3460595840578667,0.4330522609318273,32.25963024407083,8.287063835475498,38.51960784313726,0.25334,5.345212697982788,0.13631606101989746,1.534023110424607,0.08993275118189932,1.0,1.0,0.0,0.27343206074359794,0.31807101339614474,17.290213667932342,5.167879066497201,41.0,0,0,0,51,797.9,25.8,31.0,0.14728163992869875,0.02745926796466409
44
- KNN (tuned + ensemble),129.77637638765222,6.6349917159361,9.578319752293481,0.5506054195292707,1.0,0.9903128920917535,0.11764705882352941,0.4578660927908407,0.614637528386039,73.00625608345185,45.17733544160728,40.38235294117647,0.34596,10.405000686645508,0.23362517356872559,2.6097757270537225,0.15550157631257452,1.0,1.0,0.0,0.4299249109597012,0.6963553049981372,58.272580217378014,12.85103075181078,43.0,0,0,0,51,715.2,27.7,35.4,0.10494652406417113,0.02754382727615239
45
- KNN (tuned),129.77637638765222,1.469562432345222,9.578319752293481,0.11124081145987941,1.0,0.9881581588022348,0.11764705882352941,0.47519157962572545,0.6600331749536156,73.00625608345185,9.204095260134709,41.509803921568626,0.34503,10.405000686645508,0.07799005508422852,2.6097757270537225,0.030069828033447266,1.0,1.0,0.0,0.456326948689309,0.7494456544265635,58.272580217378014,2.2192320534223704,44.0,0,0,1,50,650.1,34.4,35.7,0.07932263814616755,0.029943152461680767
46
- KNN (default),1.8986088995840036,0.1759517753825468,0.5259164666571766,0.03511341794167653,1.0,1.0,0.11764705882352941,0.5469569269271738,0.9181123518799813,1.049764217774364,2.1650106683540193,43.666666666666664,0.42464,0.2273859977722168,0.03137516975402832,0.0509044812778989,0.01926569938659668,1.0,1.0,0.0,0.5196494204127793,1.0,1.0,1.4495271718572689,45.0,0,0,0,51,465.9,38.6,48.0,0.030303030303030304,0.023129909171668465
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/lite/full-imputed/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:a7e19da04108e6a0d8bd806750790d0091f4dd89110d8ef755dc1de557455ee0
3
- size 461215
 
 
 
 
data/lite/full-imputed/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:5ddaae5f1486cdbcbdcf15c440763ac4b33e344dba37d1ffac7b28084bd04b34
3
- size 242150
 
 
 
 
data/lite/full-imputed/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:62c27a818e24d800c3d78a53b1aabb85ae98f1c9c0963d7475d5fd5b016bf159
3
- size 221898
 
 
 
 
data/tabicl-imputed/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:dfe05743ad36dc6fa0971aed082a30f6e53986114ec792b70ceeac6dc33adbf3
3
- size 336147
 
 
 
 
data/tabicl-imputed/leaderboard.tex DELETED
@@ -1,53 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- AutoGluon 1.3 (4h) & \textcolor{gold}{\textbf{1602${}_{-28,+35}$}} & \textcolor{gold}{\textbf{0.586}} & \textcolor{gold}{\textbf{9.4}} & \textcolor{bronze}{\textbf{3.5}} & \textcolor{bronze}{\textbf{5}} & \textcolor{gold}{\textbf{7.0\%}} & 1222.55 & 2.36 \\
8
- TabM (T+E) & \textcolor{silver}{\textbf{1594${}_{-29,+40}$}} & 0.502 & \textcolor{silver}{\textbf{9.6}} & 4.3 & 3 & 8.6\% & 2387.83 & 1.47 \\
9
- RealMLP (T+E) & \textcolor{bronze}{\textbf{1577${}_{-34,+30}$}} & 0.477 & \textcolor{bronze}{\textbf{10.3}} & 7.3 & 0 & \textcolor{bronze}{\textbf{8.5\%}} & 6446.89 & 10.45 \\
10
- TabICL (D) & 1569${}_{-29,+34}$ & \textcolor{silver}{\textbf{0.530}} & 10.6 & \textcolor{silver}{\textbf{3.4}} & \textcolor{silver}{\textbf{6}} & \textcolor{silver}{\textbf{7.3\%}} & 8.68 & 1.81 \\
11
- LightGBM (T+E) & 1548${}_{-32,+30}$ & 0.407 & 11.4 & 6.5 & 1 & 10.3\% & 374.34 & 1.46 \\
12
- TabM (T) & 1508${}_{-32,+34}$ & 0.417 & 13.1 & 6.2 & 1 & 9.6\% & 2387.83 & 0.16 \\
13
- CatBoost (T+E) & 1498${}_{-30,+29}$ & 0.393 & 13.5 & 8.5 & 0 & 9.5\% & 1233.49 & 0.52 \\
14
- CatBoost (T) & 1488${}_{-33,+36}$ & 0.375 & 14.0 & 6.9 & 1 & 9.7\% & 1233.49 & 0.07 \\
15
- TabPFNv2 (T+E) & 1488${}_{-35,+30}$ & \textcolor{bronze}{\textbf{0.515}} & 14.0 & \textcolor{gold}{\textbf{3.0}} & \textcolor{gold}{\textbf{8}} & 9.2\% & 3031.01 & 27.04 \\
16
- CatBoost (D) & 1477${}_{-33,+30}$ & 0.359 & 14.5 & 6.9 & 1 & 10.7\% & 5.31 & 0.07 \\
17
- LightGBM (T) & 1474${}_{-37,+31}$ & 0.320 & 14.6 & 12.3 & 0 & 11.0\% & 374.34 & 0.22 \\
18
- XGBoost (T+E) & 1468${}_{-29,+31}$ & 0.333 & 14.9 & 10.1 & 0 & 11.1\% & 637.94 & 1.22 \\
19
- ModernNCA (T) & 1438${}_{-34,+33}$ & 0.286 & 16.4 & 9.5 & 1 & 10.8\% & 4614.64 & 0.52 \\
20
- XGBoost (T) & 1429${}_{-37,+33}$ & 0.278 & 16.8 & 13.8 & 0 & 11.6\% & 637.94 & 0.17 \\
21
- ModernNCA (T+E) & 1428${}_{-32,+25}$ & 0.391 & 16.7 & 7.3 & 0 & 10.8\% & 4614.64 & 8.40 \\
22
- TabPFNv2 (T) & 1414${}_{-35,+32}$ & 0.396 & 17.3 & 4.9 & 1 & 11.7\% & 3031.01 & 0.59 \\
23
- TabM (D) & 1404${}_{-30,+24}$ & 0.281 & 18.0 & 11.5 & 0 & 12.9\% & 9.82 & 0.13 \\
24
- TabPFNv2 (D) & 1394${}_{-40,+28}$ & 0.363 & 18.5 & 4.6 & 4 & 12.7\% & 3.33 & 0.33 \\
25
- TorchMLP (T+E) & 1387${}_{-30,+29}$ & 0.239 & 18.8 & 14.5 & 0 & 11.9\% & 2372.22 & 2.09 \\
26
- RealMLP (T) & 1377${}_{-28,+34}$ & 0.182 & 19.3 & 16.6 & 0 & 12.3\% & 6446.89 & 0.47 \\
27
- EBM (T+E) & 1373${}_{-36,+28}$ & 0.179 & 19.4 & 13.5 & 0 & 15.5\% & 895.61 & 0.20 \\
28
- FastaiMLP (T+E) & 1351${}_{-31,+32}$ & 0.215 & 20.5 & 12.1 & 0 & 14.9\% & 582.77 & 4.52 \\
29
- ModernNCA (D) & 1337${}_{-35,+29}$ & 0.148 & 21.2 & 11.9 & 1 & 14.5\% & 14.53 & 0.34 \\
30
- EBM (T) & 1312${}_{-31,+27}$ & 0.118 & 22.5 & 18.0 & 0 & 16.2\% & 895.61 & 0.02 \\
31
- RealMLP (D) & 1280${}_{-29,+32}$ & 0.093 & 24.1 & 20.6 & 0 & 14.7\% & 20.98 & 0.70 \\
32
- EBM (D) & 1277${}_{-25,+36}$ & 0.126 & 24.1 & 11.6 & 1 & 17.0\% & 3.72 & 0.04 \\
33
- XGBoost (D) & 1264${}_{-30,+27}$ & 0.111 & 24.8 & 19.5 & 0 & 14.6\% & 1.58 & 0.11 \\
34
- TabDPT (D) & 1261${}_{-34,+32}$ & 0.190 & 24.9 & 7.8 & 2 & 15.9\% & 20.51 & 8.54 \\
35
- TorchMLP (T) & 1258${}_{-36,+30}$ & 0.096 & 25.2 & 21.8 & 0 & 14.5\% & 2372.22 & 0.15 \\
36
- ExtraTrees (T+E) & 1255${}_{-29,+33}$ & 0.085 & 25.2 & 19.1 & 0 & 16.5\% & 182.30 & 0.74 \\
37
- FastaiMLP (T) & 1245${}_{-28,+36}$ & 0.099 & 25.7 & 20.3 & 0 & 16.9\% & 582.77 & 0.29 \\
38
- RandomForest (T+E) & 1206${}_{-33,+26}$ & 0.067 & 27.6 & 17.6 & 0 & 17.5\% & 260.01 & 0.74 \\
39
- LightGBM (D) & 1200${}_{-30,+31}$ & 0.062 & 27.8 & 25.8 & 0 & 16.1\% & 1.41 & 0.12 \\
40
- ExtraTrees (T) & 1195${}_{-35,+30}$ & 0.056 & 27.9 & 23.9 & 0 & 17.9\% & 182.30 & 0.07 \\
41
- RandomForest (T) & 1151${}_{-32,+29}$ & 0.044 & 29.9 & 22.8 & 0 & 18.7\% & 260.01 & 0.07 \\
42
- TorchMLP (D) & 1102${}_{-36,+37}$ & 0.024 & 31.9 & 29.1 & 0 & 19.7\% & 6.03 & 0.13 \\
43
- FastaiMLP (D) & 1071${}_{-35,+35}$ & 0.032 & 33.0 & 29.7 & 0 & 22.3\% & 2.81 & 0.32 \\
44
- Linear (T+E) & 1031${}_{-34,+26}$ & 0.044 & 34.4 & 24.9 & 0 & 28.0\% & 44.46 & 0.20 \\
45
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.003 & 35.3 & 33.8 & 0 & 24.4\% & 0.33 & 0.04 \\
46
- Linear (T) & 992${}_{-32,+33}$ & 0.029 & 35.6 & 30.1 & 0 & 28.9\% & 44.46 & 0.07 \\
47
- Linear (D) & 984${}_{-36,+30}$ & 0.020 & 35.9 & 27.2 & 0 & 30.0\% & 1.43 & 0.09 \\
48
- ExtraTrees (D) & 923${}_{-40,+31}$ & 0.011 & 37.5 & 34.5 & 0 & 26.9\% & 0.24 & 0.04 \\
49
- KNN (T+E) & 686${}_{-48,+31}$ & 0.000 & 41.8 & 41.5 & 0 & 49.7\% & 2.79 & 0.18 \\
50
- KNN (T) & 598${}_{-48,+39}$ & 0.000 & 42.8 & 42.7 & 0 & 51.5\% & 2.79 & 0.04 \\
51
- KNN (D) & 410${}_{-76,+57}$ & 0.000 & 44.2 & 44.1 & 0 & 60.0\% & 0.07 & 0.02 \\
52
- \bottomrule
53
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/tabicl-imputed/tabarena_leaderboard.csv DELETED
@@ -1,46 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- AutoGluon 1.3 (4h),7750.377155031171,22.84719785201697,2751.5674214709147,2.9633558383567813,0.4144505963692869,0.4352619231211416,0.0,0.07000986007620899,0.039268860832908876,32689.64667720609,276.28827431069953,9.38888888888889,0.159155,6752.988276031282,3.4718479580349393,1222.5529738029095,2.35500290690449,0.33567162503552783,0.4181224156574922,0.0,0.03226297629046343,0.018303111934730745,26140.06177622487,154.15434639040092,6.0,5,3,1,27,1601.6,34.4,27.8,0.8093434343434344,0.288914812829661
3
- TABM_GPU (tuned + ensemble),32820.849396629244,7.890166007074309,3102.495096524168,1.8344804606248617,0.49788606507212485,0.5371562727976057,0.0,0.08573854850398603,0.05963110226502587,48157.60289475875,128.99098784852723,9.63888888888889,0.17235499999999998,8134.35737352901,2.5847251441743637,2387.831479177676,1.4722284599091153,0.4807674997729695,0.552934558160541,0.0,0.03899671266638771,0.02990990162448854,44088.87353980956,121.43882105497326,9.0,3,2,2,29,1593.6,39.4,28.8,0.8036616161616161,0.2303289526010451
4
- REALMLP (tuned + ensemble),86961.40895521553,48.95350425258095,7053.8901372884375,11.533281107318116,0.5232701355856864,0.5484379442361398,0.0,0.08541598556741743,0.05063432894298128,149033.04908471802,856.2486449740531,10.291666666666666,0.16568,30350.410282479395,22.800243775049843,6446.893200302383,10.450137112404576,0.4880190327087218,0.5530787180349611,0.0,0.04889132639035737,0.023927651830585747,120902.17712034364,769.6999972028425,9.25,0,1,1,34,1577.3,29.6,33.7,0.7888257575757576,0.13750229718767315
5
- TABICL_GPU (default),112.87139065964723,20.33866377333064,9.733153451045638,2.345033367232273,0.46986010190672167,0.5341505383842204,0.0,0.07335178204321441,0.04928097738364384,178.07934402242574,242.8201765656135,10.61111111111111,0.17207,25.732762111557854,3.564396858215332,8.684246340890724,1.808164283651731,0.4743522890813915,0.5681798960380293,0.0,0.03487584587193143,0.016731980584033038,151.54793227614402,132.97465082181864,9.5,6,4,1,25,1569.4,33.3,29.0,0.7815656565656566,0.29813182434220914
6
- GBM (tuned + ensemble),2757.3124551578803,12.456019589415302,657.304398009506,2.529125543711048,0.5933117628464099,0.6164710188936273,0.0,0.10258435392468589,0.0611872038336614,9019.445489261983,194.08976472235491,11.430555555555555,0.16698000000000002,1484.5727322684393,3.4757837878333198,374.34433555986243,1.4573305804667545,0.6420006136885861,0.6386644676612248,0.0,0.05156709998972453,0.0219954442643497,8683.556526223521,104.28918180708692,10.5,1,1,2,32,1548.2,29.8,31.2,0.7629419191919192,0.15405014761380412
7
- TABM_GPU (tuned),32820.849396629244,0.8654038354202553,3102.495096524168,0.19252750021110288,0.5825327072195123,0.5896172732916097,0.0,0.09626470934526843,0.07382049807856697,48157.60289475875,13.240850473332038,13.097222222222221,0.174175,8134.35737352901,0.26544706026713055,2387.831479177676,0.1626291486781835,0.5594681994427791,0.6408535524589001,0.0,0.05532096025125255,0.03209545871697756,44088.87353980956,10.02168824405339,14.0,1,3,2,30,1507.7,33.1,31.5,0.7250631313131313,0.1618180270357294
8
- CAT (tuned + ensemble),14620.521881076362,2.36174942917294,2269.8595810438947,0.6522846741920234,0.6068511428259387,0.6119523691645283,0.0,0.0949145350489712,0.053163717044543084,28484.88552620336,44.38703157771985,13.527777777777779,0.16040500000000002,4812.184866507848,1.2190414004855685,1233.4944988708792,0.5157389806383175,0.6047135908900858,0.6672206540324936,0.0,0.05899268008834285,0.023358345198772973,20694.132105891145,37.76547447057217,13.0,0,1,3,32,1497.8,28.2,29.2,0.7152777777777778,0.1170427623125036
9
- CAT (tuned),14620.521881076362,0.4177007556697469,2269.8595810438947,0.10130234094803403,0.6252212171690145,0.6263034353785987,0.0,0.09729472701692973,0.052955674971925916,28484.88552620336,6.396687248681741,13.958333333333334,0.161705,4812.184866507848,0.12577376100752088,1233.4944988708792,0.07246540449837217,0.6530696602129458,0.6906073359413165,0.0,0.058863950439092316,0.027822328565894415,20694.132105891145,5.269306492183282,14.0,1,2,1,32,1488.2,35.2,32.5,0.7054924242424242,0.1454992812488078
10
- TABPFNV2_GPU (tuned + ensemble),10557.381464538972,107.05828573483008,2800.4111063818664,47.2254791910091,0.48490373301756556,0.5435741752601181,0.2777777777777778,0.0917776117050395,0.07513500024875362,52318.555408411885,3635.7848895640136,13.958333333333334,0.17261500000000002,2277.7389435132345,14.569461973508199,3031.00823805294,27.044389802716765,0.455190744457759,0.5777919166792829,0.0,0.04061436251124223,0.029417287395027428,29915.904357304615,1226.8083686763093,7.0,8,3,2,23,1487.6,29.8,34.7,0.7054924242424242,0.3382689516212349
11
- CAT (default),225.3881851879167,0.2547764916478852,113.28441441630724,0.11944997568510514,0.641469722817324,0.659484163823669,0.0,0.10699265488288708,0.05641613967680766,427.0412955109482,6.664774458427448,14.527777777777779,0.16373,17.732768376668293,0.17626664373609757,5.3143473208136776,0.0720373699728269,0.6679727149434583,0.7200930114073563,0.0,0.06110300628994497,0.025874497523709752,105.75875711549895,5.435542513489766,16.0,1,3,1,31,1477.4,29.5,32.2,0.6925505050505051,0.14547719726985292
12
- GBM (tuned),2757.3124551578803,1.9554996257946817,657.304398009506,0.5137681973731403,0.6801768631137874,0.6740824460484856,0.0,0.1104195696992849,0.07112526333243879,9019.445489261983,34.03980226584826,14.597222222222221,0.17060999999999998,1484.5727322684393,0.5311000559065078,374.34433555986243,0.2150466969054991,0.7198335031990888,0.6756468486633254,0.0,0.055167136392065996,0.031112468640158004,8683.556526223521,16.198247708611177,14.0,0,0,0,36,1474.2,30.2,36.6,0.6909722222222222,0.08121790918946042
13
- XGB (tuned + ensemble),5493.491215199232,6.412096421880487,917.6069037638632,2.6250017135715122,0.667221504669457,0.6703236269477952,0.0,0.11134706183921922,0.07209031540826369,11372.296711293906,142.78657719786145,14.86111111111111,0.165745,1589.3805764291023,2.081376380390591,637.9415221019913,1.2169344189421265,0.7199373388598738,0.7259505715046674,0.0,0.0611042440313741,0.030607277073101638,9210.443722124051,76.49553505940113,13.0,0,0,1,35,1468.1,30.4,28.1,0.6849747474747475,0.09864409133294516
14
- MNCA_GPU (tuned),57657.94978451405,20.4386415688344,5175.01000177568,1.4870657842617447,0.7139051650464947,0.6598180805923327,0.0,0.10801788904290464,0.07238169665437456,84656.4189016225,131.8799800913606,16.36111111111111,0.175815,13573.088971928755,0.5870524644851685,4614.640089198099,0.5156200362715774,0.7789380151687756,0.6741481297678236,0.0,0.06755655006003625,0.04344719805270737,72619.42076015053,27.073934594746163,16.5,1,0,0,35,1437.5,32.9,33.5,0.6508838383838383,0.10546683007675836
15
- MNCA_GPU (tuned + ensemble),57657.94978451405,541.5858152206297,5175.01000177568,37.18030109053553,0.6091755565497485,0.6070494848080898,0.0,0.1079738249244821,0.0824875514497159,84656.4189016225,3366.8719867776967,16.72222222222222,0.19167499999999998,13573.088971928755,13.54958667092853,4614.640089198099,8.395978282094799,0.6004419642795275,0.5800963441226324,0.0,0.07521383093199485,0.039747452446492706,72619.42076015053,536.5448946581089,12.5,0,2,4,30,1427.7,24.9,31.6,0.6426767676767676,0.13635715943004306
16
- XGB (tuned),5493.491215199232,1.3085634729008617,917.6069037638632,0.6530885382876873,0.7222715149155855,0.7103087290569601,0.0,0.11556707073041328,0.07662471890863082,11372.296711293906,29.704007586433832,16.77777777777778,0.16848000000000002,1589.3805764291023,0.36310173670450846,637.9415221019913,0.1716870718901738,0.7533869940641466,0.7531012522750911,0.0,0.07553227424104386,0.0351477420959022,9210.443722124051,11.556301431859715,15.5,0,0,0,36,1428.8,32.3,36.1,0.6414141414141414,0.07269176227897872
17
- TABPFNV2_GPU (tuned),10557.381464538972,3.5877571221487026,2800.4111063818664,1.6323263976196694,0.6037821065431752,0.6296551164283334,0.2777777777777778,0.11748719725537295,0.0903892471054136,52318.555408411885,120.27811182838367,17.305555555555557,0.1868,2277.7389435132345,0.5116564194361368,3031.00823805294,0.5868151874902603,0.6617013701332746,0.651413564581804,0.0,0.08561151707807774,0.03658630335592475,29915.904357304615,28.32255960244214,10.5,1,8,1,26,1414.2,31.2,34.1,0.6294191919191919,0.20211887470493353
18
- TABM_GPU (default),138.894014670893,0.939259964963536,13.258693701508712,0.18837149821625798,0.7187944376125526,0.7305819252435415,0.0,0.1292600200409365,0.09224775627676693,198.59294054356857,12.617551449180137,18.041666666666668,0.175255,25.32820102903578,0.18552133904563056,9.819120211215754,0.1330683562425531,0.8315398727681542,0.7950654471852272,0.0,0.06154427249600947,0.03281090889681732,147.95052750875777,10.814297716965921,16.5,0,0,1,35,1403.5,23.8,29.2,0.6126893939393939,0.08661374604424155
19
- TABPFNV2_GPU (default),10.947083245604126,0.9331302737012321,4.035215198044816,0.47311805105554033,0.636703159615234,0.6872245075140289,0.2777777777777778,0.1270861209281617,0.09962337360619754,56.66111760729146,31.04552226404085,18.47222222222222,0.1886,7.492631395657857,0.30309558312098184,3.3292708959332433,0.32914197487032515,0.777395370386663,0.7325492470622512,0.0,0.07929605277196083,0.034405727147343024,46.89348887212364,18.74748191899635,17.0,4,1,4,27,1394.2,28.0,39.3,0.6029040404040404,0.21964051467131465
20
- NN_TORCH (tuned + ensemble),25121.675461051862,15.268945999498719,2998.2377676701126,3.008285271799925,0.7609802892009417,0.7461165137257577,0.0,0.11924702116888536,0.07969728268752577,59110.75454743922,227.46421879425387,18.77777777777778,0.17071999999999998,8702.23048403528,3.8643554978900485,2372.2159626094954,2.0891774346723144,0.8880043848397714,0.8073663594549582,0.0,0.06782479524801643,0.04484831234958845,46965.794452564674,173.17543399472862,19.0,0,0,0,36,1387.1,28.1,29.5,0.5959595959595959,0.06873630364946683
21
- REALMLP (tuned),86961.40895521553,2.0594779475235643,7053.8901372884375,0.6072534092763734,0.8175152448539208,0.7559562210409045,0.0,0.12322394182013983,0.08533470139842285,149033.04908471802,39.01479821938993,19.26388888888889,0.17271999999999998,30350.410282479395,0.9161458677715726,6446.893200302383,0.46776237641023743,0.8954815082143113,0.7906338583190218,0.0,0.076222073390521,0.04561771439992753,120902.17712034364,34.892936572121684,18.75,0,0,0,36,1377.4,33.7,27.7,0.5849116161616161,0.06030253530176412
22
- EBM (tuned + ensemble),28019.426662537786,1.0875822747195207,2272.8378900325356,0.27343562526003057,0.821456563024862,0.8165703903512577,0.0,0.15463342826689588,0.11176088893587201,26169.487179400727,18.583552688722236,19.444444444444443,0.177485,2278.5297676722207,0.3938958803812663,895.6082584208186,0.2030472330989584,0.9034396658321221,0.8509428943624828,0.0,0.0823298574360965,0.040258063765572304,16406.757135657386,11.254406005139426,19.0,0,0,1,35,1373.4,27.8,35.2,0.5808080808080808,0.07414732902917813
23
- FASTAI (tuned + ensemble),7322.968421160292,18.779504420875032,1297.972449752507,8.192088559474888,0.785417607472602,0.7833111232699402,0.0,0.1490031432620876,0.08536595095231317,19993.18321597409,475.72057536620804,20.51388888888889,0.17986000000000002,2947.7962622510063,11.628821227285597,582.7734793353083,4.517300460862554,0.9845344380551451,0.8198069065455255,0.0,0.08731644523471122,0.054465543302125864,17127.566654953327,460.36404533374946,21.75,0,1,0,35,1350.7,31.1,30.7,0.5565025252525253,0.08293746347213256
24
- MNCA_GPU (default),316.27131853140435,10.342945864171158,16.66944361197823,0.8061333025000202,0.8522725113780977,0.8033578681232494,0.0,0.14526299205359128,0.09098388342703735,268.415346120585,74.8577885297529,21.194444444444443,0.18519,30.626362359523775,0.5374451610777113,14.527288647230225,0.336567721078897,0.9980966808032234,0.8596665365861158,0.0,0.07974632039399476,0.04958848985494747,215.5139276781307,21.914790751288265,22.5,1,0,0,35,1337.2,28.3,34.6,0.5410353535353535,0.0837859844146563
25
- EBM (tuned),28019.426662537786,0.13116049015963518,2272.8378900325356,0.036898209286527296,0.882219325030256,0.8553855811613422,0.0,0.16163826216098914,0.11889561879265102,26169.487179400727,2.281962543329168,22.52777777777778,0.179115,2278.5297676722207,0.04349470535914103,895.6082584208186,0.02459548072380017,1.0,0.8943570495572675,0.0,0.08835084225135731,0.04420635043937865,16406.757135657386,1.2360778780029142,23.5,0,0,0,36,1311.7,26.2,30.8,0.5107323232323232,0.055593137739929466
26
- REALMLP (default),307.19483907590677,2.6816586940376848,22.77049793090805,2.6308630545615657,0.907361708603659,0.8496899282941173,0.0,0.14686744057987944,0.09943126476890098,497.9849494950257,119.73458834741044,24.055555555555557,0.17367,93.51136196984186,2.413340753979153,20.981206012229578,0.7005397999454532,1.0,0.8881513455425256,0.0,0.11770994919600164,0.057919641115812444,400.75291081158025,56.102746119908915,25.0,0,0,0,36,1280.3,31.1,28.2,0.476010101010101,0.048436517149928554
27
- EBM (default),118.50732698676026,0.14401131014765045,8.954596469527505,0.06265208718847394,0.8738538637847371,0.8699357973169751,0.0,0.1703532243519598,0.12258013909243326,115.00830556550737,3.283968403256123,24.055555555555557,0.18057,9.348468089103699,0.062386990918053525,3.724321540044417,0.037089624442184096,1.0,0.9282824912353611,0.0,0.09554880640873625,0.03755874966541475,64.88241965125457,2.244816344727411,24.0,1,0,2,33,1277.1,35.8,24.9,0.476010101010101,0.08609411717814229
28
- XGB (default),11.683355353643865,0.5746506972813311,2.5273290583117265,0.2898449163711814,0.888577046225069,0.8545602306330853,0.0,0.14626308667682397,0.12097652375124089,33.06311547785161,14.058796569155744,24.833333333333332,0.17317,5.340686360994974,0.2856193900108337,1.5761941364945817,0.11450144426278887,1.0,0.9410060237993583,0.0,0.10102883170605775,0.06626116550706775,33.38866095520069,9.412365893361752,24.0,0,0,0,36,1263.8,26.8,29.3,0.4583333333333333,0.05120721873421302
29
- TABDPT_GPU (default),176.01594519710835,69.14393293923803,27.198725725356752,23.388429074834338,0.8100167838345331,0.8002959126184359,0.0,0.15922158223192923,0.1170304072369534,507.4757748221604,1409.9337088973832,24.944444444444443,0.200525,98.94233159224191,28.93788754940033,20.513119835829798,8.53560138835576,1.0,0.9480547222991438,0.0,0.1092338236572048,0.046097964205681456,462.0657004935143,1196.5084355075069,30.0,2,0,3,31,1260.8,31.9,33.9,0.45580808080808083,0.12811852287779754
30
- NN_TORCH (tuned),25121.675461051862,0.8252101601641856,2998.2377676701126,0.18188676084331395,0.9042033228970179,0.8492772637013142,0.0,0.14458099864130583,0.10402676940209679,59110.75454743922,12.21885557115889,25.15277777777778,0.17446499999999998,8702.23048403528,0.23337317837609184,2372.2159626094954,0.14653810958067576,1.0,0.9030726729865997,0.0,0.10379368194908523,0.06014400941853005,46965.794452564674,9.196372470124006,25.5,0,0,0,36,1257.7,29.7,35.8,0.4510732323232323,0.04590739185033665
31
- XT (tuned + ensemble),1031.1996921384775,2.988154384089105,357.27527748345153,1.3397702898173773,0.915145874394795,0.8805972333887039,0.0,0.16523038416167826,0.12115553964497544,4802.300435215387,79.00786738708753,25.25,0.18232500000000001,744.239438480801,1.8136235740449693,182.30061053451453,0.7431041876698922,1.0,0.9451500237846994,0.0,0.09673248817237246,0.0685642717298475,3258.869343201663,69.06723555057565,28.0,0,0,0,36,1255.1,32.9,28.2,0.44886363636363635,0.05226957180005792
32
- FASTAI (tuned),7322.968421160292,1.0651015653286453,1297.972449752507,0.6247631304455121,0.9014896273019105,0.859902106736277,0.0,0.1694950028869433,0.1070255440996003,19993.18321597409,33.55086488448845,25.73611111111111,0.18178,2947.7962622510063,0.8198094805081686,582.7734793353083,0.28744913553656576,1.0,0.9088593404454739,0.0,0.09789639604232409,0.0649858160039401,17127.566654953327,27.88231049239245,25.5,0,0,0,36,1244.8,35.2,27.7,0.4378156565656566,0.049159578672548734
33
- RF (tuned + ensemble),2044.775237460416,2.3872891816092126,416.2783457096328,1.255027602569144,0.9334558803432285,0.9040062206127278,0.0,0.17454870740132913,0.13450116486800084,5642.134899127388,70.94429168864653,27.583333333333332,0.178315,852.5537050988939,1.9029027620951335,260.0125674942402,0.7428875097152683,1.0,0.9837131231084293,0.0,0.11236159795018158,0.07590657171377442,4493.239522943328,63.634247370273194,29.5,0,1,0,35,1205.8,25.5,32.5,0.3958333333333333,0.056882306832005125
34
- GBM (default),6.758794430523743,0.5789695977428813,2.490182745714093,0.15718383308630232,0.9383512113871156,0.9113955280706146,0.0,0.16095596388481948,0.11966195291867884,33.5255932823166,11.26499493738597,27.805555555555557,0.18472,5.414635124471452,0.25708606508043075,1.4105395256066293,0.11924102542301018,1.0,0.9556137019600007,0.0,0.11742663255155822,0.07438366295477852,28.347402616029974,6.917663985246303,28.0,0,0,0,36,1200.3,30.6,29.1,0.3907828282828283,0.03876399278363048
35
- XT (tuned),1031.1996921384775,0.29890986717777485,357.27527748345153,0.16424540531360832,0.9435242908687488,0.910514407220008,0.0,0.1790239805568848,0.13118779231114652,4802.300435215387,8.701857547554141,27.944444444444443,0.18223,744.239438480801,0.18342396948072645,182.30061053451453,0.07494392763268541,1.0,0.9738454334228958,0.0,0.10725622533000512,0.07275368947647169,3258.869343201663,8.07141276727868,30.5,0,0,0,36,1195.2,29.6,34.4,0.38762626262626265,0.041800506882685134
36
- RF (tuned),2044.775237460416,0.23278873113938317,416.2783457096328,0.1507722695594024,0.9558991992102567,0.9248235247053782,0.0,0.1867739997900619,0.1463725402069691,5642.134899127388,7.55687197585497,29.88888888888889,0.18139,852.5537050988939,0.17230602105458576,260.0125674942402,0.06758631242886748,1.0,0.9975246590645634,0.0,0.12445981137673867,0.08415493384155445,4493.239522943328,6.782207740325264,32.0,0,0,1,35,1151.1,28.7,31.7,0.3434343434343434,0.04379590025883977
37
- NN_TORCH (default),48.70578088156971,0.5936917389616555,11.125118656665736,0.17673793806000127,0.9755403816226027,0.9495104680314707,0.0,0.19710881774212397,0.1446123174347958,163.6050788464746,10.908192435522393,31.875,0.180355,25.35211862458123,0.2432508071263631,6.0317417768089925,0.12717011148259783,1.0,0.9983912483912485,0.0,0.142508632176782,0.09058242754644441,147.85412694112986,8.619821917518276,33.0,0,0,0,36,1101.9,36.6,35.5,0.29829545454545453,0.03438131417272046
38
- FASTAI (default),31.09969735491423,1.10650484436824,4.683882685740998,0.49602131438348224,0.9675898920469699,0.9386040117988751,0.0,0.22259183226174764,0.16760850439725888,78.50334203533181,29.20329942994179,32.97222222222222,0.19183499999999998,12.376497785250347,0.7602158255047269,2.8056596594025227,0.32364197153238683,1.0,1.0,0.0,0.16139709053294848,0.10101015982783891,62.830982755166104,26.31473180904403,36.0,0,0,0,36,1070.8,35.0,34.4,0.27335858585858586,0.03372359578898876
39
- LR (tuned + ensemble),247.81690773551847,1.5417047552120537,86.9527487524539,0.32565721661614494,0.9560243682409655,0.9498639619907061,0.0,0.2795481968011861,0.22419397642880423,1145.6529776499187,22.80693330823688,34.44444444444444,0.20382,171.4822693798277,0.2983522944980197,44.45837271402539,0.19711479228248058,1.0,1.0,0.0,0.2029910210337117,0.129932605346946,713.1690352739483,13.25530093456624,37.5,0,0,1,35,1030.6,25.6,33.9,0.2398989898989899,0.040119786151177166
40
- RF (default),3.0601362299771955,0.1389405298380204,0.4197356009776281,0.0653398591441877,0.9966245196912615,0.9787377451249206,0.0,0.24439813445426953,0.23332088687005598,6.380264980433452,3.8863601848700204,35.31944444444444,0.21025,1.1198331514994302,0.08397722244262695,0.3277067685609757,0.03542500892500126,1.0,1.0,0.0,0.17586669886220047,0.11692775501298655,5.654732996519019,3.4893450212553976,37.25,0,0,0,36,1000.0,0.0,0.0,0.22001262626262627,0.02955714259016535
41
- LR (tuned),247.81690773551847,0.4449969635333544,86.9527487524539,0.10396867681281166,0.9712113277866309,0.9577407266535012,0.0,0.2890832289893475,0.23359851930175468,1145.6529776499187,6.67768937474542,35.611111111111114,0.20357,171.4822693798277,0.11908173561096191,44.45837271402539,0.07001089403664298,1.0,1.0,0.0,0.209915797201436,0.13996577713587863,713.1690352739483,4.422167155622043,38.0,0,0,0,36,991.9,32.6,31.2,0.21338383838383837,0.03326921712538259
42
- LR (default),6.169420995020572,0.45878614010634244,2.106396664251635,0.12050053609932035,0.9800746746641885,0.9638069408572796,0.0,0.29953612908715793,0.25915776832734055,29.155480122785303,7.705503358920759,35.916666666666664,0.20875,5.325402127371894,0.12779696782430014,1.4313534079421033,0.0896510462814927,1.0,1.0,0.0,0.2098231228117865,0.14283438851684813,21.98095065107635,4.735441569338942,39.5,0,0,1,35,983.9,29.9,35.6,0.20643939393939395,0.036743386171964994
43
- XT (default),1.8043978815461381,0.17975940917745048,0.3713595805715374,0.06954273446410252,0.9890619891770874,0.97405903165696,0.0,0.2693928217092323,0.2625924038360423,5.400149437442951,4.422216908920569,37.5,0.21292,0.9855960739983453,0.08559976683722602,0.24198385110028492,0.03905757784211604,1.0,1.0,0.0,0.18424442460299273,0.1366333464189622,4.712255061768779,3.9447928104085292,39.0,0,0,0,36,923.2,30.1,39.7,0.17045454545454544,0.028951201937937068
44
- KNN (tuned + ensemble),160.6654753932982,12.105770299979199,6.688012200811168,0.6510287601474662,1.0,0.9960874882015757,0.1111111111111111,0.49677011612086325,0.6120942087885226,74.9110387789707,80.9066319119832,41.80555555555556,0.31954499999999997,8.516376826498243,0.18374058273103502,2.7901906181635594,0.18360265549810623,1.0,1.0,0.0,0.4552261138058871,0.666412349042599,57.50317350047459,12.487074071232104,43.0,0,0,0,36,686.0,30.5,47.7,0.0726010101010101,0.024106030242002884
45
- KNN (tuned),160.6654753932982,1.8655599888460135,6.688012200811168,0.10631861792797924,1.0,0.9975290105933559,0.1111111111111111,0.515322871939524,0.6579120910305495,74.9110387789707,12.961828932017745,42.80555555555556,0.32433,8.516376826498243,0.07810062832302517,2.7901906181635594,0.039402452723266784,1.0,1.0,0.0,0.4955223463378593,0.7068928996407389,57.50317350047459,2.342964973116946,44.0,0,0,0,36,597.7,38.9,47.3,0.049873737373737376,0.02344196684773788
46
- KNN (default),0.8106582238350385,0.22655490967962477,0.11174909219233083,0.031188125017345747,1.0,1.0,0.1111111111111111,0.59998652527636,0.9596846467502638,1.0058498096362567,2.4347704716843976,44.208333333333336,0.392575,0.22842825783623588,0.035103811158074275,0.06799989431884934,0.01944144969012079,1.0,1.0,0.0,0.619455920315494,1.0,1.0,1.2342542936555674,45.0,0,0,0,36,410.2,56.2,76.0,0.017992424242424244,0.0226951547710286
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/tabicl-imputed/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:7527eec600ee276e2a1b9d764892cb03aa40c030e35de80ab944078074f13b90
3
- size 460639
 
 
 
 
data/tabicl-imputed/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:b635b6f74d9db45745e652d477b6e36f190b5bcbb05709c76491e4cf5186282f
3
- size 238622
 
 
 
 
data/tabicl-imputed/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:ccec1c46eecc638d633f173ba42e1d538fc17d0a46565755f23b2b6f22540e0a
3
- size 220550
 
 
 
 
data/tabpfn-imputed/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:bb556e905204b2c56a0a1abab914cbe998787de8932bcffa4225cae2817b7c9e
3
- size 340328
 
 
 
 
data/tabpfn-imputed/leaderboard.tex DELETED
@@ -1,53 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- TabPFNv2 (T+E) & \textcolor{gold}{\textbf{1720${}_{-31,+40}$}} & \textcolor{gold}{\textbf{0.725}} & \textcolor{gold}{\textbf{5.8}} & \textcolor{gold}{\textbf{2.1}} & \textcolor{gold}{\textbf{11}} & \textcolor{gold}{\textbf{3.8\%}} & 3899.42 & 55.83 \\
8
- RealMLP (T+E) & \textcolor{silver}{\textbf{1603${}_{-32,+38}$}} & \textcolor{bronze}{\textbf{0.524}} & \textcolor{silver}{\textbf{9.2}} & 5.5 & 0 & \textcolor{bronze}{\textbf{7.6\%}} & 7141.94 & 12.29 \\
9
- TabM (T+E) & \textcolor{bronze}{\textbf{1583${}_{-35,+31}$}} & 0.492 & \textcolor{bronze}{\textbf{9.8}} & 4.9 & 2 & 7.9\% & 3372.56 & 1.66 \\
10
- AutoGluon 1.3 (4h) & 1560${}_{-35,+30}$ & 0.507 & 10.7 & 4.7 & 2 & 8.2\% & 2727.51 & 3.24 \\
11
- TabPFNv2 (T) & 1558${}_{-33,+34}$ & \textcolor{silver}{\textbf{0.545}} & 10.8 & \textcolor{bronze}{\textbf{4.0}} & 1 & \textcolor{silver}{\textbf{7.1\%}} & 3899.42 & 0.98 \\
12
- LightGBM (T+E) & 1521${}_{-37,+26}$ & 0.376 & 12.4 & 7.4 & 1 & 10.4\% & 771.57 & 2.49 \\
13
- TabPFNv2 (D) & 1501${}_{-39,+29}$ & 0.490 & 13.2 & \textcolor{silver}{\textbf{3.9}} & \textcolor{bronze}{\textbf{4}} & 8.7\% & 4.22 & 0.55 \\
14
- CatBoost (T+E) & 1481${}_{-32,+31}$ & 0.354 & 14.1 & 9.9 & 0 & 10.1\% & 2034.85 & 0.80 \\
15
- TabICL (D) & 1474${}_{-31,+27}$ & 0.432 & 14.4 & 4.1 & \textcolor{bronze}{\textbf{4}} & 8.9\% & 7.27 & 1.64 \\
16
- TabM (T) & 1472${}_{-28,+26}$ & 0.390 & 14.5 & 8.3 & 0 & 9.1\% & 3372.56 & 0.21 \\
17
- CatBoost (T) & 1460${}_{-29,+28}$ & 0.327 & 15.0 & 9.6 & 0 & 10.3\% & 2034.85 & 0.10 \\
18
- LightGBM (T) & 1453${}_{-33,+28}$ & 0.303 & 15.3 & 13.4 & 0 & 10.9\% & 771.57 & 0.32 \\
19
- XGBoost (T+E) & 1427${}_{-29,+26}$ & 0.281 & 16.6 & 13.5 & 0 & 11.5\% & 828.74 & 2.31 \\
20
- CatBoost (D) & 1419${}_{-29,+38}$ & 0.272 & 16.7 & 10.3 & 0 & 11.9\% & 8.51 & 0.12 \\
21
- TabM (D) & 1400${}_{-28,+29}$ & 0.302 & 17.8 & 11.6 & 0 & 12.1\% & 12.24 & 0.15 \\
22
- ModernNCA (T) & 1399${}_{-32,+33}$ & 0.235 & 17.7 & 9.9 & 1 & 11.0\% & 6147.69 & 0.48 \\
23
- XGBoost (T) & 1390${}_{-25,+24}$ & 0.241 & 18.1 & 16.0 & 0 & 11.8\% & 828.74 & 0.34 \\
24
- ModernNCA (T+E) & 1390${}_{-28,+32}$ & 0.332 & 18.2 & 8.5 & 0 & 11.4\% & 6147.69 & 8.15 \\
25
- TabDPT (D) & 1379${}_{-27,+29}$ & 0.378 & 18.6 & 4.2 & \textcolor{silver}{\textbf{5}} & 12.3\% & 28.84 & 9.01 \\
26
- RealMLP (T) & 1366${}_{-25,+27}$ & 0.206 & 19.3 & 16.1 & 0 & 11.1\% & 7141.94 & 0.71 \\
27
- EBM (T+E) & 1343${}_{-32,+41}$ & 0.192 & 20.5 & 11.8 & 0 & 15.2\% & 1331.68 & 0.21 \\
28
- ModernNCA (D) & 1339${}_{-31,+32}$ & 0.160 & 20.6 & 11.3 & 1 & 14.1\% & 16.16 & 0.31 \\
29
- TorchMLP (T+E) & 1335${}_{-33,+27}$ & 0.197 & 20.8 & 15.9 & 0 & 12.4\% & 3704.30 & 2.07 \\
30
- FastaiMLP (T+E) & 1304${}_{-32,+29}$ & 0.188 & 22.3 & 12.8 & 0 & 15.1\% & 1459.62 & 8.06 \\
31
- EBM (T) & 1285${}_{-33,+31}$ & 0.136 & 23.2 & 16.1 & 0 & 15.9\% & 1331.68 & 0.02 \\
32
- ExtraTrees (T+E) & 1271${}_{-29,+26}$ & 0.133 & 23.8 & 15.3 & 0 & 16.7\% & 416.39 & 1.39 \\
33
- RealMLP (D) & 1271${}_{-31,+26}$ & 0.106 & 23.9 & 20.4 & 0 & 13.7\% & 22.96 & 1.87 \\
34
- EBM (D) & 1261${}_{-34,+21}$ & 0.148 & 24.4 & 11.2 & 1 & 16.7\% & 5.89 & 0.07 \\
35
- TorchMLP (T) & 1229${}_{-30,+34}$ & 0.107 & 25.8 & 22.3 & 0 & 14.5\% & 3704.30 & 0.14 \\
36
- ExtraTrees (T) & 1227${}_{-40,+29}$ & 0.102 & 26.0 & 19.9 & 0 & 17.6\% & 416.39 & 0.18 \\
37
- FastaiMLP (T) & 1220${}_{-33,+32}$ & 0.094 & 26.3 & 20.6 & 0 & 16.4\% & 1459.62 & 0.89 \\
38
- XGBoost (D) & 1192${}_{-27,+32}$ & 0.030 & 27.4 & 25.5 & 0 & 15.4\% & 3.05 & 0.24 \\
39
- LightGBM (D) & 1171${}_{-30,+25}$ & 0.038 & 28.5 & 26.8 & 0 & 15.9\% & 3.39 & 0.16 \\
40
- RandomForest (T+E) & 1167${}_{-30,+29}$ & 0.071 & 28.6 & 22.1 & 0 & 18.1\% & 572.67 & 1.42 \\
41
- RandomForest (T) & 1107${}_{-29,+39}$ & 0.036 & 31.0 & 28.2 & 0 & 19.2\% & 572.67 & 0.14 \\
42
- TorchMLP (D) & 1069${}_{-27,+42}$ & 0.029 & 32.5 & 29.3 & 0 & 19.6\% & 11.82 & 0.15 \\
43
- FastaiMLP (D) & 1025${}_{-33,+31}$ & 0.020 & 34.0 & 31.7 & 0 & 22.1\% & 5.18 & 0.65 \\
44
- ExtraTrees (D) & 1008${}_{-30,+34}$ & 0.031 & 34.5 & 30.4 & 0 & 24.6\% & 0.42 & 0.08 \\
45
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.004 & 34.8 & 33.2 & 0 & 23.5\% & 0.47 & 0.07 \\
46
- Linear (T+E) & 972${}_{-29,+31}$ & 0.044 & 35.7 & 25.3 & 0 & 29.7\% & 97.00 & 0.22 \\
47
- Linear (T) & 941${}_{-35,+36}$ & 0.029 & 36.7 & 30.6 & 0 & 30.2\% & 97.00 & 0.09 \\
48
- Linear (D) & 920${}_{-42,+28}$ & 0.020 & 37.2 & 27.5 & 0 & 31.7\% & 2.99 & 0.10 \\
49
- KNN (T+E) & 692${}_{-50,+41}$ & 0.000 & 41.5 & 41.2 & 0 & 46.0\% & 2.80 & 0.18 \\
50
- KNN (T) & 608${}_{-48,+43}$ & 0.000 & 42.6 & 42.5 & 0 & 47.8\% & 2.80 & 0.04 \\
51
- KNN (D) & 372${}_{-105,+65}$ & 0.000 & 44.3 & 44.2 & 0 & 55.8\% & 0.07 & 0.03 \\
52
- \bottomrule
53
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/tabpfn-imputed/tabarena_leaderboard.csv DELETED
@@ -1,46 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- TABPFNV2_GPU (tuned + ensemble),14425.017745771953,147.33083187377815,4401.396029095723,73.66242671660264,0.27516860637025115,0.3603555375107131,0.0,0.038105234440633594,0.026782950269144352,97206.53847125142,5748.512821917254,5.7727272727272725,0.22151,6554.713698530197,48.78112569650014,3899.4164285155534,55.8334682198027,0.17987789052252875,0.3534601056500293,0.0,0.014027742147710298,0.009805464057384642,76922.9404189982,3038.197671565286,3.0,11,4,3,15,1719.8,39.6,30.3,0.8915289256198347,0.4801184377951626
3
- REALMLP (tuned + ensemble),21372.515179977676,15.681909071235143,7576.910333884578,13.311283840245572,0.4758709884053711,0.5150335164004382,0.0,0.07574206256804669,0.045271823351844084,149682.80929987264,797.3567089859678,9.196969696969697,0.23199,11845.681601381302,14.194016642040676,7141.940159399529,12.293594768542036,0.44270236511716554,0.4682312902019519,0.0,0.03826066759873625,0.02687212859881103,129127.09723567181,690.7149727636221,7.0,0,4,2,27,1603.2,37.3,32.0,0.8137052341597796,0.18165850096261016
4
- TABM_GPU (tuned + ensemble),7759.403633462299,2.1001035646156025,4063.8948551919407,2.0262676290994976,0.5079988964437289,0.5440563382781555,0.0,0.0790871342651138,0.05662145831204036,68994.91604679843,112.78737728828092,9.848484848484848,0.22996,6619.307261109352,1.565106577343411,3372.559747313985,1.6579582265448953,0.4651893659350262,0.5420701690271631,0.0,0.038378144006785075,0.03425172068662737,47614.825535817596,107.26839105814781,9.0,2,1,2,28,1582.6,30.1,34.9,0.7988980716253443,0.20563982440305573
5
- AutoGluon 1.3 (4h),6055.818895130286,4.270678644549565,3922.7285559191146,3.7248760797186677,0.49289366855599703,0.5145998717655849,0.0,0.08159774704016189,0.05211281940109976,55071.75177189554,226.07763766376502,10.727272727272727,0.23252,5245.15874774456,2.643711381488376,2727.5131652494383,3.239980099911348,0.4404834188453446,0.5458460772361734,0.0,0.03289499571493004,0.03718969914596063,42990.839716832954,133.93997628187762,8.0,2,3,0,28,1560.5,30.0,34.1,0.7789256198347108,0.21057530939150743
6
- TABPFNV2_GPU (tuned),14425.017745771953,5.209287579613502,4401.396029095723,2.5426283197700723,0.4548654021522606,0.4934476784128812,0.0,0.07053987055307652,0.05344000271928163,97206.53847125142,204.9255427422306,10.787878787878787,0.22735,6554.713698530197,0.8280300378799439,3899.4164285155534,0.9826616035428936,0.36706067616596666,0.5176267485065693,0.0,0.03437211181910249,0.021856445988952963,76922.9404189982,51.81666634218519,6.0,1,9,2,21,1558.5,33.1,32.2,0.7775482093663911,0.2528154074311644
7
- GBM (tuned + ensemble),1415.9841766067627,4.799208115006135,910.5198965732433,4.069284327482318,0.6237089744532728,0.6525581177848765,0.0,0.10428404160158442,0.06311285464696612,13668.933422648086,267.51058203721857,12.378787878787879,0.22857,1305.722465435664,2.759434594048394,771.5692555555095,2.4904788987789197,0.6590330537833339,0.6929140073332589,0.0,0.04583211069062432,0.04469640874832902,13185.517460723013,113.47473834277966,11.0,1,0,1,31,1521.2,25.7,36.4,0.7413911845730028,0.13499068874884596
8
- TABPFNV2_GPU (default),11.52095359335042,1.0633954972129076,6.06940244179637,0.672221391108446,0.5098348478838993,0.5763734971282789,0.0,0.08650643788287256,0.07123949707004915,98.45527177302218,43.21321073170591,13.181818181818182,0.22839,8.920613225301107,0.4424108028411865,4.216795518748306,0.554313340790968,0.35267762555284565,0.4964096417837951,0.0,0.046436046436047484,0.024781083502458036,86.18639524525906,32.62493249187355,5.0,4,1,4,24,1500.9,28.3,39.0,0.7231404958677686,0.25894171083751066
9
- CAT (tuned + ensemble),6111.719856242539,1.006158811716921,3790.183958691414,0.8907615430657749,0.6455557540407405,0.6592143082058544,0.0,0.10086936448956937,0.06664334011743277,60543.81785886913,53.55072795645876,14.06060606060606,0.23306,3546.8591198126474,0.7182260619269477,2034.851316438205,0.804742177327474,0.6050990092049177,0.6841715460836575,0.0,0.057913957490063894,0.05601309827846158,22945.70625086803,41.43902270876451,13.0,0,0,2,31,1481.0,30.1,31.9,0.703168044077135,0.10100258106957156
10
- TABICL_GPU (default),17.601916466818917,2.220749478067212,8.223267143306801,1.62619082060654,0.5676760927563484,0.6167304326779658,0.21212121212121213,0.0885594645099666,0.06951992440652786,118.07631364935412,95.53326061925603,14.378787878787879,0.21659,12.685983737309774,1.1608605914645724,7.267564825342236,1.640227848921365,0.5876984027531944,0.7272892322270896,0.0,0.04180356358509396,0.026535607179780867,110.9966550885031,89.55280278440337,10.0,4,3,1,25,1473.7,26.6,30.8,0.6959366391184573,0.24335886946636603
11
- TABM_GPU (tuned),7759.403633462299,0.20657940898278745,4063.8948551919407,0.21768183700894184,0.609895479329719,0.6093055261596303,0.0,0.09090766997883803,0.07407083534637013,68994.91604679843,11.66689658584399,14.469696969696969,0.23179,6619.307261109352,0.18380210134718153,3372.559747313985,0.207815816005071,0.5598513637019573,0.6254805836016907,0.0,0.04895084966574903,0.04210334551764721,47614.825535817596,11.061181528720304,14.0,0,2,1,30,1471.9,25.4,27.7,0.693870523415978,0.12042284667296002
12
- CAT (tuned),6111.719856242539,0.1329929079672303,3790.183958691414,0.13532680535635316,0.6728787851175252,0.6792836442667326,0.0,0.10338894489993479,0.06791309035479874,60543.81785886913,7.594073997033244,14.969696969696969,0.22837,3546.8591198126474,0.09593041737874348,2034.851316438205,0.09676511402452573,0.647597639226874,0.6911120328445807,0.0,0.05854822331192988,0.05607775996232369,22945.70625086803,5.6651414641322715,14.0,0,1,1,31,1460.5,27.2,28.4,0.6825068870523416,0.10441246932104374
13
- GBM (tuned),1415.9841766067627,0.7482920802402174,910.5198965732433,0.7792669151777867,0.6969993606145813,0.6924613027004218,0.0,0.10949149242597529,0.07146136760763892,13668.933422648086,44.04955699034316,15.348484848484848,0.23047,1305.722465435664,0.3891807476679484,771.5692555555095,0.32246047162149255,0.7229707836237061,0.6960725352902953,0.0,0.05089516062517585,0.0589344586982637,13185.517460723013,17.561755180687065,15.0,0,0,0,33,1453.0,27.4,32.4,0.6738980716253443,0.07464942620240468
14
- XGB (tuned + ensemble),1916.6055530242247,2.961519778778256,1177.527465794311,3.6728650211989002,0.7185677192768513,0.722148041893201,0.0,0.11542292686657665,0.07837696956248971,16012.343458246229,184.54969165931055,16.560606060606062,0.22933,1391.6566625965966,1.6547198295593262,828.736683312722,2.3118448001769525,0.764137252094651,0.7358430401520624,0.0,0.060774906477773616,0.059085857523070544,12341.448622348875,104.93870139682326,15.0,0,0,0,33,1426.6,25.3,28.7,0.6463498622589532,0.07391568853347559
15
- CAT (default),152.82380405130613,0.14037318871880222,127.11927189400713,0.16217538557702282,0.7280864974283284,0.7512406750981568,0.0,0.11924475836986947,0.07620192670528358,502.113809148077,8.351109219795852,16.696969696969695,0.22377,11.973733305931091,0.15812701649136013,8.50783362735807,0.12332610453035381,0.8380223338819129,0.8033087154025523,0.0,0.05487383563224202,0.03792057600443833,113.47556087857672,7.212822094475863,17.0,0,2,0,31,1418.8,37.7,29.0,0.6432506887052342,0.09678993813551463
16
- MNCA_GPU (tuned),10961.64198641737,0.5195780056494254,6318.399005594662,0.47834981044798197,0.7647328075370147,0.6898374034753542,0.0,0.1103572864806074,0.0800405306609752,105260.96546626008,25.986174703676685,17.742424242424242,0.23735,10053.083413600922,0.4133593638737996,6147.690948218725,0.484748090925306,0.8482811842457822,0.7606311221156988,0.0,0.06726118482226873,0.07061019028987191,89084.88026764008,22.392018959457452,17.0,1,0,0,32,1399.0,32.5,31.2,0.6194903581267218,0.10140865667548635
17
- TABM_GPU (default),28.548644328920126,0.1798720349366416,18.542173393427806,0.19723712265145438,0.6977749862118511,0.7149920612365777,0.0,0.12066262568276763,0.09211746164016242,318.94007895221296,9.774328855022276,17.78787878787879,0.2315,20.942287389437357,0.15928708182440865,12.243907458197643,0.15399858651571716,0.6569935308526149,0.6827665147004843,0.0,0.0643765592382024,0.04493622716672658,186.77830789084513,9.305311172432566,16.0,0,0,1,32,1399.5,28.3,27.9,0.6184573002754821,0.08617602260911217
18
- XGB (tuned),1916.6055530242247,0.652148759324944,1177.527465794311,0.8866854120458099,0.7588074126661332,0.7461617910569237,0.0,0.11776030589331012,0.0818292044946032,16012.343458246229,38.50908627958101,18.09090909090909,0.23215,1391.6566625965966,0.25174130333794487,828.736683312722,0.3364459349309477,0.8235105262706657,0.7756241929814622,0.0,0.06701911684547046,0.05575460601692295,12341.448622348875,17.273530689905304,17.0,0,0,0,33,1390.5,23.2,24.3,0.6115702479338843,0.06266615528226868
19
- MNCA_GPU (tuned + ensemble),10961.64198641737,10.701994556690307,6318.399005594662,8.75762576226945,0.6676135227367788,0.6534477080949888,0.0,0.11364677800622328,0.08762387807842323,105260.96546626008,520.9142587851533,18.151515151515152,0.23071,10053.083413600922,8.386180957158407,6147.690948218725,8.148513113458952,0.7048964433216691,0.7058737809385717,0.0,0.07494103959274867,0.05038781311089743,89084.88026764008,418.52804655660253,15.0,0,1,4,28,1389.5,31.3,27.3,0.6101928374655647,0.11804912964918021
20
- TABDPT_GPU (default),70.92158105132556,22.39591003114527,34.81759719100062,31.279468309323132,0.6222872639977844,0.6468633419407165,0.0,0.12299859864430328,0.07605368916267982,622.0717388752522,1500.5607209171305,18.636363636363637,0.22801,66.73408037026724,21.411699827512106,28.844939328856388,9.008305936432574,0.7018399153398434,0.7397102470891475,0.0,0.060533105776061635,0.04402125832146232,592.5429859587771,1255.434427440434,15.0,5,0,3,25,1378.9,28.6,26.7,0.5991735537190083,0.23623457185050586
21
- REALMLP (tuned),21372.515179977676,0.7507762807788271,7576.910333884578,0.7684449786885911,0.7939247948511633,0.7300163658944212,0.0,0.11091064673017946,0.08420419305125372,149682.80929987264,40.78238583214569,19.272727272727273,0.23636,11845.681601381302,0.6823460261027018,7141.940159399529,0.7133444196440386,0.8902891579003764,0.7867023319953254,0.0,0.0725920001334972,0.0601983861857661,129127.09723567181,34.94993703848655,19.0,0,0,0,33,1366.1,27.0,24.2,0.5847107438016529,0.06198998601813431
22
- EBM (tuned + ensemble),3334.2171648572994,0.32795600762672295,2025.6302479633398,0.2872227792040727,0.8075157818891786,0.8174517621532933,0.0,0.15173080345823362,0.13052482131305762,30754.382185448365,16.68266150462009,20.454545454545453,0.2332,1953.8706483443577,0.2473998334672716,1331.6775166450918,0.20701186245225042,0.9152208832034938,0.8754163899058429,0.0,0.08428901213465023,0.06235962049926,20856.10396834409,10.735277156769923,20.0,0,1,1,31,1343.2,40.7,31.3,0.5578512396694215,0.08476526991548487
23
- MNCA_GPU (default),29.288330355718077,0.40948904010181875,16.590279085679306,0.35807771432707775,0.8402594157175141,0.7888402529369766,0.0,0.1406153506333289,0.09170738554084455,276.94632919034655,19.94068662634551,20.606060606060606,0.22862,25.066760566499497,0.27276016076405846,16.16104653455061,0.3065299705640804,1.0,0.837956478652546,0.0,0.07623534801441778,0.06857639602487717,228.23494043552506,15.69192319835031,23.0,1,0,0,32,1338.8,31.2,30.5,0.5544077134986226,0.08828110124295789
24
- NN_TORCH (tuned + ensemble),9308.194376095618,3.24606818494572,4341.07923497529,2.933380560418068,0.802657382706495,0.7817195773599878,0.0,0.12378813274534023,0.09334425293059885,82898.10974181342,162.20152509802003,20.848484848484848,0.22969,7022.24924369653,2.462759764989217,3704.2987009192075,2.0735716422398887,0.930412134618256,0.8720250490602853,0.0,0.06940821370965833,0.07002611736954287,61441.834728019065,130.8351426094471,21.0,0,0,0,33,1335.3,26.6,32.9,0.5488980716253443,0.06285775142221346
25
- FASTAI (tuned + ensemble),2586.283754560923,8.835338537861603,1679.575121937614,9.999499001724324,0.8119228672635717,0.8138716785600797,0.0,0.15059766833714266,0.11070319517972349,25493.071594867495,506.3190878191587,22.303030303030305,0.23902,2267.9460870583853,8.057986391915215,1459.621189354467,8.056269308662202,1.0,0.9463301969020725,0.0,0.08142732439291223,0.07668125691859373,23829.87732673067,529.4584115089291,25.0,0,1,0,32,1304.1,28.9,31.8,0.5158402203856749,0.07841733762965925
26
- EBM (tuned),3334.2171648572994,0.039938361395890465,2025.6302479633398,0.03842942493274479,0.8642304432135278,0.850758214011235,0.0,0.15904602009171812,0.14032650671114255,30754.382185448365,2.017057965351221,23.196969696969695,0.23722,1953.8706483443577,0.031091478135850694,1331.6775166450918,0.0236834002099177,1.0,0.9261802748520466,0.0,0.08863297174906726,0.06750589165199133,20856.10396834409,1.153393324267757,24.5,0,0,1,32,1285.3,30.9,32.6,0.49552341597796146,0.062143899880541004
27
- XT (tuned + ensemble),703.3360121879513,1.4757844792471992,499.8225911726898,1.6427389690079335,0.8674363866200484,0.8492136265596752,0.0,0.16715313828414352,0.11801888702572687,7431.679531233592,85.61436209689451,23.818181818181817,0.22915,666.2221839348475,1.1989427142673068,416.3888649592797,1.3925002488586609,1.0,0.9381075443148919,0.0,0.0949585994043769,0.08051595297351705,5771.537607935009,73.42784180878955,28.0,0,0,0,33,1271.0,25.6,28.2,0.48140495867768596,0.06552519849785002
28
- REALMLP (default),61.55620003902551,2.199364337054166,23.766190232911814,3.888303606152853,0.8938686286103368,0.8414605427842325,0.0,0.13659547863571214,0.09916404744497234,472.7426363451822,178.7306059988246,23.90909090909091,0.24019,34.34356644153595,2.147673739327325,22.956048451914754,1.874601753285076,1.0,0.8840139535607087,0.0,0.09824845633171009,0.07835768789261277,423.03198035956245,122.65530136908377,24.0,0,0,0,33,1270.9,25.8,30.3,0.4793388429752066,0.0489327392838188
29
- EBM (default),11.36624160479215,0.05860419851360899,6.793468845812911,0.0773696538255326,0.8522431620799785,0.8610325368892559,0.0,0.16705792167105993,0.14403163190193527,110.45760612579903,3.6748575166769917,24.424242424242426,0.2295,8.053060743543837,0.04657702445983887,5.893546446579368,0.07326012604396624,1.0,0.9402092401503581,0.0,0.09424073545770384,0.06816679495540655,98.43282840770306,3.379332756535156,24.0,1,0,2,30,1261.1,20.7,33.7,0.46763085399449034,0.08937452107274817
30
- NN_TORCH (tuned),9308.194376095618,0.17523537565160682,4341.07923497529,0.18444769841251837,0.8925933517820612,0.8525330732814086,0.0,0.14530737623012074,0.11490712279864919,82898.10974181342,9.391699534826044,25.833333333333332,0.23735,7022.24924369653,0.12440276145935059,3704.2987009192075,0.1432880461215973,1.0,0.9213694720504713,0.0,0.10352602970716274,0.09298739144390704,61441.834728019065,8.032002608597933,27.0,0,0,0,33,1228.9,33.4,30.0,0.4356060606060606,0.04493300604891883
31
- XT (tuned),703.3360121879513,0.18311701848450734,499.8225911726898,0.22461498391827245,0.89799267281555,0.8757281942517654,0.0,0.17646473015198452,0.12710160959027111,7431.679531233592,11.223370223330804,26.0,0.23322,666.2221839348475,0.15195075670878092,416.3888649592797,0.1793043116994795,1.0,0.9650314344860238,0.0,0.10675112745769633,0.08518673798205481,5771.537607935009,9.73206235571111,30.0,0,0,0,33,1226.9,28.3,39.9,0.4318181818181818,0.05026967478648974
32
- FASTAI (tuned),2586.283754560923,0.6978744876103771,1679.575121937614,0.8367693422775803,0.9062347004381547,0.8675625596567081,0.0,0.16356019930992516,0.12882088540610667,25493.071594867495,42.13403882164382,26.257575757575758,0.24143,2267.9460870583853,0.7035322189331055,1459.621189354467,0.8899622596800327,1.0,0.9669791006893959,0.0,0.09698817844994467,0.08330206169020851,23829.87732673067,41.63294608315149,27.0,0,0,0,33,1220.1,31.2,32.9,0.4259641873278237,0.048489209073481906
33
- XGB (default),4.978699986862414,0.2737259887284301,3.296637309611259,0.3881253273013503,0.9697851500357809,0.9191109538180672,0.0,0.15400703896734852,0.13025661123342613,47.96472220742737,17.685235315861274,27.439393939393938,0.24149,4.367259449428982,0.2072049856185913,3.054087114292606,0.2414376437664032,1.0,0.9595656785141815,0.0,0.11112425419154659,0.0975503136316605,45.44297154665395,12.112444004915941,27.0,0,0,0,33,1192.4,31.2,27.0,0.3991046831955923,0.039206793811802795
34
- GBM (default),5.590238305133601,0.278970559357794,3.7124463506088845,0.2797426445645519,0.9622682225240324,0.9201123261076833,0.0,0.1586723338623166,0.1217633613331393,57.36690638815629,17.058911558436503,28.454545454545453,0.24755,5.0517880121866865,0.22849366399976942,3.3870700945456824,0.15827762661722877,1.0,0.9539307250181628,0.0,0.10895934672449292,0.10403475244674416,43.79388559366092,9.72233278292159,28.0,0,0,0,33,1171.2,24.7,29.1,0.3760330578512397,0.03733169098785957
35
- RF (tuned + ensemble),892.1954686825525,1.4854835507845638,550.3470940724247,1.5940359327620437,0.9289607706528442,0.9032274841645478,0.0,0.18064329945585395,0.1444018882162356,8295.175259470323,82.18254749521213,28.575757575757574,0.23505,784.4848535855612,1.0802352163526747,572.6733661144972,1.4206488404155224,1.0,0.9909492733665423,0.0,0.11473727839757886,0.09748097407587406,7404.509259652326,76.20706350974088,31.0,0,0,0,33,1166.9,28.9,29.6,0.37327823691460055,0.04530116764974994
36
- RF (tuned),892.1954686825525,0.16340523231711854,550.3470940724247,0.20470746704551906,0.9635085518401064,0.9323210252823122,0.0,0.19214798753197954,0.1539601229484898,8295.175259470323,9.800022397387403,30.96969696969697,0.23333,784.4848535855612,0.1385154088338216,572.6733661144972,0.14341358177200286,1.0,0.9927680372167216,0.0,0.12553920534461527,0.1162834063711407,7404.509259652326,8.630503216322625,33.0,0,0,0,33,1106.8,38.9,28.2,0.31887052341597794,0.0355003538680906
37
- NN_TORCH (default),31.202770278670574,0.17605404147395382,18.191025653235105,0.1946935658503532,0.9707785770094968,0.9452354201229408,0.0,0.1960089908501652,0.16561314663710547,307.7875153991938,9.709642602635572,32.46969696969697,0.23746,21.156466828452217,0.13171541690826416,11.818497713406881,0.14659688817979757,1.0,0.9967824967824969,0.0,0.1451120060624711,0.1156261970018704,210.45346039107372,7.909338196312241,33.0,0,0,0,33,1068.6,41.2,26.1,0.2847796143250689,0.03409050272167961
38
- FASTAI (default),10.065239688764116,0.6049560324511544,5.915643660471347,0.6570296472609841,0.9798376084459804,0.9522527051475422,0.0,0.22111992018579335,0.19830607557476165,95.24755640719609,34.77327333774118,34.0,0.25653,7.87360077434116,0.4848888476689657,5.1820077836540115,0.6521266629591491,1.0,1.0,0.0,0.1525216037950643,0.1310864418065254,93.12290255921157,34.91731418633814,36.0,0,0,0,33,1024.7,30.8,32.8,0.25,0.03151803306343119
39
- XT (default),0.822556706068893,0.08127484409897415,0.471496074166279,0.09148027851361618,0.969003331715743,0.9501038796765102,0.0,0.2459383969733959,0.24789102243280292,7.127294712723843,4.867665484175484,34.54545454545455,0.2625,0.7219058142768012,0.06215476195017497,0.4238388518146615,0.07849177235630946,1.0,1.0,0.0,0.18315770932006714,0.15512507020900368,6.235138739070136,4.456198846093199,38.0,0,0,0,33,1008.4,33.7,29.1,0.23760330578512398,0.03293316154552977
40
- RF (default),1.0802604700981167,0.07677059518769132,0.52542853627611,0.08878926590315098,0.9963176578450126,0.9722374063488899,0.0,0.23547329550406274,0.23756970348300344,8.967503792674067,4.676640526479693,34.81818181818182,0.25251,0.7427172581354777,0.058771981133355036,0.4717373991313263,0.06943642688143947,1.0,1.0,0.0,0.16087354235509543,0.1389292060353615,7.464730143591374,4.060095192168379,36.0,0,0,0,33,1000.0,0.0,0.0,0.23140495867768596,0.030165041844391588
41
- LR (tuned + ensemble),170.87050911529295,0.355976880680431,121.02102704404008,0.32634615259593897,0.9562323620082642,0.9511884714014192,0.0,0.29681715483722076,0.3288790800150929,1772.9723454272128,18.055698085268993,35.72727272727273,0.24491,158.05876021915012,0.1903169314066569,96.99876412252586,0.21682568555752907,1.0,1.0,0.0,0.24584842785173022,0.2585604322609332,1402.8748923675635,12.175213508812917,40.0,0,0,1,32,971.8,30.7,28.3,0.21074380165289255,0.0395287484344434
42
- LR (tuned),170.87050911529295,0.11554940750301887,121.02102704404008,0.10905393298119077,0.9705975404135027,0.9557879375574654,0.0,0.30240281992592855,0.3364007615017948,1772.9723454272128,5.509546013888674,36.65151515151515,0.25255,158.05876021915012,0.058829413519965276,96.99876412252586,0.0862505760249892,1.0,1.0,0.0,0.2610498572359117,0.2575122541077485,1402.8748923675635,4.067640041721783,40.5,0,0,0,33,941.3,36.0,34.9,0.18973829201101927,0.03271560479323064
43
- LR (default),4.536491992417409,0.12851071365754613,2.8968102616210682,0.12996863092563715,0.9802666115578355,0.9624056254813065,0.0,0.3174497985675089,0.3785366221645997,44.19552977352603,6.787762182097637,37.166666666666664,0.26248,3.583410120010376,0.07990590731302898,2.994106463982051,0.104980896680783,1.0,1.0,0.0,0.26184876537499124,0.29385559917332205,34.79737790723628,4.682238532824547,40.5,0,0,1,32,920.4,27.8,41.7,0.17803030303030304,0.03638877642906803
44
- KNN (tuned + ensemble),9.286149026889994,0.24958616836303815,4.012121930930575,0.19751524231739334,1.0,0.9938383302103199,0.09090909090909091,0.45963144905008185,0.6155225613033262,54.233484561833016,11.4223913906534,41.54545454545455,0.3197,4.6692627800835504,0.10710635185241699,2.7979792946300877,0.17676389939136214,1.0,1.0,0.0,0.4456786012073394,0.6488391296142632,55.92066201498072,9.83646280007513,43.0,0,0,0,33,692.4,40.8,49.4,0.07851239669421488,0.024261507287775874
45
- KNN (tuned),9.286149026889994,0.05394502376466488,4.012121930930575,0.040282320435423104,1.0,0.9967850435675412,0.09090909090909091,0.4777481497870918,0.6672249041511081,54.233484561833016,2.390983322122401,42.60606060606061,0.34215,4.6692627800835504,0.029062509536743164,2.7979792946300877,0.03624739765555662,1.0,1.0,0.0,0.4641563796701971,0.740394718900499,55.92066201498072,2.000488003155834,44.0,0,0,0,33,608.5,42.5,47.9,0.05440771349862259,0.02354297510758564
46
- KNN (default),0.24162639913334188,0.029173843387000086,0.11473320891232033,0.030260209027344123,1.0,0.9999378353081131,0.09090909090909091,0.5584824663379339,0.9468680050026339,1.0005153544482035,1.4920918647664636,44.31818181818182,0.40639,0.11169254779815674,0.019644896189371746,0.06555315548394994,0.02595407415488982,1.0,1.0,0.0,0.5530598349091809,1.0,1.0,1.2186499153225778,45.0,0,0,0,33,372.1,64.1,104.9,0.015495867768595042,0.022617898518851785
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/tabpfn-imputed/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:93cfb4e8d9735c9e85e3b1d6cbb1605cf352a90d91123c8ce53bdc7bd06aad5a
3
- size 460639
 
 
 
 
data/tabpfn-imputed/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:cbaab5ec933562e4f764dac0c985634ed6a57775fc5b6e1a6e0712fab0a5a690
3
- size 237823
 
 
 
 
data/tabpfn-imputed/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:66bc73731383a497a691c0494dd1e14816cf526c1a29945d7d664a415532df68
3
- size 215447
 
 
 
 
data/tabpfn-tabicl/figures/critical-diagram.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:b784096fb463b5628fbbfec2afc824dcea72b7258674de4d9c39b8124fcb9505
3
- size 338604
 
 
 
 
data/tabpfn-tabicl/leaderboard.tex DELETED
@@ -1,53 +0,0 @@
1
- \begin{tabular}{llcccccrr}
2
- \toprule
3
- \textbf{Model} & \textbf{Elo ($\uparrow$)} & \textbf{Norm.} & \textbf{Avg.} & \textbf{Harm.} & \textbf{\#wins ($\uparrow$)} & \textbf{Improva-} & \textbf{Train time} & \textbf{Predict time} \\
4
- & & \textbf{score ($\uparrow$)} & \textbf{rank ($\downarrow$)} & \textbf{mean} & & \textbf{bility ($\downarrow$)} & \textbf{per 1K [s]} & \textbf{per 1K [s]} \\
5
- & & & & \textbf{rank ($\downarrow$)} & & & & \\
6
- \midrule
7
- TabPFNv2 (T+E) & \textcolor{gold}{\textbf{1745${}_{-45,+50}$}} & \textcolor{gold}{\textbf{0.713}} & \textcolor{gold}{\textbf{5.7}} & \textcolor{gold}{\textbf{2.2}} & \textcolor{gold}{\textbf{8}} & \textcolor{gold}{\textbf{4.6\%}} & 3445.60 & 48.24 \\
8
- TabM (T+E) & \textcolor{silver}{\textbf{1620${}_{-39,+34}$}} & 0.491 & \textcolor{silver}{\textbf{9.2}} & 4.5 & 2 & 9.2\% & 2828.45 & 1.60 \\
9
- TabICL (D) & \textcolor{bronze}{\textbf{1614${}_{-36,+35}$}} & \textcolor{silver}{\textbf{0.549}} & \textcolor{bronze}{\textbf{9.4}} & \textcolor{silver}{\textbf{3.3}} & \textcolor{silver}{\textbf{4}} & \textcolor{silver}{\textbf{7.1\%}} & 8.89 & 1.74 \\
10
- TabPFNv2 (T) & 1593${}_{-42,+30}$ & \textcolor{silver}{\textbf{0.549}} & 10.3 & 3.7 & 1 & \textcolor{bronze}{\textbf{8.2\%}} & 3445.60 & 1.00 \\
11
- RealMLP (T+E) & 1593${}_{-37,+29}$ & 0.461 & 10.2 & 6.9 & 0 & 9.2\% & 6796.27 & 12.37 \\
12
- AutoGluon 1.3 (4h) & 1549${}_{-48,+33}$ & 0.455 & 11.9 & 5.6 & 1 & 9.5\% & 2309.21 & 2.55 \\
13
- TabPFNv2 (D) & 1546${}_{-39,+38}$ & 0.503 & 12.0 & \textcolor{bronze}{\textbf{3.4}} & \textcolor{silver}{\textbf{4}} & 9.5\% & 4.06 & 0.44 \\
14
- LightGBM (T+E) & 1524${}_{-34,+37}$ & 0.334 & 12.9 & 7.3 & 1 & 11.8\% & 647.56 & 1.72 \\
15
- TabM (T) & 1514${}_{-41,+32}$ & 0.393 & 13.4 & 7.4 & 0 & 10.4\% & 2828.45 & 0.22 \\
16
- LightGBM (T) & 1467${}_{-37,+33}$ & 0.277 & 15.3 & 13.2 & 0 & 12.2\% & 647.56 & 0.28 \\
17
- CatBoost (T+E) & 1461${}_{-34,+37}$ & 0.283 & 15.7 & 11.2 & 0 & 11.7\% & 1465.86 & 0.69 \\
18
- CatBoost (T) & 1440${}_{-35,+35}$ & 0.257 & 16.6 & 10.5 & 0 & 12.0\% & 1465.86 & 0.09 \\
19
- TabM (D) & 1432${}_{-36,+35}$ & 0.293 & 17.0 & 10.9 & 0 & 13.7\% & 10.42 & 0.15 \\
20
- CatBoost (D) & 1430${}_{-38,+33}$ & 0.239 & 17.0 & 9.9 & 0 & 13.3\% & 5.72 & 0.11 \\
21
- ModernNCA (T) & 1428${}_{-30,+34}$ & 0.262 & 17.1 & 8.8 & 1 & 11.7\% & 5944.88 & 0.52 \\
22
- XGBoost (T+E) & 1419${}_{-39,+34}$ & 0.230 & 17.6 & 14.0 & 0 & 13.1\% & 766.06 & 1.92 \\
23
- EBM (T+E) & 1387${}_{-35,+29}$ & 0.189 & 19.1 & 12.5 & 0 & 15.7\% & 1109.06 & 0.23 \\
24
- XGBoost (T) & 1382${}_{-30,+38}$ & 0.191 & 19.2 & 16.8 & 0 & 13.4\% & 766.06 & 0.28 \\
25
- RealMLP (T) & 1382${}_{-31,+39}$ & 0.184 & 19.3 & 16.4 & 0 & 12.8\% & 6796.27 & 0.73 \\
26
- ModernNCA (T+E) & 1375${}_{-37,+34}$ & 0.311 & 19.6 & 8.6 & 0 & 12.7\% & 5944.88 & 8.40 \\
27
- ModernNCA (D) & 1367${}_{-34,+32}$ & 0.185 & 19.9 & 10.0 & 1 & 15.1\% & 14.80 & 0.34 \\
28
- TorchMLP (T+E) & 1366${}_{-33,+31}$ & 0.211 & 20.1 & 14.9 & 0 & 13.3\% & 2862.05 & 2.16 \\
29
- FastaiMLP (T+E) & 1362${}_{-38,+34}$ & 0.239 & 20.2 & 11.1 & 0 & 15.7\% & 1358.63 & 8.07 \\
30
- TabDPT (D) & 1328${}_{-33,+27}$ & 0.263 & 21.9 & 6.1 & 2 & 14.8\% & 27.49 & 8.86 \\
31
- EBM (T) & 1324${}_{-44,+32}$ & 0.126 & 22.2 & 17.2 & 0 & 16.5\% & 1109.06 & 0.03 \\
32
- EBM (D) & 1305${}_{-38,+31}$ & 0.160 & 23.0 & 9.7 & 1 & 17.3\% & 5.28 & 0.08 \\
33
- FastaiMLP (T) & 1269${}_{-44,+36}$ & 0.119 & 24.7 & 18.9 & 0 & 17.2\% & 1358.63 & 0.90 \\
34
- RealMLP (D) & 1268${}_{-32,+34}$ & 0.087 & 24.7 & 21.6 & 0 & 15.6\% & 22.51 & 1.64 \\
35
- ExtraTrees (T+E) & 1261${}_{-51,+29}$ & 0.103 & 25.1 & 17.7 & 0 & 18.2\% & 370.85 & 1.47 \\
36
- TorchMLP (T) & 1234${}_{-32,+36}$ & 0.097 & 26.1 & 22.5 & 0 & 15.9\% & 2862.05 & 0.15 \\
37
- XGBoost (D) & 1210${}_{-35,+41}$ & 0.038 & 27.3 & 25.1 & 0 & 16.7\% & 2.40 & 0.22 \\
38
- ExtraTrees (T) & 1201${}_{-32,+37}$ & 0.072 & 27.6 & 22.8 & 0 & 19.4\% & 370.85 & 0.16 \\
39
- RandomForest (T+E) & 1156${}_{-30,+34}$ & 0.056 & 29.6 & 22.3 & 0 & 19.5\% & 527.42 & 1.39 \\
40
- LightGBM (D) & 1154${}_{-32,+37}$ & 0.035 & 29.7 & 27.9 & 0 & 17.7\% & 2.90 & 0.13 \\
41
- RandomForest (T) & 1094${}_{-31,+36}$ & 0.021 & 32.1 & 29.5 & 0 & 20.8\% & 527.42 & 0.12 \\
42
- TorchMLP (D) & 1080${}_{-30,+37}$ & 0.034 & 32.4 & 28.9 & 0 & 21.4\% & 10.38 & 0.19 \\
43
- FastaiMLP (D) & 1052${}_{-31,+34}$ & 0.026 & 33.6 & 30.9 & 0 & 23.0\% & 4.73 & 0.62 \\
44
- Linear (T+E) & 1030${}_{-40,+39}$ & 0.056 & 34.3 & 22.9 & 0 & 28.3\% & 88.63 & 0.26 \\
45
- RandomForest (D) & 1000${}_{-0,+0}$ & 0.005 & 35.4 & 33.6 & 0 & 25.8\% & 0.45 & 0.07 \\
46
- Linear (T) & 997${}_{-43,+37}$ & 0.037 & 35.3 & 28.5 & 0 & 29.0\% & 88.63 & 0.09 \\
47
- Linear (D) & 982${}_{-39,+42}$ & 0.025 & 35.8 & 25.1 & 0 & 30.5\% & 2.27 & 0.11 \\
48
- ExtraTrees (D) & 956${}_{-48,+44}$ & 0.015 & 36.7 & 33.0 & 0 & 27.5\% & 0.40 & 0.07 \\
49
- KNN (T+E) & 706${}_{-48,+50}$ & 0.000 & 41.6 & 41.2 & 0 & 48.0\% & 2.97 & 0.17 \\
50
- KNN (T) & 611${}_{-49,+52}$ & 0.000 & 42.7 & 42.5 & 0 & 50.1\% & 2.97 & 0.04 \\
51
- KNN (D) & 410${}_{-97,+86}$ & 0.000 & 44.2 & 44.1 & 0 & 59.2\% & 0.07 & 0.02 \\
52
- \bottomrule
53
- \end{tabular}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/tabpfn-tabicl/tabarena_leaderboard.csv DELETED
@@ -1,46 +0,0 @@
1
- method,time_train_s,time_infer_s,time_train_s_per_1K,time_infer_s_per_1K,normalized-error,normalized-error-task,imputed,champ_delta,loss_rescaled,time_train_s_rescaled,time_infer_s_rescaled,rank,median_metric_error,median_time_train_s,median_time_infer_s,median_time_train_s_per_1K,median_time_infer_s_per_1K,median_normalized-error,median_normalized-error-task,median_imputed,median_champ_delta,median_loss_rescaled,median_time_train_s_rescaled,median_time_infer_s_rescaled,median_rank,rank=1_count,rank=2_count,rank=3_count,rank>3_count,elo,elo+,elo-,winrate,mrr
2
- TABPFNV2_GPU (tuned + ensemble),14614.639715819583,148.11643430860633,3877.419600731425,65.38306906011915,0.28678978417816775,0.36802578112939455,0.0,0.046199529063116605,0.028953107532712387,72438.91792383758,5032.936436246271,5.711538461538462,0.17261500000000002,8173.816975453165,60.12680508957969,3445.602606086448,48.23597351862546,0.2205553142586971,0.4119431743489412,0.0,0.021640889295981347,0.01687441435881499,43032.16406688419,2931.913108083719,3.5,8,3,2,13,1744.8,49.9,44.6,0.8929195804195804,0.4573568026266538
3
- TABM_GPU (tuned + ensemble),7080.034926002046,2.290320276704609,3436.5218404284533,2.1079046445931953,0.5088870310870036,0.5502148202809035,0.0,0.09238775584230949,0.04985849884037403,43123.835967101855,114.22499982092596,9.23076923076923,0.17379499999999998,6405.6722231189415,1.880999751885732,2828.4486752645407,1.5957955528425738,0.4807674997729695,0.552934558160541,0.0,0.04617144277378826,0.034316541138176816,44088.87353980956,109.46959390379024,8.5,2,1,1,22,1620.5,33.6,38.7,0.8129370629370629,0.2232701744704473
4
- TABICL_GPU (default),21.933824618351764,2.7954679915028757,10.278805574502684,2.035731958506663,0.4512811946522884,0.5193395651291017,0.0,0.07108756808334525,0.034689269294893396,145.15967621212897,119.47187491598106,9.423076923076923,0.17207,19.54939921696981,2.1857474181387158,8.891901835466445,1.7433667301105085,0.4714674425983171,0.5893306649922065,0.0,0.03487584587193143,0.02062556592021359,120.41628593220491,98.62751622705991,8.0,4,3,1,18,1614.3,35.0,35.6,0.8085664335664335,0.30038336835483237
5
- REALMLP (tuned + ensemble),22557.367400865067,16.92627251148224,7536.649022231907,13.528429463644256,0.5391293578105109,0.5610323152414572,0.0,0.09167243360994543,0.050711164692383556,129779.24577573304,792.4654559581195,10.173076923076923,0.173815,14323.011887841756,15.38266196515825,6796.2676485876,12.365217761885718,0.506879945270726,0.5713063476221716,0.0,0.04889132639035737,0.032252119049673914,113460.03331383408,666.1733358924521,9.25,0,1,1,24,1593.2,28.1,36.1,0.7915209790209791,0.14568408028160348
6
- TABPFNV2_GPU (tuned),14614.639715819583,4.849548537201351,3877.419600731425,2.254088269272259,0.45139060905978096,0.48721477659307727,0.0,0.08179741674819369,0.05007437241116467,72438.91792383758,165.31166707386078,10.346153846153847,0.1868,8173.816975453165,0.8871031337314181,3445.602606086448,0.9952991538273057,0.3556785171687077,0.49260260495569347,0.0,0.05804204088839793,0.02327176321840322,43032.16406688419,48.819771718398094,6.0,1,8,1,16,1593.3,30.0,41.6,0.7875874125874126,0.2688413115117749
7
- AutoGluon 1.3 (4h),5931.009838958377,3.1656774007357082,3673.8246676849526,3.0250106370782723,0.5453015597077798,0.551608410257374,0.0,0.09461403802109652,0.05278875434794407,39712.28933278582,151.42916243848958,11.923076923076923,0.17691,5020.2655313505065,2.664059625731574,2309.213100138395,2.5543334551748154,0.5329018805269501,0.564628859523932,0.0,0.047627759409483605,0.04116173129591805,35357.34871646478,128.98489015193604,8.5,1,2,0,23,1549.2,32.6,47.5,0.7517482517482518,0.17979779944912772
8
- TABPFNV2_GPU (default),11.884418644151115,1.1739113624279316,5.514520630749067,0.6490305586450037,0.4969736056210933,0.5669262411732706,0.0,0.09512310407816416,0.06286358591123144,76.29505964660768,41.758850754001486,12.038461538461538,0.1886,8.093020015292698,0.44081105126274955,4.062007578219401,0.4356807523460313,0.36947822708003786,0.5314622329509133,0.0,0.05018180969629027,0.021765272352146064,58.62232186481185,25.39697585954705,5.0,4,1,4,17,1545.9,37.8,38.7,0.7491258741258742,0.2929835638700715
9
- GBM (tuned + ensemble),1379.7589400635827,3.041326452829899,860.8328230742596,2.952036385308545,0.6657481018831596,0.6839901368858755,0.0,0.11763135033701075,0.06190617706409025,10455.248725429481,156.11534656534064,12.942307692307692,0.174085,1321.8940714677174,2.4085231754514904,647.5566852470606,1.7204157994123173,0.6871978920855942,0.6973917723105316,0.0,0.05156709998972453,0.05140473961387122,9480.825611975059,103.35126359750211,12.0,1,0,1,24,1523.8,36.9,33.7,0.728583916083916,0.13766946258113175
10
- TABM_GPU (tuned),7080.034926002046,0.2169825582422762,3436.5218404284533,0.22059896519012676,0.6069327367930709,0.6041117671664842,0.0,0.10415142755636804,0.0634734305378328,43123.835967101855,11.305450805467302,13.365384615384615,0.17596499999999998,6405.6722231189415,0.19745506313112046,2828.4486752645407,0.21564835018398254,0.5648701425467781,0.6431391594849503,0.0,0.06661219922938116,0.04216183343366903,44088.87353980956,9.589800419015791,13.5,0,2,1,23,1514.3,31.5,41.0,0.7189685314685315,0.1346814850606291
11
- GBM (tuned),1379.7589400635827,0.5707490142594036,860.8328230742596,0.6449583363337497,0.7228979250207982,0.7058101964109172,0.0,0.12190920678437432,0.06697490603245197,10455.248725429481,28.289876597893098,15.326923076923077,0.17371999999999999,1321.8940714677174,0.34941615396075776,647.5566852470606,0.28350492153737616,0.7665571529464766,0.7066235514415495,0.0,0.055167136392065996,0.05173098916436741,9480.825611975059,16.480248719877835,15.0,0,0,0,26,1466.6,32.3,36.5,0.6743881118881119,0.07566422588192029
12
- CAT (tuned + ensemble),4698.536897969144,0.9597997341400538,2797.035390366479,0.7965719365801648,0.7170548537161707,0.7134226783294108,0.0,0.11678551357841017,0.070692810046672,27448.91121504087,44.04781428274726,15.73076923076923,0.1771,3111.985966463884,0.7725222905476887,1465.8584785724734,0.6919304562740058,0.7872283925054006,0.732977106258743,0.0,0.0655088388746699,0.053959374396486216,21194.263639925673,39.04121549244424,15.5,0,0,2,24,1460.8,36.4,33.2,0.6652097902097902,0.08944640549297035
13
- CAT (tuned),4698.536897969144,0.11917537287769155,2797.035390366479,0.12051762325590008,0.743441660872026,0.7323790142940958,0.0,0.11966484148538205,0.07092154412213542,27448.91121504087,5.822055889865127,16.576923076923077,0.177215,3111.985966463884,0.10264339711931017,1465.8584785724734,0.09138547661664378,0.8395738793220267,0.7706205283931036,0.0,0.06254942533618796,0.05674313716914728,21194.263639925673,5.623705460666335,16.5,0,1,0,25,1440.3,35.0,34.3,0.6459790209790209,0.09554479235182911
14
- CAT (default),184.70535022315818,0.14151572040003588,154.6142383995702,0.1555426094856078,0.7609683280174484,0.7735939128687849,0.0,0.13291716006066936,0.07483896168122789,477.2608484422213,7.423108461919998,16.96153846153846,0.176625,11.6375067697631,0.16093741522894967,5.723546572951673,0.1101036800878527,0.8509645845274714,0.8315753996038593,0.0,0.07003170550807408,0.037594866478298045,107.22813859237792,5.910285969899903,18.0,0,2,0,24,1430.2,32.5,37.1,0.6372377622377622,0.10136602526278639
15
- TABM_GPU (default),25.031941110761757,0.20188171669968172,14.494913436183246,0.20961960333163143,0.7073629011311515,0.7187464199165071,0.0,0.13697955308317783,0.07804753275194268,189.57549613749592,9.978425427673205,17.0,0.175255,19.491187883747948,0.16552388005786473,10.418743379746541,0.15495242186411928,0.7255503354617996,0.7368230723205171,0.0,0.07020888606704828,0.04168400568174062,140.5904555433362,9.536419533323254,14.0,0,0,1,25,1432.4,34.7,35.6,0.6363636363636364,0.09163686449085374
16
- MNCA_GPU (tuned),10872.434398398032,0.5841771599573967,5614.838977261846,0.5087025474163123,0.7381569257478842,0.6629980011184993,0.0,0.11651894666071064,0.07336434047866673,77920.67944321278,26.83390437537825,17.134615384615383,0.17598,10294.846671289868,0.4166409836875068,5944.878874246984,0.5156200362715774,0.835109147002796,0.7215315181988433,0.0,0.06755655006003625,0.057542536642744974,66956.02864547497,22.936826653686147,17.0,1,0,0,25,1428.2,34.0,29.1,0.6333041958041958,0.11344690953636491
17
- XGB (tuned + ensemble),1872.6912781718452,2.683822184852046,1147.2290814465296,3.3619425077092306,0.7700046331253076,0.7584957089348034,0.0,0.13102529659388182,0.08005804166927238,11800.394028929062,144.45205787298235,17.634615384615383,0.18004,1385.052251373397,1.6176902174949646,766.0569170086173,1.9172343251389494,0.8138714172419255,0.7736024300804334,0.0,0.07343601410090844,0.052916644618479496,11520.40078776449,81.51508510452376,15.75,0,0,0,26,1418.7,33.3,38.4,0.6219405594405595,0.07145619883307336
18
- EBM (tuned + ensemble),2781.1579704396745,0.37791664172441536,1628.507747090886,0.31990997744824035,0.8107693992272296,0.8130312037051659,0.0,0.15727014105330875,0.1147747162813527,16692.42562947248,18.328617946180266,19.115384615384617,0.190645,1861.6872274941868,0.27221977710723877,1109.0589152421817,0.22935467594400044,0.9143281956988789,0.8324702308099496,0.0,0.0849735392053248,0.05122045075434685,17695.477225433653,12.287120931350431,19.5,0,0,1,25,1386.8,28.6,34.6,0.5882867132867133,0.07980726095945688
19
- XGB (tuned),1872.6912781718452,0.6264070270407913,1147.2290814465296,0.8619192527074968,0.8091376057067142,0.7804628004900049,0.0,0.13354760891147244,0.08261886840998532,11800.394028929062,30.013383573853858,19.23076923076923,0.18088500000000002,1385.052251373397,0.25049915578630233,766.0569170086173,0.27754517145625424,0.8584872047951659,0.7823842751785922,0.0,0.07798856953005051,0.052696432684783016,11520.40078776449,11.544680214373663,19.5,0,0,0,26,1382.4,37.6,29.9,0.5856643356643356,0.05948147991731615
20
- REALMLP (tuned),22557.367400865067,0.8037233395454211,7536.649022231907,0.7475802961701496,0.8159242974551196,0.7431297778968523,0.0,0.12831548562022352,0.08016832668808226,129779.24577573304,38.49879722033203,19.346153846153847,0.176795,14323.011887841756,0.6870223946041532,6796.2676485876,0.7311902546116515,0.9055458050077084,0.7906338583190218,0.0,0.076222073390521,0.06456358654483704,113460.03331383408,33.9836370254647,19.0,0,0,0,26,1382.1,38.7,30.2,0.583041958041958,0.06089899180632481
21
- MNCA_GPU (tuned + ensemble),10872.434398398032,12.131162534310267,5614.838977261846,9.225622394556064,0.6894819069412794,0.6680816828228461,0.0,0.12705647111885687,0.09495264807215059,77920.67944321278,542.9013751751211,19.615384615384617,0.193055,10294.846671289868,9.331144248114692,5944.878874246984,8.395978282094799,0.7196681887948866,0.7088852482744055,0.0,0.10101560653736502,0.048750216328898435,66956.02864547497,426.41545163369784,15.5,0,1,3,22,1374.8,33.7,36.9,0.5769230769230769,0.11638038970676967
22
- MNCA_GPU (default),28.48231661941251,0.466951452768766,14.20837500082612,0.38564885096222307,0.814733537385181,0.7636691905714614,0.0,0.15085267890377566,0.08288628866859794,190.09277232716437,21.03504265137771,19.923076923076923,0.18519,24.325043747160173,0.36861725648244226,14.804303881798361,0.336567721078897,0.9852635719475782,0.7845495801025242,0.0,0.07974632039399476,0.053259111831894385,190.7747999331715,15.595042220247258,22.0,1,0,0,25,1366.8,31.6,33.6,0.5699300699300699,0.09968879371505522
23
- NN_TORCH (tuned + ensemble),8367.9645569837,3.8136142372066137,3603.347061143873,3.319891821445254,0.789332468820142,0.763048179574523,0.0,0.13325555172470904,0.07705269210898663,56068.072795326945,181.3894555134842,20.076923076923077,0.17949500000000002,6973.094145007928,2.944306871626112,2862.0511040893566,2.157502904371376,0.929200394977775,0.867096988886354,0.0,0.06782479524801643,0.060477195648830424,51500.257380537965,155.60936197975602,20.5,0,0,0,26,1366.2,30.7,32.9,0.5664335664335665,0.06714954864647764
24
- FASTAI (tuned + ensemble),2658.981896154697,9.680411353478066,1616.869681108774,10.454986295420108,0.7612867161422255,0.7680594978175002,0.0,0.1568263353343394,0.08996186978718192,20955.495033608,514.94962880429,20.192307692307693,0.17986000000000002,2313.697349058257,8.905639794137743,1358.6299051921596,8.066833347447606,0.966752862972931,0.8192027149002457,0.0,0.08132830034754124,0.05523334112431259,17602.335588839425,531.7232414230566,20.5,0,1,0,25,1362.0,34.0,37.7,0.5638111888111889,0.09043991205909353
25
- TABDPT_GPU (default),73.6718590354308,22.101120046978323,33.70000421461804,29.140721170745643,0.7369463160785844,0.7376189224156473,0.0,0.14812258779168241,0.09258933488429072,529.9323761499634,1297.090837487081,21.923076923076923,0.204185,71.26209372944302,21.38572289016512,27.488664901286818,8.862313494979123,0.9934509167853494,0.8819532136837882,0.0,0.06691814537901497,0.047152893925625654,513.7039796057306,1113.0844664803024,25.5,2,0,3,21,1328.1,26.1,32.1,0.5244755244755245,0.1652513277638594
26
- EBM (tuned),2781.1579704396745,0.04655793772803412,1628.507747090886,0.04424123436436782,0.8741362650172407,0.8486402698707779,0.0,0.16506051187996698,0.12247707446474837,16692.42562947248,2.268478500812272,22.173076923076923,0.19155,1861.6872274941868,0.0338150527742174,1109.0589152421817,0.02733291425041831,1.0,0.8920041279466094,0.0,0.0928869358898275,0.062194755491841634,17695.477225433653,1.252412448907763,24.0,0,0,0,26,1323.6,32.0,43.2,0.5187937062937062,0.05813096634548205
27
- EBM (default),10.549292183941246,0.06438207361433242,5.791308777159165,0.0798605102452594,0.8400271134997129,0.8487713109543825,0.0,0.17250673803024968,0.1242665515372485,75.26102802648397,3.652629917649085,22.96153846153846,0.19248500000000002,8.052384217580158,0.051854162746005586,5.279257749622374,0.07769986864408396,1.0,0.9013272947121589,0.0,0.09554880640873625,0.05656333545253643,77.97362526017017,3.340101142403973,23.5,1,0,2,23,1305.1,30.9,37.1,0.5008741258741258,0.10357115772052694
28
- FASTAI (tuned),2658.981896154697,0.7253259826929142,1616.869681108774,0.8254771149356669,0.8809901967099656,0.8331277991174584,0.0,0.17198923689106127,0.10544002279773745,20955.495033608,39.61972184578173,24.673076923076923,0.18178,2313.697349058257,0.7597291602028741,1358.6299051921596,0.8969521438633071,1.0,0.8511566734075353,0.0,0.09660908052453743,0.07038888057026207,17602.335588839425,37.14448504861147,23.5,0,0,0,26,1269.4,35.7,43.6,0.4619755244755245,0.053035697298350795
29
- REALMLP (default),64.63560309084053,2.1168234704906106,23.294746921478428,3.559757523085833,0.9128237161083051,0.8529272202090016,0.0,0.155832358170872,0.09609156090120739,412.6999871750535,152.21281941530677,24.692307692307693,0.17759999999999998,39.836480140686035,1.8815347883436415,22.509551099714287,1.6399439572487942,1.0,0.8792664545910313,0.0,0.11770994919600164,0.07176066792477417,317.0070586720106,101.22633902233932,25.5,0,0,0,26,1268.3,33.3,32.0,0.46153846153846156,0.0463012884663464
30
- XT (tuned + ensemble),714.8477392964893,1.554570463987497,476.1600651525857,1.7304383155841179,0.8972330639628154,0.8658630644450889,0.0,0.18213065340665885,0.12358057737795101,6037.128774148354,86.31947081771706,25.115384615384617,0.18718,684.9222148127026,1.3082488920953539,370.85408017752667,1.4664534567412004,1.0,0.93225990863037,0.0,0.09673248817237246,0.07594774077990443,5339.627074654447,77.0823275943278,28.75,0,0,0,26,1260.9,28.1,50.4,0.4519230769230769,0.056583927890065173
31
- NN_TORCH (tuned),8367.9645569837,0.19940724566451504,3603.347061143873,0.20335754670694137,0.9026334443249968,0.8505534111678406,0.0,0.15913477525938158,0.09939495197790925,56068.072795326945,10.018943215736572,26.096153846153847,0.18092,6973.094145007928,0.144207231203715,2862.0511040893566,0.15177921475257505,1.0,0.9104892489328,0.0,0.10379368194908523,0.09268904724445692,51500.257380537965,8.681540922799902,27.0,0,0,0,26,1234.1,35.1,31.4,0.42963286713286714,0.044537959237026664
32
- XGB (default),4.970388249658113,0.2682310309165563,3.142391352663509,0.37694382359999684,0.9616503827377219,0.9074903041248756,0.0,0.16741258325301087,0.12284059879020369,37.42364826794432,14.932177592236862,27.326923076923077,0.18894,4.433991021580166,0.20984046989017063,2.395188706947506,0.2182544724645258,1.0,0.9524606541941862,0.0,0.11516738963641299,0.07681744526375522,34.313637563138,10.099890730056913,26.0,0,0,0,26,1210.2,40.8,34.4,0.40166083916083917,0.03991507925820152
33
- XT (tuned),714.8477392964893,0.1734743657275143,476.1600651525857,0.2154329300724878,0.9278062134116685,0.894456393905736,0.0,0.19352157020279787,0.13326004552891305,6037.128774148354,9.800458404720944,27.615384615384617,0.19,684.9222148127026,0.15512712796529132,370.85408017752667,0.16132775528274945,1.0,0.9660073423997149,0.0,0.10725622533000512,0.08413965072039432,5339.627074654447,8.581565117405717,31.5,0,0,0,26,1200.8,36.5,31.4,0.3951048951048951,0.04392917095451317
34
- RF (tuned + ensemble),915.5438805149151,1.5748089445961848,532.5626464695567,1.640386311447312,0.9441488988669617,0.9119059796513475,0.0,0.1949239309121447,0.1477964033866862,6794.043055663636,81.1610891109226,29.615384615384617,0.18906,789.1687051984999,1.122593025366465,527.4239458868619,1.3899910445458383,1.0,0.9937950899255451,0.0,0.11848316871815162,0.09151554725140643,6269.6136687159,75.55960345229795,32.0,0,0,0,26,1156.4,33.4,29.8,0.34965034965034963,0.044833755203573834
35
- GBM (default),5.376282313848153,0.20799484711426958,3.3489543497095178,0.19728412961063485,0.9647597830572252,0.9226882159397388,0.0,0.17658301542562455,0.11946138423570958,42.11806124941037,9.792256635961557,29.73076923076923,0.188175,5.033887876404656,0.22240020169152153,2.8984772023300502,0.13282292956587455,1.0,0.9556137019600007,0.0,0.11805347934940252,0.09964492878261602,38.718264562373335,7.411495765404641,29.5,0,0,0,26,1153.9,36.6,31.6,0.34702797202797203,0.03587490743850603
36
- RF (tuned),915.5438805149151,0.159982283706339,532.5626464695567,0.2003027211432199,0.9785345462614525,0.9406434238705461,0.0,0.20796626054264733,0.15738141593056762,6794.043055663636,8.85724716182767,32.11538461538461,0.190815,789.1687051984999,0.14185967445373537,527.4239458868619,0.12279197881507195,1.0,0.9949804738019044,0.0,0.132245519872299,0.10601779560500876,6269.6136687159,8.226247917103098,33.5,0,0,0,26,1093.6,35.6,30.5,0.2928321678321678,0.03386774723549563
37
- NN_TORCH (default),27.39151145983965,0.20185868648382335,14.560395769408618,0.21573681790796437,0.9661328360928345,0.9393026943715941,0.0,0.21437115690743067,0.14784559127083513,188.3149665224302,10.435316060204023,32.44230769230769,0.183195,20.22722778055403,0.14770235617955524,10.376930987013111,0.18792402145583464,1.0,0.9983912483912485,0.0,0.14318031310217677,0.1051705699943664,197.09431521312473,8.356657050571908,33.0,0,0,0,26,1080.3,36.1,29.9,0.2854020979020979,0.034626544348254344
38
- FASTAI (default),10.397479128328143,0.6249765678348704,5.689409993948898,0.6426262937267089,0.9744092722583597,0.9403239500449883,0.0,0.2303447427008108,0.17838953052326556,79.38940070919118,32.314469844151965,33.61538461538461,0.19183499999999998,9.91800790362888,0.5629585729704962,4.729717448807326,0.6226443216085578,1.0,1.0,0.0,0.1608856096564119,0.11423976777713259,63.05950085745769,32.34956133934325,36.5,0,0,0,26,1052.0,33.4,30.1,0.25874125874125875,0.0323196444621068
39
- LR (tuned + ensemble),175.85719627702338,0.42670365847074065,115.79203038641191,0.374819481777715,0.9444487671643353,0.9380469060094936,0.0,0.2831741664343597,0.2482282180537343,1448.7019616321465,20.41052886468638,34.26923076923077,0.216585,158.39292872746785,0.19311302105585734,88.63237206036187,0.25697546561814155,1.0,1.0,0.0,0.2176901252380467,0.1779541590472698,1246.7830478036524,13.25530093456624,38.0,0,0,1,25,1030.4,38.3,39.3,0.24388111888111888,0.04361346363347015
40
- LR (tuned),175.85719627702338,0.1364539579448537,115.79203038641191,0.12123112010680681,0.9626814936017534,0.9438846899767829,0.0,0.29021553617359425,0.25773755527090664,1448.7019616321465,6.009867470278231,35.28846153846154,0.21681,158.39292872746785,0.07476819356282552,88.63237206036187,0.08838151119168902,1.0,1.0,0.0,0.22518752798442315,0.1834788893417763,1246.7830478036524,4.016028876859062,38.0,0,0,0,26,996.9,36.4,42.3,0.22071678321678323,0.035049521543351864
41
- RF (default),0.9640304686676744,0.07426402477117686,0.5084719578868067,0.0844146006139,0.9953262580340543,0.9705599547883514,0.0,0.2575551228066212,0.2479832196611893,6.6754175478812146,4.153857105918795,35.36538461538461,0.213105,0.8910642200046115,0.05825435982810126,0.447836563000179,0.06594795127086661,1.0,1.0,0.0,0.17586669886220047,0.11692775501298655,5.654732996519019,3.502947097147378,37.5,0,0,0,26,1000.0,0.0,0.0,0.21896853146853146,0.029791202526941722
42
- LR (default),4.707631549162742,0.1483236100938585,2.804518891637732,0.1437661673750307,0.9749537762080217,0.952284063110889,0.0,0.305476356016987,0.29332097164244453,36.73103788870433,7.308777223189359,35.75,0.22103499999999998,4.746849238872528,0.08430976470311483,2.2657084486770183,0.10642460584640503,1.0,1.0,0.0,0.22930816077470617,0.22370791852870522,31.50980214431158,4.703818974402271,39.5,0,0,1,25,982.4,41.3,38.8,0.21022727272727273,0.039833353869287484
43
- XT (default),0.8462119445841536,0.0803421718442542,0.47456795510527094,0.08795126020679146,0.9848550619375056,0.9640817361404062,0.0,0.27531426583624075,0.2767172685663852,6.0304600748852675,4.406716139948246,36.69230769230769,0.215405,0.7635703219307794,0.06673479080200195,0.40395335630596185,0.07007418015238884,1.0,1.0,0.0,0.18424442460299273,0.1562338348801046,5.148113375558061,3.7424721126192138,39.0,0,0,0,26,955.8,43.4,47.9,0.1888111888111888,0.030314566866112242
44
- KNN (tuned + ensemble),10.643233974469014,0.2739423143558013,4.426066058298347,0.19964653999651732,1.0,0.9945826759714126,0.11538461538461539,0.4799351328471401,0.5914832143485576,52.034665130956334,11.285407568076662,41.57692307692308,0.31118999999999997,5.881281822257572,0.1030390567249722,2.969713103492417,0.16730320161416595,1.0,1.0,0.0,0.4552261138058871,0.6183874930702317,55.06974575171796,8.266636963631449,43.0,0,0,0,26,706.1,49.1,47.3,0.0777972027972028,0.024279443329967162
45
- KNN (tuned),10.643233974469014,0.06075953357240074,4.426066058298347,0.04304950843110991,1.0,0.9965786300523389,0.11538461538461539,0.5010874219586886,0.6455640718520189,52.034665130956334,2.5027008595459077,42.69230769230769,0.31467999999999996,5.881281822257572,0.036990099483066134,2.969713103492417,0.039402452723266784,1.0,1.0,0.0,0.4955223463378593,0.67298570245769,55.06974575171796,2.091430487051362,44.0,0,0,0,26,610.7,51.4,48.7,0.05244755244755245,0.023502270498812147
46
- KNN (default),0.2899092884145231,0.0317453768518236,0.13426353967394633,0.031065835648244968,1.0,1.0,0.11538461538461539,0.592345422239955,0.9491257372621102,1.00065410372272,1.4527699655062962,44.25,0.34582,0.1258228341738383,0.02117468251122369,0.07457399441809831,0.021006283652748647,1.0,1.0,0.0,0.6201400123756287,1.0,1.0,1.0057335151210618,45.0,0,0,0,26,410.3,85.8,96.4,0.017045454545454544,0.022663376691509682
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
data/tabpfn-tabicl/time_plot.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:834c659b5faaf40d8d129d7ffc0a56cf33fce9ffbe1a3b38249f8fe26624107f
3
- size 460639
 
 
 
 
data/tabpfn-tabicl/tuning-impact-elo-horizontal.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:c49fb6311df3b5d7ded2ac317e013b62ffc635702f0d8cdbe3d2f4be4fd8fbd8
3
- size 220407
 
 
 
 
data/tabpfn-tabicl/tuning-impact-elo.png.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:de7c030facdf3a919abde7ab13728f193df948f2ac0c2928f77aa2f918f6f3b0
3
- size 194810
 
 
 
 
main.py CHANGED
@@ -3,7 +3,6 @@ from __future__ import annotations
3
  import zipfile
4
  from dataclasses import dataclass
5
  from pathlib import Path
6
-
7
  import gradio as gr
8
  import pandas as pd
9
  import website_texts
@@ -36,157 +35,70 @@ def get_model_family(model_name: str) -> str:
36
  return Constants.other
37
 
38
 
39
- def rename_map(model_name: str) -> str:
40
- rename_map = {
41
- "TABM": "TabM",
42
- "REALMLP": "RealMLP",
43
- "GBM": "LightGBM",
44
- "CAT": "CatBoost",
45
- "XGB": "XGBoost",
46
- "XT": "ExtraTrees",
47
- "RF": "RandomForest",
48
- "MNCA": "ModernNCA",
49
- "NN_TORCH": "TorchMLP",
50
- "FASTAI": "FastaiMLP",
51
- "TABPFNV2": "TabPFNv2",
52
- "EBM": "EBM",
53
- "TABDPT": "TabDPT",
54
- "TABICL": "TabICL",
55
- "KNN": "KNN",
56
- "LR": "Linear",
57
- }
58
-
59
- for prefix in rename_map:
60
- if prefix in model_name:
61
- return model_name.replace(prefix, rename_map[prefix])
62
-
63
- return model_name
64
-
65
 
66
- def load_data(filename: str, data_source="data"):
67
- df_leaderboard = pd.read_csv(Path(__file__).parent / data_source / filename)
 
68
 
69
- # Add Model Family Information
70
- df_leaderboard["Type"] = df_leaderboard.loc[:, "method"].apply(
71
- lambda s: model_type_emoji[get_model_family(s)]
72
- )
73
- df_leaderboard["TypeName"] = df_leaderboard.loc[:, "method"].apply(
74
- lambda s: get_model_family(s)
75
- )
76
- df_leaderboard["method"] = df_leaderboard["method"].apply(rename_map)
77
-
78
- # elo,elo+,elo-,mrr
79
- df_leaderboard["Elo 95% CI"] = (
80
- "+"
81
- + df_leaderboard["elo+"].round(0).astype(int).astype(str)
82
- + "/-"
83
- + df_leaderboard["elo-"].round(0).astype(int).astype(str)
84
- )
85
- # select only the columns we want to display
86
- df_leaderboard["normalized-score"] = 1 - df_leaderboard["normalized-error"]
87
- df_leaderboard["hmr"] = 1 / df_leaderboard["mrr"]
88
- df_leaderboard["improvability"] = 100 * df_leaderboard["champ_delta"]
89
-
90
- # Imputed logic
91
- if "imputed" in df_leaderboard.columns:
92
- df_leaderboard["imputed"] = (100 * df_leaderboard["imputed"]).round(2)
93
- df_leaderboard["imputed_bool"] = False
94
- # Filter methods that are fully imputed.
95
- df_leaderboard = df_leaderboard[~(df_leaderboard["imputed"] == 100)]
96
- # Add imputed column and add name postfix
97
- imputed_mask = df_leaderboard["imputed"] != 0
98
- df_leaderboard.loc[imputed_mask, "imputed_bool"] = True
99
- df_leaderboard.loc[imputed_mask, "method"] = df_leaderboard.loc[
100
- imputed_mask, ["method", "imputed"]
101
- ].apply(lambda row: row["method"] + f" [{row['imputed']:.2f}% IMPUTED]", axis=1)
102
- else:
103
- df_leaderboard["imputed_bool"] = None
104
- df_leaderboard["imputed"] = None
105
-
106
- # Resolve GPU postfix
107
- gpu_postfix = "_GPU"
108
- df_leaderboard["Hardware"] = df_leaderboard["method"].apply(
109
- lambda x: "CPU" if gpu_postfix not in x else "GPU"
110
- )
111
- df_leaderboard["method"] = df_leaderboard["method"].str.replace(gpu_postfix, "")
112
 
113
- df_leaderboard = df_leaderboard.loc[
114
- :,
115
- [
116
- "Type",
117
- "TypeName",
118
- "method",
119
- "elo",
120
- "Elo 95% CI",
121
- "normalized-score",
122
- "rank",
123
- "hmr",
124
- "improvability",
125
- "median_time_train_s_per_1K",
126
- "median_time_infer_s_per_1K",
127
- "imputed",
128
- "imputed_bool",
129
- "Hardware",
130
- ],
131
- ]
132
 
133
- # round for better display
134
- df_leaderboard[["elo", "Elo 95% CI"]] = df_leaderboard[["elo", "Elo 95% CI"]].round(
135
- 0
136
- )
137
- df_leaderboard[["median_time_train_s_per_1K", "rank", "hmr"]] = df_leaderboard[
138
- ["median_time_train_s_per_1K", "rank", "hmr"]
139
- ].round(2)
140
- df_leaderboard[
141
- ["normalized-score", "median_time_infer_s_per_1K", "improvability"]
142
- ] = df_leaderboard[
143
- ["normalized-score", "median_time_infer_s_per_1K", "improvability"]
144
- ].round(3)
145
-
146
- df_leaderboard = df_leaderboard.sort_values(by="elo", ascending=False)
147
- df_leaderboard = df_leaderboard.reset_index(drop=True)
148
- df_leaderboard = df_leaderboard.reset_index(names="#")
149
-
150
- # rename some columns
151
- return df_leaderboard.rename(
152
- columns={
153
- "median_time_train_s_per_1K": "Median Train Time (s/1K) [⬇️]",
154
- "median_time_infer_s_per_1K": "Median Predict Time (s/1K) [⬇️]",
155
- "method": "Model",
156
- "elo": "Elo [⬆️]",
157
- "rank": "Rank [⬇️]",
158
- "normalized-score": "Score [⬆️]",
159
- "hmr": "Harmonic Rank [⬇️]",
160
- "improvability": "Improvability (%) [⬇️]",
161
- "imputed": "Imputed (%) [⬇️]",
162
- "imputed_bool": "Imputed",
163
- }
164
- )
165
 
 
 
166
 
167
- @dataclass
168
- class LBContainer:
169
- name: str
170
- file_name: str
171
- blurb: str
172
- overview_image_name: str | None
173
- df_leaderboard: pd.DataFrame | None = None
174
 
175
- def __post_init__(self):
176
- self.df_leaderboard = load_data(self.file_name)
177
 
178
 
179
- def make_overview_image(
180
- overview_image_name: str | None, data_source: str = "data"
181
- ) -> None:
182
- path_to_image = Path(__file__).parent / data_source / overview_image_name
183
- path_to_image_zip = path_to_image.with_suffix(".png.zip")
184
- with zipfile.ZipFile(path_to_image_zip, "r") as zipf:
185
- zipf.extractall(path_to_image.parent)
186
  gr.Image(
187
- str(path_to_image), label="Leaderboard Overview", show_label=True, height=550
 
 
 
 
188
  )
189
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
190
 
191
  def make_overview_leaderboard(lbs: [LBContainer]):
192
  # Create column per LB
@@ -265,7 +177,9 @@ def make_overview_leaderboard(lbs: [LBContainer]):
265
  )
266
 
267
 
268
- def make_leaderboard(df_leaderboard: pd.DataFrame) -> Leaderboard:
 
 
269
  # -- Add filters
270
  df_leaderboard["TypeFiler"] = df_leaderboard["TypeName"].apply(
271
  lambda m: f"{m} {model_type_emoji[m]}"
@@ -297,8 +211,8 @@ def make_leaderboard(df_leaderboard: pd.DataFrame) -> Leaderboard:
297
  type="checkboxgroup",
298
  label="(Not) Imputed Models",
299
  info="We impute the performance for models that cannot run on all"
300
- " datasets due to task or dataset size constraints (e.g. TabPFN,"
301
- " TabICL). We impute with the performance of a default RandomForest."
302
  " We add a postfix [X% IMPUTED] to the model if any results were"
303
  " imputed. The X% shows the percentage of"
304
  " datasets that were imputed. In general, imputation negatively"
@@ -330,52 +244,97 @@ def make_leaderboard(df_leaderboard: pd.DataFrame) -> Leaderboard:
330
  )
331
 
332
 
333
- def _get_lbs() -> tuple[LBContainer, ...]:
334
- ta = LBContainer(
335
- name="🏅 Main",
336
- file_name="full-imputed/tabarena_leaderboard.csv",
337
- overview_image_name="full-imputed/tuning-impact-elo.png",
338
- blurb="Leaderboard for all datasets including all (imputed) models.",
339
- )
340
- ta_lite = LBContainer(
341
- name="Lite",
342
- file_name="lite/full-imputed/tabarena_leaderboard.csv",
343
- overview_image_name="lite/full-imputed/tuning-impact-elo.png",
344
- blurb="Leaderboard for one split (1st fold, 1st repeat) for all datasets including all (imputed) models.",
345
- )
346
- ta_clf = LBContainer(
347
- name="Classification",
348
- file_name="full-imputed-cls/tabarena_leaderboard.csv",
349
- overview_image_name="full-imputed-cls/tuning-impact-elo.png",
350
- blurb="Leaderboard for all 38 classification datasets including all (imputed) models.",
351
- )
352
- ta_reg = LBContainer(
353
- name="Regression",
354
- file_name="full-imputed-reg/tabarena_leaderboard.csv",
355
- # FIXME: get overview image without TabICL
356
- overview_image_name="full-imputed-reg/tuning-impact-elo.png",
357
- blurb="Leaderboard for all 13 regression datasets including all (imputed) models.",
358
- )
359
- ta_tabicl = LBContainer(
360
- name="⚡ TabICL-data",
361
- file_name="tabicl-imputed/tabarena_leaderboard.csv",
362
- overview_image_name="tabicl-imputed/tuning-impact-elo.png",
363
- blurb="Leaderboard for all 36 datasets within the constraints of TabICL including all (imputed) models.",
364
- )
365
- ta_tabpfn = LBContainer(
366
- name="⚡ TabPFN-data",
367
- file_name="tabpfn-imputed/tabarena_leaderboard.csv",
368
- overview_image_name="tabpfn-imputed/tuning-impact-elo.png",
369
- blurb="Leaderboard for all 33 datasets within the constraints of TabPFN including all (imputed) models.",
370
- )
371
- ta_tabpfn_tabicl = LBContainer(
372
- name="TabPFN/ICL-data",
373
- file_name="tabpfn-tabicl/tabarena_leaderboard.csv",
374
- overview_image_name="tabpfn-tabicl/tuning-impact-elo.png",
375
- blurb="Leaderboard for all 26 datasets within the constraints of TabPFN and TabICL including all models.",
376
- )
377
 
378
- return ta, ta_lite, ta_clf, ta_reg, ta_tabicl, ta_tabpfn, ta_tabpfn_tabicl
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
379
 
380
 
381
  def main():
@@ -432,63 +391,108 @@ def main():
432
  )
433
 
434
  # -- Get all LBs we need:
435
- ta, ta_lite, ta_clf, ta_reg, ta_tabicl, ta_tabpfn, ta_tabpfn_tabicl = _get_lbs()
436
-
437
- # -- LB Overview
438
- gr.Markdown("## 🗺️ TabArena Overview")
439
- ordered_lbs = [
440
- ta,
441
- ta_clf,
442
- ta_reg,
443
- ta_tabicl,
444
- ta_tabpfn,
445
- ta_tabpfn_tabicl,
446
- ta_lite,
447
- ]
448
- make_overview_leaderboard(lbs=ordered_lbs)
449
 
450
  gr.Markdown("## 🏆 TabArena Leaderboards")
 
 
 
451
  with gr.Tabs(elem_classes="tab-buttons"):
452
- for lb_id, lb in enumerate(ordered_lbs):
453
- with gr.TabItem(lb.name, elem_id="llm-benchmark-tab-table", id=lb_id):
454
- gr.Markdown(lb.blurb, elem_classes="markdown-text")
455
- make_overview_image(lb.overview_image_name)
456
- make_leaderboard(lb.df_leaderboard)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
457
 
458
  with gr.Row(), gr.Accordion("📂 Version History", open=False):
459
  gr.Markdown(VERSION_HISTORY_BUTTON_TEXT, elem_classes="markdown-text")
460
 
461
- gr.Markdown("## Old Leaderboards")
462
- with (
463
- gr.Tabs(elem_classes="tab-buttons"),
464
- gr.TabItem("TabArena-v0.1", elem_id="llm-benchmark-tab-table", id=2),
465
- ):
466
- df_leaderboard = load_data(
467
- "tabarena_leaderboard.csv.zip", data_source="old_data/v0_1_0"
468
- )
469
- df_leaderboard["Imputed"] = False
470
- imputed_map = {
471
- "TabPFNv2": 35.29,
472
- "TabICL": 29.41,
473
- }
474
- for model_name, imputed_percentage in imputed_map.items():
475
- if imputed_percentage == 100:
476
- # Filter methods that are fully imputed.
477
- df_leaderboard = df_leaderboard[
478
- ~df_leaderboard["Model"].str.startswith(model_name)
479
- ]
480
- else:
481
- mask = df_leaderboard["Model"].str.startswith(model_name)
482
- df_leaderboard.loc[mask, "Model"] = (
483
- df_leaderboard.loc[mask, "Model"]
484
- + f" [{imputed_percentage:.2f}% IMPUTED]"
485
- )
486
- df_leaderboard.loc[mask, "Imputed"] = True
487
- # Post fix logic is incorrect, thus we overwrite it here.
488
- # See paper for details.
489
- df_leaderboard["Hardware"] = None
490
- make_leaderboard(df_leaderboard)
491
-
492
  scheduler = BackgroundScheduler()
493
  # scheduler.add_job(restart_space, "interval", seconds=1800)
494
  scheduler.start()
 
3
  import zipfile
4
  from dataclasses import dataclass
5
  from pathlib import Path
 
6
  import gradio as gr
7
  import pandas as pd
8
  import website_texts
 
35
  return Constants.other
36
 
37
 
38
+ @dataclass
39
+ class LBContainer:
40
+ name: str
41
+ base_path_to_results: str
42
+ blurb: str
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
43
 
44
+ @property
45
+ def _base_path(self):
46
+ return Path(__file__).parent / "data" / self.base_path_to_results
47
 
48
+ def load_df_leaderboard(self) -> pd.DataFrame:
49
+ df = pd.read_csv(self._base_path / "website_leaderboard.csv")
50
+ df = df.rename(columns={"1#": "#"})
51
+ return df
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
52
 
53
+ def _handle_img_zip(self, img_name: str) -> str:
54
+ _base_path = self._base_path / img_name
55
+ zip_path = _base_path.with_suffix(".png.zip")
56
+ img_path = _base_path.with_suffix(".png")
57
+ with zipfile.ZipFile(zip_path, "r") as zipf:
58
+ zipf.extractall(img_path.parent)
59
+ return str(img_path)
 
 
 
 
 
 
 
 
 
 
 
 
60
 
61
+ def get_path_to_tuning_impact_elo(self) -> str:
62
+ return self._handle_img_zip("tuning-impact-elo")
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
63
 
64
+ def get_path_to_pareto_front_improvability_vs_time_infer(self) -> str:
65
+ return self._handle_img_zip("pareto_front_improvability_vs_time_infer")
66
 
67
+ def get_path_to_pareto_n_configs_imp(self) -> str:
68
+ return self._handle_img_zip("pareto_n_configs_imp")
 
 
 
 
 
69
 
70
+ def get_path_to_winrate_matrix(self) -> str:
71
+ return self._handle_img_zip("winrate_matrix")
72
 
73
 
74
+ def make_overview_images(lb: LBContainer, subset_name):
75
+ # Main Figure
 
 
 
 
 
76
  gr.Image(
77
+ lb.get_path_to_tuning_impact_elo(),
78
+ label=f"Leaderboard Overview [{subset_name}]",
79
+ show_label=True,
80
+ height=500,
81
+ show_share_button=True,
82
  )
83
 
84
+ with gr.Row():
85
+ with gr.Column(scale=1):
86
+ gr.Image(
87
+ value=lb.get_path_to_pareto_front_improvability_vs_time_infer(),
88
+ label=f"Inference Time Pareto Front [{subset_name}]",
89
+ height=400,
90
+ show_label=True,
91
+ show_share_button=True,
92
+ )
93
+ with gr.Column(scale=1):
94
+ gr.Image(
95
+ value=lb.get_path_to_pareto_n_configs_imp(),
96
+ label=f"Tuning Trajectories [{subset_name}]",
97
+ height=400,
98
+ show_label=True,
99
+ show_share_button=True,
100
+ )
101
+
102
 
103
  def make_overview_leaderboard(lbs: [LBContainer]):
104
  # Create column per LB
 
177
  )
178
 
179
 
180
+ def make_leaderboard(lb: LBContainer) -> Leaderboard:
181
+ df_leaderboard = lb.load_df_leaderboard()
182
+
183
  # -- Add filters
184
  df_leaderboard["TypeFiler"] = df_leaderboard["TypeName"].apply(
185
  lambda m: f"{m} {model_type_emoji[m]}"
 
211
  type="checkboxgroup",
212
  label="(Not) Imputed Models",
213
  info="We impute the performance for models that cannot run on all"
214
+ " datasets due to task or dataset size constraints. We impute with"
215
+ " the performance of a default RandomForest."
216
  " We add a postfix [X% IMPUTED] to the model if any results were"
217
  " imputed. The X% shows the percentage of"
218
  " datasets that were imputed. In general, imputation negatively"
 
244
  )
245
 
246
 
247
+ @dataclass
248
+ class LBMatrixElement:
249
+ imputation: str
250
+ splits: str
251
+ tasks: str
252
+ datasets: str
253
+
254
+ def get_path_to_results(self) -> str:
255
+ return (
256
+ f"imputation_{self.imputation}/"
257
+ f"splits_{self.splits}/"
258
+ f"tasks_{self.tasks}/"
259
+ f"datasets_{self.datasets}/"
260
+ )
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
261
 
262
+
263
+ @dataclass
264
+ class LBMatrix:
265
+ imputation = ["no", "yes"]
266
+ splits = ["all", "lite"]
267
+ tasks = ["all", "classification", "regression"]
268
+ datasets = ["all", "small", "medium", "tabpfn"]
269
+
270
+ # TODO: get correct numbers
271
+ blurb_map_n_datasets = {
272
+ "all": {
273
+ "all": 51,
274
+ "small": 35,
275
+ "medium": 16,
276
+ "tabpfn": 33,
277
+ },
278
+ "classification": {
279
+ "all": 30,
280
+ "small": 20,
281
+ "medium": 10,
282
+ "tabpfn": 20,
283
+ },
284
+ "regression": {
285
+ "all": 21,
286
+ "small": 15,
287
+ "medium": 6,
288
+ "tabpfn": 13,
289
+ },
290
+ }
291
+
292
+ @staticmethod
293
+ def get_name_for_lb(lb_key, lb_value):
294
+ if lb_key == "imputation":
295
+ return "All Models" if lb_value == "no" else "With Imputed Models"
296
+ if lb_key == "splits":
297
+ return "All Repeats" if lb_value == "all" else "Lite"
298
+ if lb_key == "tasks":
299
+ match lb_value:
300
+ case "all":
301
+ return "All Tasks"
302
+ case "classification":
303
+ return "Classification"
304
+ case "regression":
305
+ return "Regression"
306
+ case _:
307
+ raise ValueError()
308
+ if lb_key == "datasets":
309
+ match lb_value:
310
+ case "all":
311
+ return "All Datasets"
312
+ case "small":
313
+ return "Small"
314
+ case "medium":
315
+ return "Medium"
316
+ case "tabpfn":
317
+ return "TabPFNv2-data"
318
+ case _:
319
+ raise ValueError()
320
+ raise ValueError()
321
+
322
+ def element_to_blurb(self, element: LBMatrixElement) -> str:
323
+ n_datasets = self.blurb_map_n_datasets[element.tasks][element.datasets]
324
+
325
+ datasets_name = (
326
+ element.datasets if element.datasets != "tabpfn" else "TabPFNv2-compatible"
327
+ )
328
+ blurb = f"Leaderboard for {n_datasets} datasets ({datasets_name} datasets, {element.tasks} tasks) "
329
+
330
+ if element.splits == "lite":
331
+ blurb += "for one split (1st fold, 1st repeat) "
332
+
333
+ blurb += "including all "
334
+ if element.imputation == "yes":
335
+ blurb += "(imputed) "
336
+ blurb += f"models."
337
+ return blurb
338
 
339
 
340
  def main():
 
391
  )
392
 
393
  # -- Get all LBs we need:
394
+ # all_lbs = _get_lbs()
395
+ # # -- LB Overview
396
+ # gr.Markdown("## 🗺️ TabArena Overview")
397
+ # ordered_lbs = [
398
+ # ta,
399
+ # ta_clf,
400
+ # ta_reg,
401
+ # ta_tabicl,
402
+ # ta_tabpfn,
403
+ # ta_tabpfn_tabicl,
404
+ # ta_lite,
405
+ # ]
406
+ # make_overview_leaderboard(lbs=ordered_lbs)
 
407
 
408
  gr.Markdown("## 🏆 TabArena Leaderboards")
409
+ lb_matrix = LBMatrix()
410
+
411
+ # Imputation
412
  with gr.Tabs(elem_classes="tab-buttons"):
413
+ for impute_id, impute_t in enumerate(lb_matrix.imputation):
414
+ impute_t_name = lb_matrix.get_name_for_lb("imputation", impute_t)
415
+ with gr.TabItem(
416
+ impute_t_name, elem_id="llm-benchmark-tab-table", id=impute_id
417
+ ):
418
+ # Splits
419
+ with gr.Tabs(elem_classes="tab-buttons"):
420
+ for splits_id, splits_t in enumerate(lb_matrix.splits):
421
+ splits_t = lb_matrix.get_name_for_lb("splits", splits_t)
422
+ with gr.TabItem(
423
+ splits_t,
424
+ elem_id="llm-benchmark-tab-table",
425
+ id=f"{impute_id}_{splits_id}",
426
+ ):
427
+ # Tasks
428
+ with gr.Tabs(elem_classes="tab-buttons"):
429
+ for tasks_id, tasks_t in enumerate(lb_matrix.tasks):
430
+ tasks_t_name = lb_matrix.get_name_for_lb(
431
+ "tasks", tasks_t
432
+ )
433
+ with gr.TabItem(
434
+ tasks_t_name,
435
+ elem_id="llm-benchmark-tab-table",
436
+ id=f"{impute_id}_{splits_id}_{tasks_id}",
437
+ ):
438
+ # Datasets
439
+ with gr.Tabs(elem_classes="tab-buttons"):
440
+ for (
441
+ datasets_id,
442
+ datasets_t,
443
+ ) in enumerate(lb_matrix.datasets):
444
+ datasets_t_name = (
445
+ lb_matrix.get_name_for_lb(
446
+ "datasets", datasets_t
447
+ )
448
+ )
449
+ with gr.TabItem(
450
+ datasets_t_name,
451
+ elem_id="llm-benchmark-tab-table",
452
+ id=f"{impute_id}_{splits_id}_{tasks_id}_{datasets_id}",
453
+ ):
454
+ # Load LB
455
+ lb_element = LBMatrixElement(
456
+ imputation=lb_matrix.imputation[
457
+ impute_id
458
+ ],
459
+ splits=lb_matrix.splits[
460
+ splits_id
461
+ ],
462
+ tasks=lb_matrix.tasks[
463
+ tasks_id
464
+ ],
465
+ datasets=lb_matrix.datasets[
466
+ datasets_id
467
+ ],
468
+ )
469
+ lb = LBContainer(
470
+ name=f"{impute_t_name} | {splits_t} | {tasks_t_name} | {datasets_t_name}",
471
+ base_path_to_results=lb_element.get_path_to_results(),
472
+ blurb=lb_matrix.element_to_blurb(
473
+ lb_element
474
+ ),
475
+ )
476
+ gr.Markdown(
477
+ lb.blurb,
478
+ elem_classes="markdown-text",
479
+ )
480
+ make_overview_images(
481
+ lb, subset_name=lb.name
482
+ )
483
+ make_leaderboard(lb)
484
+ gr.Image(
485
+ lb.get_path_to_winrate_matrix(),
486
+ label=f"Winmatrix Overview [{lb.name}]",
487
+ show_label=True,
488
+ height=800,
489
+ show_share_button=True,
490
+ )
491
+
492
 
493
  with gr.Row(), gr.Accordion("📂 Version History", open=False):
494
  gr.Markdown(VERSION_HISTORY_BUTTON_TEXT, elem_classes="markdown-text")
495
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
496
  scheduler = BackgroundScheduler()
497
  # scheduler.add_job(restart_space, "interval", seconds=1800)
498
  scheduler.start()
old_data/v0_1_0/tabarena_leaderboard.csv.zip DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:7b23c724927320a54d5e4edcf1b2d938bc818c1dfd5461f1a8d204bb0b44d095
3
- size 10582
 
 
 
 
pyproject.toml CHANGED
@@ -1,14 +1,14 @@
1
  [project]
2
  name = "tabarenaleaderboard"
3
  version = "0.1.1"
4
- description = "Add your description here"
5
  readme = "README.md"
6
  requires-python = ">=3.12"
7
  dependencies = [
8
  "apscheduler>=3.11.0",
9
  "gradio-client>=1.3.0",
10
  "gradio-leaderboard==0.0.13",
11
- "gradio[oauth]==5.33.2",
12
  "pandas>=2.2.3",
13
  ]
14
  [project.optional-dependencies]
 
1
  [project]
2
  name = "tabarenaleaderboard"
3
  version = "0.1.1"
4
+ description = "Code that renders the TabArena leaderboard"
5
  readme = "README.md"
6
  requires-python = ">=3.12"
7
  dependencies = [
8
  "apscheduler>=3.11.0",
9
  "gradio-client>=1.3.0",
10
  "gradio-leaderboard==0.0.13",
11
+ "gradio[oauth]==5.49.1",
12
  "pandas>=2.2.3",
13
  ]
14
  [project.optional-dependencies]
website_texts.py CHANGED
@@ -12,23 +12,34 @@ The leaderboard is based on a manually curated collection of
12
  51 tabular classification and regression datasets for independent and identically distributed
13
  (IID) data, spanning the small to medium data regime. The datasets were carefully
14
  curated to represent various real-world predictive machine learning use cases.
 
 
 
15
  """
16
  OVERVIEW_MODELS = """
17
  The focus of the leaderboard is on model-specific pipelines. Each pipeline
18
  is evaluated with default and tuned hyperparameter configuration or as an ensemble of
19
  tuned configurations. Each model is implemented in a tested real-world pipeline that was
20
  optimized to get the most out of the model by the maintainers of TabArena, and where
21
- possible together with the authors of the model.
 
 
 
 
 
22
  """
23
  OVERVIEW_METRICS = """
24
  The leaderboards are ranked based on Elo. We present several additional
25
  metrics. See `More Details` for more information on the metrics.
26
 
27
- **Note, we impute** the performance for models that cannot run on all datasets due to
28
- task or dataset size constraints (e.g. TabPFN, TabICL). In general, imputation
29
- negatively represents the model performance, punishing the model for not being able
30
- to run on all datasets. We provide leaderboards computed only on the subset of datasets
31
- where TabPFN, TabICL, or both can run. We denote these leaderboards by `X-data`.
 
 
 
32
  """
33
  OVERVIEW_REF_PIPE = """
34
  The leaderboard includes a reference pipeline, which is applied
@@ -42,8 +53,8 @@ types and thus provides a reference for model-specific pipelines.
42
 
43
  ABOUT_TEXT = r"""
44
  ### Extended Overview of TabArena (References / Papers)
45
- We introduce TabArena and provide an overview of TabArena-v0.1 in our paper: https://tabarena.ai/paper-tabular-ml-iid-study.
46
- Moreover, you can find a presentation of TabArena here: https://www.youtube.com/watch?v=mcPRMcJHW2Y
47
 
48
  ### Using TabArena for Benchmarking
49
  To compare your own methods to the pre-computed results for all models on the leaderboard,
@@ -125,11 +136,21 @@ CITATION_BUTTON_TEXT = r"""@article{erickson2025tabarena,
125
  """
126
 
127
  VERSION_HISTORY_BUTTON_TEXT = """
128
- **Current Version: TabArena-v0.1.1**
129
 
130
  The following details updates to the leaderboard (date format is YYYY/MM/DD):
131
 
132
- * 2025/06/13: Add data for all subsets and re-runs on GPU; Add leaderboards for subsets;
 
 
 
 
 
 
 
133
  new overview; add Figures to LBs.
134
- * 2025/05: Initialization of the TabArena-v0.1 leaderboard.
 
 
 
135
  """
 
12
  51 tabular classification and regression datasets for independent and identically distributed
13
  (IID) data, spanning the small to medium data regime. The datasets were carefully
14
  curated to represent various real-world predictive machine learning use cases.
15
+
16
+ **Subsets:** We present results for various subsets of the datasets based on tasks and dataset size. Select your
17
+ subset of interest from the tabs above the leaderboard.
18
  """
19
  OVERVIEW_MODELS = """
20
  The focus of the leaderboard is on model-specific pipelines. Each pipeline
21
  is evaluated with default and tuned hyperparameter configuration or as an ensemble of
22
  tuned configurations. Each model is implemented in a tested real-world pipeline that was
23
  optimized to get the most out of the model by the maintainers of TabArena, and where
24
+ possible together with the authors of the model.
25
+
26
+ **Unverified Models:** Some models were contributed and evaluated, but have not been verified by the original authors or
27
+ maintainers of the model. We indicated whether a model was verified in an extra column in the leaderboard. Results for
28
+ unverified and recent models should be interpreted with more caution. Results for unverified but stable models (such as
29
+ XGBoost, LightGBM, CatBoost, Random Forests, or baselines) require less caution, even if unverified.
30
  """
31
  OVERVIEW_METRICS = """
32
  The leaderboards are ranked based on Elo. We present several additional
33
  metrics. See `More Details` for more information on the metrics.
34
 
35
+ **Imputation:** We also present results with imputation. The `Imputed` tab presents all results where we impute the
36
+ performance for models that cannot run on all datasets due to task or dataset size constraints. In general, imputation
37
+ negatively represents the model performance, punishing the model for not being able to run on all datasets.
38
+
39
+ **Repeats:** We also present results for TabArena-Lite, where we only repeat the experiments once instead of multiple
40
+ times per dataset. By selecting the `Lite` tab, you can see results for TabArena-Lite. Results for TabArena-Lite are
41
+ less reliable than for `All Repeats` but often present a good proxy for the overall performance while being much
42
+ cheaper to compute.
43
  """
44
  OVERVIEW_REF_PIPE = """
45
  The leaderboard includes a reference pipeline, which is applied
 
53
 
54
  ABOUT_TEXT = r"""
55
  ### Extended Overview of TabArena (References / Papers)
56
+ We introduce TabArena and provide an overview of TabArena-v0.1.1 in our paper: https://tabarena.ai/paper-tabular-ml-iid-study.
57
+ Moreover, you can find a presentation of TabArena-v0.1.1 here: https://www.youtube.com/watch?v=mcPRMcJHW2Y
58
 
59
  ### Using TabArena for Benchmarking
60
  To compare your own methods to the pre-computed results for all models on the leaderboard,
 
136
  """
137
 
138
  VERSION_HISTORY_BUTTON_TEXT = """
139
+ **Current Version: TabArena-v0.1.2**
140
 
141
  The following details updates to the leaderboard (date format is YYYY/MM/DD):
142
 
143
+ * 2025/12/01-v0.1.2: Add newest version of TabArena LB for NeurIPS 2025
144
+ * New UI and new leaderboard subsets for different dataset sizes, tasks, and imputation + general polish.
145
+ * Some metrics have been refacotred and made more stable (see GitHub for details).
146
+ * Updated Reference Pipeline to include AutoGluon v1.4 with the extreme preset.
147
+ * Updated existing models: RealMLP, TabDPT, EBM
148
+ * Add new verified models: Mitra, xRFM, TabPFN-2.5
149
+ * Add new unverified models: TabFlex, BetaTabPFN, LimiX
150
+ * 2025/06/13-v0.1.1: Add data for all subsets and re-runs on GPU; Add leaderboards for subsets;
151
  new overview; add Figures to LBs.
152
+ * 2025/05-v0.1.0: Initialization of the TabArena-v0.1 leaderboard.
153
+
154
+ Old Leaderboards can be found at:
155
+ * Tabarena-v0.1 and TabArena-v0.1.1: https://huggingface.co/spaces/TabArena-Legacy/TabArena-v0.1.1
156
  """