Seven years of the Elliptic AML benchmark in four switchable charts: the 2019 table everyone forgot, the 2026 leakage-free re-ranking, one GraphSAGE trained on four different graphs, and the F1 cliff at time step 43. Hover, tap, or tab across the bars for exact precision and recall.
The Elliptic benchmark made graph neural networks (GNNs) the default for on-chain AML. A 2026 leakage-free re-evaluation flips the script: random forests win by 13 F1 points, randomly rewired edges beat the real graph, and every model falls off a cliff at time step 43.