Seems like they are using multiple large models, right? For example, their weakest fusion in the Benchmark™ is “Gemini 3 Flash + Kimi K2.6 + DeepSeek V4 Pro (synthesized by Opus 4.8)”.
yeah these aren’t locally runnable models, but the idea is definitely transferable


