Mizan
Display benchmark results for embedding models
Where data finds its mind
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval