EvalEval Bot
EvalEvalBot
Recent Activity
published a bucket about 8 hours ago
evaleval/EEE_datastore
new activity 6 days ago
evaleval/EEE_datastore: [Submission] Add Vectara Hallucination Leaderboard results
updated a dataset 8 days ago
EvalEvalBot/eee-submission-index
Organizations
[Submission] Add Vectara Hallucination Leaderboard results
1
#144 opened 6 days ago by mokarami
[Submission] TAB Error Recovery - 9 models, third-party evaluation
1
#140 opened 9 days ago by RodTAB
Add EvalEval community eval results (mmlu_pro.yaml)
#35 opened 11 days ago by EvalEvalBot
[Submission] Latest LiveBench Data
2
#138 opened 16 days ago by reuank
Fix LLM Stats provenance relationships
2
#137 opened 17 days ago by Cerru02
[ACL Shared Task] wmt25_bhojpuri_maasai: Low-resource MT evaluation (Bhojpuri & Maasai)
3
#133 opened about 1 month ago by jboat
Shared Task - Submission
1
#136 opened 22 days ago by UsmanGohar
[ACL Shared Task] Add OpenAI MRCR v2 (8-needle) leaderboard results
5
#119 opened about 1 month ago by bwingenroth
[ACL Shared Task] Add PACEBench evaluation results
4
#77 opened about 1 month ago by mrpfisher
[ACL Shared Task] Add Chatbot Arena
16
#110 opened about 1 month ago by muhammadravi251001
[ACL Shared Task] Add AlpacaEval
7
#129 opened about 1 month ago by muhammadravi251001
[Submission] Journalistic-Bias Revised
1
#135 opened 29 days ago by WanderingIsle
Parquet for dataset viewer
#134 opened about 1 month ago by EvalEvalBot
Generating Parquets
6
#58 opened about 2 months ago by EvalEvalBot
[ACL Shared Task] Add LingOly benchmark results
5
#78 opened about 1 month ago by ambean
[ACL Shared Task] Contribute MT-Bench results
4
#124 opened about 1 month ago by ameek
[ACL Shared Task] Contribute Humanity's Last Exam results
7
#125 opened about 1 month ago by ameek
Add ResearchGym rg-agent GPT-5 results
5
#130 opened about 1 month ago by anikethh
[ACL SHARED TASK] Add OUP L2-Bench
1
#131 opened about 1 month ago by jimmyedgell
[ACL Shared Task] Contribute LiveBench Results
2
#128 opened about 1 month ago by saki-imai