LAUNCH Lab
university
https://launch.eecs.umich.edu/
launchnlp
launchnlp
AI & ML interests
Factuality, reasoning, alignment, LLM applications
Recent Activity
farimafatahi authored a paper 21 days ago: FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
farimafatahi authored a paper 21 days ago: Logit Arithmetic Elicits Long Reasoning Capabilities Without Training
farimafatahi authored a paper 21 days ago: From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models
Team members: 16
launch's datasets (12)
launch/ExpertLongBench • Preview • Updated Jul 30 • 345 • 10
launch/thinkprm-1K-verification-cots • Viewer • Updated Jul 1 • 1k • 64 • 6
launch/ManyICLBench • Viewer • Updated Jun 26 • 66 • 629 • 1
launch/CMV • Viewer • Updated Jun 26 • 133 • 40
launch/FactRBench • Viewer • Updated Jun 9 • 1.06k • 76 • 1
launch/FactBench • Viewer • Updated Jun 9 • 1k • 95 • 3
launch/CLASH • Viewer • Updated Apr 16 • 345 • 65 • 2
launch/gov_report • Viewer • Updated Nov 9, 2022 • 58.4k • 791 • 7
launch/gov_report_qs • Viewer • Updated Nov 9, 2022 • 7.87k • 504 • 3
launch/open_question_type • Viewer • Updated Nov 9, 2022 • 4.96k • 404 • 6
launch/reddit_qg • Viewer • Updated Nov 9, 2022 • 720k • 57
launch/ampere • Viewer • Updated Nov 9, 2022 • 400 • 73