What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
authored
a paper
about 18 hours ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
upvoted
a
paper
1 day ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
submitted
a paper
1 day ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning