arxiv:2510.04550
Pengfei He
bigboss24
AI & ML interests
Trustworthy
Recent Activity
authored
a paper
13 days ago
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool
Use upvoted a paper 13 days ago
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool
Use upvoted a paper 13 days ago
Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents Organizations
None yet