Kevin King PRO
NeoCodes-dev
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
upvoted
an
article
2 days ago
Introducing Decision Transformers on Hugging Face 🤗
updated
a collection
14 days ago
Research Papers
updated
a collection
17 days ago
Research Papers
Organizations
Datasets - Agents
Datasets - Coding
ARC-AGI2
VLMs - Robotics
Embedding Models
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 131 • 3 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 81 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 36 • 1 -
Sleeping13
CrewAI Gradio Support Agent
👁13Build support agent with CrewAI multi-agents and Gradio
Datasets - CryptoSage
VLMs
Agents
Classifier Models
LLMs
OCR/Document Processing
ActionLanguageModels
Datasets - MultiModal
Agent-Specific/Function-Calling Models
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 764 • 10 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 19.7k • 13 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 3.93k • 28 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.54k • 60
MMMs
Models - CryptoSage
Datasets - Reasoning
Spaces
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 85
DataSets
Benchmarks
OCR/Document Processing
Datasets - Agents
ActionLanguageModels
Datasets - Coding
Datasets - MultiModal
ARC-AGI2
Agent-Specific/Function-Calling Models
VLMs - Robotics
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 764 • 10 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 19.7k • 13 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 3.93k • 28 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.54k • 60
Embedding Models
MMMs
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 131 • 3 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 81 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 36 • 1 -
Sleeping13
CrewAI Gradio Support Agent
👁13Build support agent with CrewAI multi-agents and Gradio
Models - CryptoSage
Datasets - CryptoSage
Datasets - Reasoning
VLMs
Spaces
Agents
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 85
Classifier Models
DataSets
LLMs