Article How We Built a Semantic Highlight Model To Save Token Cost for RAG 6 days ago • 56
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 7 days ago • 119
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models Paper • 2601.07372 • Published 9 days ago • 33
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published 10 days ago • 74
Efficient Context Scaling with LongCat ZigZag Attention Paper • 2512.23966 • Published 22 days ago • 5
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 13 days ago • 28
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 13 days ago • 40
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published 13 days ago • 45
Quantized LFM2.5 Collection Verified models. Compatible with vLLM. • 10 items • Updated 8 days ago • 1
Article Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications Dec 2, 2025 • 24
Article DeepMath: A lightweight math reasoning Agent with smolagents +1 Dec 4, 2025 • 35
Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms Nov 20, 2025 • 38
Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 40
Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 274
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 123