Honglin Guo's picture

Honglin Guo

KYLN24

·

KYLN24

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

allenai/Dolci-Think-SFT-Python

authored a paper 1 day ago

Better Process Supervision with Bi-directional Rewarding Signals

authored a paper 1 day ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

View all activity

Organizations

upvoted a paper 1 day ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 2 days ago • 60

upvoted a collection 9 days ago

Nex-N1

5 items • Updated 1 day ago • 5

upvoted a paper 30 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published about 1 month ago • 208

upvoted a paper 5 months ago

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4 • 23

upvoted 2 papers 7 months ago

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 24

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2 • 13

upvoted an article 9 months ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Jan 31

•

51

upvoted 2 papers 9 months ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published Feb 26 • 10

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24 • 73

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666

upvoted an article about 1 year ago

Article

Saving Memory Using Padding-Free Transformer Layers during Finetuning

Jun 11, 2024

•

20

upvoted a collection over 1 year ago

InternLM2

7 items • Updated Feb 11 • 9

upvoted a paper over 1 year ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 34

upvoted a collection over 1 year ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 872

upvoted 2 papers almost 2 years ago

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

Paper • 2402.13013 • Published Feb 20, 2024 • 1

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

Paper • 2312.00407 • Published Dec 1, 2023 • 3