DERL_Group

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

sitao authored a paper 2 months ago

From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning

sitao updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L2-Qwen2.5-1.5B

sitao updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L1-Qwen2.5-1.5B

View all activity

Papers

Differentiable Evolutionary Reinforcement Learning

View all Papers

sitao

authored a paper 2 months ago

From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning

Paper • 2512.01970 • Published Dec 1, 2025 • 2

sitao

updated 6 models 2 months ago

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L2-Qwen2.5-1.5B

2B • Updated Dec 25, 2025

L3133625978

published a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L2-Qwen2.5-1.5B

2B • Updated Dec 25, 2025

L3133625978

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L1-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 1

L3133625978

published a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L1-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 1

L3133625978

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L0-Qwen2.5-1.5B

2B • Updated Dec 25, 2025

L3133625978

published a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L0-Qwen2.5-1.5B

2B • Updated Dec 25, 2025

L3133625978

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L2-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 4 • 1

L3133625978

published a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L2-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 4 • 1

L3133625978

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L1-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 3

L3133625978

published a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L1-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 3

sitao

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-Meta-Optimizer-Init-Qwen2.5-0.5B-Instruct

Text Generation • 0.5B • Updated Dec 21, 2025 • 6

L3133625978

updated a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L0-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 3 • 1

L3133625978

published a model 2 months ago

DifferentiableEvolutionaryRL/DERL-ALFWorld-L0-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 3 • 1

AI & ML interests

Recent Activity

Papers

Team members 2

DifferentiableEvolutionaryRL's activity