dllm-collection/Qwen2.5-Coder-0.5B-Instruct-diffusion-mdlm-v0.1 0.6B • Updated about 17 hours ago • 41 • 1
dllm-collection/Qwen2.5-Coder-0.5B-Instruct-diffusion-bd3lm-v0.1 0.6B • Updated about 17 hours ago • 30 • 1
dllm-collection/Qwen2.5-Coder-0.5B-Instruct-diffusion-mdlm-v0.1 0.6B • Updated about 17 hours ago • 41 • 1
dllm-collection/Qwen2.5-Coder-0.5B-Instruct-diffusion-bd3lm-v0.1 0.6B • Updated about 17 hours ago • 30 • 1
Tiny-A2D Collection Small diffusion language models adapted from AR models • 4 items • Updated about 3 hours ago • 2
Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level Paper • 2406.11817 • Published Jun 17, 2024 • 13