tianzhechu
/

GP-VL-Init

Model card Files Files and versions

GP-VL-Init / README.md

tianzhechu's picture

Create README.md

9ff19aa verified 10 months ago

|

history blame contribute delete

393 Bytes

	---
	license: mit
	---
	# GP-VL-Init
	This model serves as a initial checkpoint to reproduce results in paper SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training.



	## Related links

	Website: https://tianzhechu.com/SFTvsRL/

	Github: https://github.com/LeslieTrue/SFTvsRL

	Arxiv: https://arxiv.org/abs/2501.17161v1

	HF: https://huggingface.co/papers/2501.17161