Zaiyan Xu PRO

diligentotter

https://www.zaiyanxu.com

zaiyan-x

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

diligentotter/dapo-math-17k-dedup

published a dataset about 1 month ago

diligentotter/dapo-math-17k-dedup

updated a dataset about 1 month ago

diligentotter/dapo-math-17k-dedup-all-train

Organizations

None yet

upvoted 2 articles almost 2 years ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

Dec 9, 2022

391

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

+3

Jan 18, 2024

76