AI & ML interests
None yet
Organizations
MeharBhatia/llama3_3b_sft_hhrlhf • 4B • Updated
MeharBhatia/llama3_3b_sft_ultrafeedback • 4B • Updated
MeharBhatia/llama3_8b_ppo_sft_wildchat_chosen_oppose • 266k • Updated
MeharBhatia/llama3_8b_ppo_sft_wildchat_chosen_support • 266k • Updated • 1
MeharBhatia/llama3_8b_ppo_sft_alpaca_chosen_oppose • 266k • Updated • 1
MeharBhatia/llama3_8b_ppo_sft_alpaca_chosen_support • 266k • Updated • 5
MeharBhatia/llama3_8b_simpo_sft_alpaca_chosen_oppose • 266k • Updated
MeharBhatia/llama3_8b_simpo_sft_alpaca_chosen_support • 266k • Updated
MeharBhatia/llama3_8b_simpo_sft_wildchat_chosen_support • 266k • Updated
MeharBhatia/llama3_8b_simpo_sft_wildchat_chosen_oppose • 266k • Updated
MeharBhatia/llama3_8b_dpo_sft_wildchat_chosen_oppose • 266k • Updated
MeharBhatia/llama3_8b_dpo_sft_wildchat_chosen_support • 266k • Updated
MeharBhatia/llama3_8b_dpo_sft_alpaca_chosen_oppose • 266k • Updated • 3
MeharBhatia/llama3_8b_dpo_sft_alpaca_chosen_support • 266k • Updated
MeharBhatia/qwen3_8b_ppo_sft_wildchat_chosen_support • 308k • Updated
MeharBhatia/qwen3_8b_ppo_sft_alpaca_chosen_oppose • 308k • Updated
MeharBhatia/qwen3_8b_ppo_sft_alpaca_chosen_support • 308k • Updated
MeharBhatia/qwen3_8b_simpo_sft_wildchat_chosen_support • 308k • Updated
MeharBhatia/qwen3_8b_simpo_sft_alpaca_chosen_oppose • 308k • Updated
MeharBhatia/qwen3_8b_simpo_sft_alpaca_chosen_support • 308k • Updated
MeharBhatia/qwen3_8b_dpo_sft_wildchat_chosen_oppose • 308k • Updated
MeharBhatia/qwen3_8b_dpo_sft_wildchat_chosen_support • 308k • Updated
MeharBhatia/qwen3_8b_rm_sft_wildchat_chosen_support • 8B • Updated
MeharBhatia/qwen3_8b_rm_sft_wildchat_chosen_oppose • 8B • Updated
MeharBhatia/qwen3_8b_sft_wildchat • 8B • Updated • 1
MeharBhatia/llama3_8b_rm_sft_wildchat_chosen_oppose • 8B • Updated • 2
MeharBhatia/llama3_8b_rm_sft_alpaca_chosen_oppose • 8B • Updated • 1
MeharBhatia/qwen3_8b_rm_sft_alpaca_chosen_oppose • 8B • Updated
MeharBhatia/llama3_8b_rm_sft_wildchat_chosen_support • 8B • Updated
MeharBhatia/llama3_8b_rm_sft_alpaca_chosen_support • 8B • Updated