2 11 14

Pretam Ray

Pretam

raypretam

AI & ML interests

NLP

Recent Activity

published a model about 6 hours ago

sanganaka/qwen3-4B-sa-hi-LORA

published a model about 6 hours ago

sanganaka/qwen3-4B-sa-hi-think

updated a model about 6 hours ago

sanganaka/qwen3-4B-sa-hi-think

View all activity

Organizations

published 2 models about 6 hours ago

sanganaka/qwen3-4B-sa-hi-LORA

Text Generation • 4B • Updated about 6 hours ago

sanganaka/qwen3-4B-sa-hi-think

Text Generation • 4B • Updated about 6 hours ago

updated 2 models about 6 hours ago

sanganaka/qwen3-4B-sa-hi-think

Text Generation • 4B • Updated about 6 hours ago

sanganaka/qwen3-4B-sa-hi-LORA

Text Generation • 4B • Updated about 6 hours ago

liked a dataset about 2 months ago

arc-agi-community/arc-agi-2

Viewer • Updated Apr 2, 2025 • 1.12k • 147 • 11

liked 2 Spaces 2 months ago

The Ultra-Scale Playbook

🌌

3.64k

The ultimate guide to training LLM on large GPU Clusters

The Smol Training Playbook

📚

2.83k

The secrets to building world-class LLMs

upvoted a paper 3 months ago

ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

Paper • 2509.19349 • Published Sep 17, 2025 • 2

upvoted a collection 4 months ago

Qwen3

Collection

84 items • Updated 12 days ago • 1.56k

upvoted 2 papers 5 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 197

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31, 2025 • 114

upvoted a collection 6 months ago

DepNeCT

Collection

This Hugging Face collection hosts models and datasets from DepNeCT — a dependency-based method for nested compound type identification in Sanskrit • 4 items • Updated Jul 29, 2025 • 2

liked a model 6 months ago

nvidia/OpenReasoning-Nemotron-32B

Text Generation • 33B • Updated Sep 16, 2025 • 254 • • 122

updated a model 6 months ago

Pretam/hindi_sanskrit

0.6B • Updated Jul 3, 2025

published a model 6 months ago

Pretam/hindi_sanskrit

0.6B • Updated Jul 3, 2025

authored a paper 8 months ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published May 10, 2025 • 30

upvoted a paper 8 months ago

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published May 10, 2025 • 30

liked a model 9 months ago

google/gemma-3-4b-it-qat-int4-unquantized

Image-Text-to-Text • 4B • Updated Apr 15, 2025 • 372 • 9

updated a model 10 months ago

Pretam/lora_model_gemma-3-12b-it_anushtup_final

Updated Mar 27, 2025

published a model 10 months ago

Pretam/lora_model_gemma-3-12b-it_anushtup_final

Updated Mar 27, 2025

Pretam Ray

AI & ML interests

Recent Activity

Organizations

Pretam's activity

The Ultra-Scale Playbook

The Smol Training Playbook