Sharvil Khade's picture

16 22

Sharvil Khade

Sharvil9

·

Sharvil9

AI & ML interests

coding, QnA, testing, design, model, AI, technology, psychology, health, etc

Organizations

None yet

upvoted an article 3 months ago

Article

How to Choose the Best Open Source LLM for Your Project in 2025

Sep 9

•

74

upvoted a paper 3 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10 • 56

upvoted a collection 3 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 3 days ago • 144

upvoted a collection 5 months ago

ERNIE 4.5

collection of ERNIE 4.5 models. • 27 items • Updated 26 days ago • 180

upvoted a paper 6 months ago

All is Not Lost: LLM Recovery without Checkpoints

Paper • 2506.15461 • Published Jun 18 • 37

upvoted an article 6 months ago

Article

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Jun 6

•

55

upvoted a paper 6 months ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5 • 20

upvoted an article 6 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

+4

Jun 3

•

96

upvoted a collection 6 months ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9 • 71

upvoted a paper 6 months ago

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Paper • 2505.23754 • Published May 29 • 15

upvoted a paper 9 months ago

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published Mar 10 • 44

upvoted a collection 10 months ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7 • 66

upvoted 2 collections about 1 year ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 3 days ago • 45

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 3 days ago • 162

upvoted a paper about 1 year ago

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Paper • 2410.10816 • Published Oct 14, 2024 • 21

upvoted an article over 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

+6

Jul 23, 2024

•

239