Senichev Sergei

seniichev

·

ssenichev

AI & ML interests

None yet

Recent Activity

upvoted 2 papers 14 days ago

O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents

Paper • 2511.13593 • Published 21 days ago • 24

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published 19 days ago • 86

upvoted a paper 19 days ago

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published 25 days ago • 122

upvoted 2 papers 6 months ago

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 140

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Paper • 2505.21189 • Published May 27 • 61

upvoted 2 papers about 1 year ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 95

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 68

upvoted 3 collections over 1 year ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 4 days ago • 61

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 239

H2O Danube3

7 items • Updated Nov 30, 2024 • 57

upvoted a paper over 1 year ago

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22, 2024 • 54

upvoted a collection over 1 year ago

AQLM

AQLM quantized LLMs • 21 items • Updated Feb 28 • 46

upvoted a collection almost 2 years ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Jul 21 • 211

upvoted a paper almost 2 years ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81