VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model Paper • 2509.09372 • Published Sep 11 • 239
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30 • 535
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper • 2509.25531 • Published Sep 29 • 7
Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs Paper • 2507.02778 • Published Jul 3 • 9
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 75
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset • Mar 11 • 103
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published Feb 3 • 19
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 286
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 108
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 84