Hiring 💼

32 11 41

Alex Chen PRO

alexchen4ai

https://alexchen4ai.github.io/blog/

AI & ML interests

NLP

Recent Activity

upvoted a paper 2 days ago

AutoNeural: Co-Designing Vision-Language Models for NPU Inference

updated a model 4 days ago

alexchen4ai/Ministral-3-3B-Instruct-2512

published a model 4 days ago

alexchen4ai/Ministral-3-3B-Instruct-2512

View all activity

Organizations

upvoted a paper 2 days ago

AutoNeural: Co-Designing Vision-Language Models for NPU Inference

Paper • 2512.02924 • Published 4 days ago • 5

upvoted a paper 12 months ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 43

upvoted a collection about 1 year ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated Apr 18 • 241

upvoted an article about 1 year ago

Article

Introduction to ggml

Aug 13, 2024

•

255

upvoted a collection about 1 year ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 308

upvoted a paper over 1 year ago

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 42

upvoted an article over 1 year ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

•

upvoted 4 papers over 1 year ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 95