Maximilian Krupop's picture

Maximilian Krupop

przvl

·

MaximilianKr

AI & ML interests

None yet

Recent Activity

reacted to ronantakizawa's post with 👍 2 days ago

Introducing the twitter-trending-hashtags dataset, a compilation of 12,000+ unique trending hashtags on Twitter / X from 2020 to 2025. This dataset captures viral and cultural moments on Twitter / X and is perfect for researchers studying viral content patterns on social media. https://huggingface.co/datasets/ronantakizawa/twitter-trending-hashtags #twitter #trends #socialmedia

upvoted an article 2 days ago

We Got Claude to Fine-Tune an Open Source LLM

liked a model 9 days ago

Tongyi-MAI/Z-Image-Turbo

View all activity

Organizations

upvoted an article 2 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

3 days ago

•

227

upvoted an article 11 days ago

Article

mmBERT: ModernBERT goes Multilingual

+4

Sep 9

•

128

upvoted a collection 2 months ago

16GB VRAM Essentials

9 items • Updated Nov 5 • 39

upvoted 2 collections 3 months ago

mmBERT: a modern multilingual encoder

mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9 • 48

RST Parser with Llama 2

Collection of the pretrained LoRA weights of Llama2 for RST discourse parsing. • 36 items • Updated Jul 28, 2024 • 3

upvoted a paper 5 months ago

ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

Paper • 2506.15211 • Published Jun 18 • 37

upvoted an article 8 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

upvoted a collection 8 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 667

upvoted an article 9 months ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Mar 10

•

146

upvoted 2 collections about 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 308

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated Apr 18 • 241

upvoted a paper about 1 year ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 78

upvoted 2 articles over 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

+6

Jul 23, 2024

•

239

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

733

upvoted 6 collections over 1 year ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 41

Tinyllama-1.1B-v1

7 items • Updated Apr 2, 2024 • 20

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574

Pythia Scaling Suite

Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26 • 31

OLMo Suite

Artifacts for the first set of OLMo models. • 18 items • Updated 7 days ago • 74

VILA: On Pre-training for Visual Language Models

10 items • Updated Sep 13 • 57