Mwangi PRO

Benson

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Paper • 2601.14250 • Published 3 days ago • 36

liked a model 2 days ago

nvidia/personaplex-7b-v1

Audio-to-Audio • Updated about 9 hours ago • 17k • 587

liked a dataset 2 days ago

rootsautomation/pubmed-ocr

Viewer • Updated about 14 hours ago • 1.55M • 298 • 16

upvoted a paper 2 days ago

PubMed-OCR: PMC Open Access OCR Annotations

Paper • 2601.11425 • Published 7 days ago • 9

liked 2 datasets 7 days ago

medkit/simsamu

Viewer • Updated Nov 12, 2025 • 61 • 881 • 7

facebook/action100m-preview

Viewer • Updated 9 days ago • 120k • 2.91k • 100

upvoted a collection 16 days ago

Personalized Reasoning

Collection • 9 items • Updated Oct 15, 2025 • 5

liked a model 17 days ago

nvidia/nemotron-speech-streaming-en-0.6b

Automatic Speech Recognition • Updated 17 days ago • 8.12k • 423

upvoted a paper 17 days ago

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published 24 days ago • 62

liked a model 18 days ago

MedAIBase/AntAngelMed

103B • Updated 4 days ago • 526 • 77

liked a model 24 days ago

GAIR/LiveTalk-1.3B-V0.1

Image-to-Video • Updated 21 days ago • 190 • 14

liked a dataset 27 days ago

kahrendt/microwakeword

Updated Nov 22, 2024 • 557 • 4

liked a dataset about 1 month ago

SparkAudio/voxbox

Viewer • Updated Apr 15, 2025 • 23.8M • 11.3k • 70

upvoted 2 articles about 1 month ago

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Article • Nov 21, 2025

LLM based Audio models

Article • Dec 18, 2025

liked 2 models about 1 month ago

YatharthS/MiraTTS

Text-to-Speech • 0.5B • Updated 30 days ago • 6.35k • 181

google/medasr

Automatic Speech Recognition • Updated Dec 22, 2025 • 9.91k • 263

liked a dataset about 1 month ago

kolerk/Video_Reality_Test

Viewer • Updated 17 days ago • 149 • 261 • 7

upvoted a paper about 1 month ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 64

liked a model about 1 month ago

meituan-longcat/LongCat-Video-Avatar

Updated Dec 17, 2025 • 745 • 209