Efficient Intelligence and Systems
community
Efficient-ML
AI & ML interests
Low-bit Quantization of Large Language Models (LLMs)
Recent Activity
AaronHuangWei authored a paper 13 days ago: MC#: Mixture Compressor for Mixture-of-Experts Large Models
AaronHuangWei authored a paper 13 days ago: Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
AaronHuangWei submitted a paper 14 days ago: Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Team members: 9
Efficient-ML's models (52)
Efficient-ML/Qwen3-4B-base-gptq-w8-perchannel • Updated May 5, 2025
Efficient-ML/Qwen3-4B-base-gptq-w8-128 • Updated May 5, 2025
Efficient-ML/Qwen3-4B-base-gptq-w4-perchannel • Updated May 5, 2025
Efficient-ML/Qwen3-4B-base-gptq-w4-128 • Updated May 5, 2025
Efficient-ML/Qwen3-1.7B-base-gptq-w8-perchannel • Updated May 5, 2025
Efficient-ML/Qwen3-1.7B-base-gptq-w8-128 • Updated May 5, 2025
Efficient-ML/Qwen3-1.7B-base-gptq-w4-perchannel • Updated May 5, 2025
Efficient-ML/Qwen3-1.7B-base-gptq-w4-128 • Updated May 5, 2025
Efficient-ML/Qwen3-0.6B-base-gptq-w8-128 • Updated May 5, 2025
Efficient-ML/Qwen3-0.6B-base-gptq-w8-perchannel • Updated May 5, 2025
Efficient-ML/Qwen3-0.6B-base-gptq-w4-128 • Updated May 5, 2025
Efficient-ML/Qwen3-0.6B-base-gptq-w4-perchannel • Updated May 5, 2025
Efficient-ML/LLaMA-3-70B-GPTQ-4bit-b128 • Updated Jun 4, 2024 • 2
Efficient-ML/LLaMA-3-8B-AWQ-4bit-b128 • Text Generation • Updated Apr 28, 2024 • 11
Efficient-ML/LLaMA-3-8B-DB-LLM-2bit-fake • Text Generation • Updated Apr 26, 2024 • 9 • 2
Efficient-ML/LLaMA-3-8B-QuIP-2bit • Text Generation • Updated Apr 26, 2024 • 11 • 3
Efficient-ML/LLaMA-3-8B-IR-QLoRA • Updated Apr 25, 2024 • 4
Efficient-ML/LLaMA-3-8B-SmoothQuant-4bit-4bit • Text Generation • 8B • Updated Apr 22, 2024 • 4
Efficient-ML/LLaMA-3-8B-SmoothQuant-8bit-8bit • Text Generation • 8B • Updated Apr 22, 2024 • 6
Efficient-ML/LLaMA-3-8B-PB-LLM-1.7bit-fake • Text Generation • 8B • Updated Apr 22, 2024 • 6 • 1
Efficient-ML/LLaMA-3-8B-BiLLM-1.1bit-fake • Updated Apr 21, 2024
Efficient-ML/LLaMA-3-8B-GPTQ-4bit-b128 • Updated Apr 21, 2024 • 3