Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
VaidhyaMegha
/
Shoonya
like
2
Follow
Vaidhyamegha Private Limited
4
Text Generation
PyTorch
ONNX
roneneldan/TinyStories
English
deepseek
cpu-optimized
transformer
language-model
tinystories
grouped-query-attention
rotary-position-embeddings
rmsnorm
swiglu
arxiv:
2305.07759
License:
mit
Model card
Files
Files and versions
xet
Community
main
Shoonya
270 MB
1 contributor
History:
6 commits
MandarapuMadhulatha
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
8493c0e
verified
12 months ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
README.md
Safe
4.7 kB
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
12 months ago
config.json
Safe
485 Bytes
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
12 months ago
model.onnx
117 MB
xet
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
12 months ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
65.7 MB
xet
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
12 months ago
quantization_note.md
Safe
749 Bytes
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
12 months ago
shoonya_model_v0_1.pt
pickle
Detected Pickle imports (5)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"model.transformer.ModelConfig"
,
"torch.LongStorage"
,
"torch.FloatStorage"
How to fix it?
53.7 MB
xet
feat(model): add Hugging Face Hub publication support
about 1 year ago
shoonya_model_v0_1_quantized.pt
32.8 MB
xet
feat(model): add Hugging Face Hub publication support
about 1 year ago
tokenizer_config.json
Safe
156 Bytes
Upload Shoonya Model v0.2 with DeepSeek CPU optimizations
12 months ago