Girinath11/MixtureofRecursionwithRouter
Tags: Text Generation, Transformers, PyTorch, mixture_of_recursions, feature-extraction, recursive-transformer, technical-content, code-generation, math, conversation, bpe-tokenizer, adaptive-routing, custom_code
License: apache-2.0
Latest commit: a3974d5 (verified), Girinath11, 4 months ago: Rename best_model.pt to checkpoints/best_model.pt
Repository size: 781 MB
1 contributor
History: 21 commits
| Name | Size | Last commit message | Updated |
|---|---|---|---|
| checkpoints | - | Rename best_model.pt to checkpoints/best_model.pt | 4 months ago |
| split_data | - | Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt | 4 months ago |
| tokenizer | - | Rename merges.txt to tokenizer/merges.txt | 4 months ago |
| .gitattributes | 2.22 kB | Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt | 4 months ago |
| README.md | 31 Bytes | initial commit | 4 months ago |
| custom_tokenizer.py | 21.2 kB | Create custom_tokenizer.py | 4 months ago |
| embeddings.py | 13.8 kB | Create embeddings.py | 4 months ago |
| model_slm.py | 15.7 kB | Create model_slm.py | 4 months ago |
| requirements.txt | 75 Bytes | Create requirements.txt | 4 months ago |
| slm_training_complete_chat.txt | 143 MB (xet) | Upload slm_training_complete_chat.txt | 4 months ago |
| train.py | 18.1 kB | Create train.py | 4 months ago |
| ultra_fast_results .json | 2.09 kB | Rename ultra_fast_results (1).json to ultra_fast_results .json | 4 months ago |