Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Girinath11
/
MixtureofRecursionwithRouter

Text Generation
Transformers
recursive-transformer
technical-content
code-generation
math
conversation
bpe-tokenizer
adaptive-routing
Model card Files Files and versions
xet
Community
MixtureofRecursionwithRouter
Ctrl+K
Ctrl+K
  • 1 contributor
History: 23 commits
Girinath11's picture
Girinath11
Update README.md
5a4d89b verified 11 days ago
  • checkpoints
    Rename best_model.pt to checkpoints/best_model.pt 11 days ago
  • split_data
    Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt 11 days ago
  • tokenizer
    Rename merges.txt to tokenizer/merges.txt 11 days ago
  • .gitattributes
    2.22 kB
    Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt 11 days ago
  • README.md
    9.05 kB
    Update README.md 11 days ago
  • custom_tokenizer.py
    21.2 kB
    Create custom_tokenizer.py 11 days ago
  • embeddings.py
    13.8 kB
    Create embeddings.py 11 days ago
  • model_slm.py
    15.7 kB
    Create model_slm.py 11 days ago
  • requirements.txt
    75 Bytes
    Create requirements.txt 11 days ago
  • slm_training_complete_chat.txt
    143 MB
    xet
    Upload slm_training_complete_chat.txt 11 days ago
  • train.py
    18.1 kB
    Create train.py 11 days ago
  • ultra_fast_results .json
    2.09 kB
    Rename ultra_fast_results (1).json to ultra_fast_results .json 11 days ago