Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MhaWay
/
Veronica

Text Generation
Transformers
PyTorch
English
veronica
polymorphic-mlp
mixture-of-branches
entropy-regularized-routing
decoder-only
causal-lm
rope
expandable-architecture
research
Model card Files Files and versions
xet
Community
Veronica
1.11 GB
  • 1 contributor
History: 23 commits
MhaWay's picture
MhaWay
Update README.md
4579484 verified 28 days ago
  • veronica
    HF alignment about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit 3 months ago
  • README.md
    12.4 kB
    Update README.md 28 days ago
  • added_tokens.json
    332 Bytes
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • config.json
    877 Bytes
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • generation_config.json
    159 Bytes
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • merges.txt
    456 kB
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • pytorch_model.bin
    1.1 GB
    xet
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • special_tokens_map.json
    694 Bytes
    HF alignment about 1 month ago
  • tokenizer.json
    3.56 MB
    HF alignment about 1 month ago
  • tokenizer_config.json
    3.22 kB
    HF alignment about 1 month ago
  • train_veronica.py
    23.4 kB
    HF alignment about 1 month ago
  • trainer_state.json
    66.5 kB
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • training_args.bin
    5.78 kB
    xet
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago
  • veronica-pretrain-24L.json
    546 Bytes
    HF alignment about 1 month ago
  • vocab.json
    798 kB
    Veronica-Polymorphic 551M β€” Pretrained v1 about 1 month ago