bartowski commited on
Commit
916afc7
·
verified ·
0 Parent(s):

Super-squash branch 'main' using huggingface_hub

Browse files
.gitattributes ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Trinity-Mini-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
37
+ Trinity-Mini-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Trinity-Mini-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Trinity-Mini-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Trinity-Mini-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Trinity-Mini-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Trinity-Mini-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Trinity-Mini-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Trinity-Mini-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Trinity-Mini-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Trinity-Mini-Q2_K_L.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Trinity-Mini-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
48
+ Trinity-Mini-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
49
+ Trinity-Mini-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
50
+ Trinity-Mini-Q3_K_XL.gguf filter=lfs diff=lfs merge=lfs -text
51
+ Trinity-Mini-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
52
+ Trinity-Mini-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
53
+ Trinity-Mini-Q4_K_L.gguf filter=lfs diff=lfs merge=lfs -text
54
+ Trinity-Mini-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
55
+ Trinity-Mini-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
56
+ Trinity-Mini-Q5_K_L.gguf filter=lfs diff=lfs merge=lfs -text
57
+ Trinity-Mini-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
58
+ Trinity-Mini-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
59
+ Trinity-Mini-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
60
+ Trinity-Mini-Q6_K_L.gguf filter=lfs diff=lfs merge=lfs -text
61
+ Trinity-Mini-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,132 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ - es
6
+ - fr
7
+ - de
8
+ - it
9
+ - pt
10
+ - ru
11
+ - ar
12
+ - hi
13
+ - ko
14
+ - zh
15
+ library_name: transformers
16
+ base_model:
17
+ - arcee-ai/Trinity-Mini
18
+ base_model_relation: quantized
19
+ ---
20
+ <div align="center">
21
+ <picture>
22
+ <img
23
+ src="https://cdn-uploads.huggingface.co/production/uploads/6435718aaaef013d1aec3b8b/i-v1KyAMOW_mgVGeic9WJ.png"
24
+ alt="Arcee Trinity Mini"
25
+ style="max-width: 100%; height: auto;"
26
+ >
27
+ </picture>
28
+ </div>
29
+
30
+ # Trinity Mini GGUF
31
+
32
+ Trinity Mini is an Arcee AI 26B MoE model with 3B active parameters. It is the medium-sized model in our new Trinity family, a series of open-weight models for enterprise and tinkerers alike.
33
+
34
+ This model is tuned for reasoning, but in testing, it uses a similar total token count to competitive instruction-tuned models.
35
+
36
+ These are the GGUF files for running on llama.cpp powered platforms
37
+
38
+ ***
39
+
40
+ Trinity Mini is trained on 10T tokens gathered and curated through a key partnership with [Datology](https://www.datologyai.com/), building upon the excellent dataset we used on [AFM-4.5B](https://huggingface.co/arcee-ai/AFM-4.5B) with additional math and code.
41
+
42
+ Training was performed on a cluster of 512 H200 GPUs powered by [Prime Intellect](https://www.primeintellect.ai/) using HSDP parallelism.
43
+
44
+ More details, including key architecture decisions, can be found on our blog [here](https://www.arcee.ai/blog)
45
+
46
+ Try it out now at [chat.arcee.ai](http://chat.arcee.ai/)
47
+
48
+ ***
49
+
50
+ ## Model Details
51
+
52
+ * **Model Architecture:** AfmoeForCausalLM
53
+ * **Parameters:** 26B, 3B active
54
+ * **Experts:** 128 total, 8 active, 1 shared
55
+ * **Context length:** 128k
56
+ * **Training Tokens:** 10T
57
+ * **License:** [Apache 2.0](https://huggingface.co/arcee-ai/Trinity-Mini#license)
58
+ * **Recommended settings:**
59
+ * temperature: 0.15
60
+ * top_k: 50
61
+ * top_p: 0.75
62
+ * min_p: 0.06
63
+
64
+ ***
65
+
66
+ ## Benchmarks
67
+
68
+ ![](https://cdn-uploads.huggingface.co/production/uploads/6435718aaaef013d1aec3b8b/UMV0OZh_H1JfvgzBTXh6u.png)
69
+
70
+ <div align="center">
71
+ <picture>
72
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/6435718aaaef013d1aec3b8b/sSVjGNHfrJKmQ6w8I18ek.png" style="background-color:ghostwhite;padding:5px;" width="17%" alt="Powered by Datology">
73
+ </picture>
74
+ </div>
75
+
76
+ ### Running our model
77
+
78
+ - [llama.cpp](https://huggingface.co/arcee-ai/Trinity-Mini#llamacpp)
79
+ - [LM Studio](https://huggingface.co/arcee-ai/Trinity-Mini#lm-studio)
80
+ - [API](https://huggingface.co/arcee-ai/Trinity-Mini#api)
81
+
82
+ ## llama.cpp
83
+
84
+ Supported in llama.cpp release b7061
85
+
86
+ Download the latest [llama.cpp release](https://github.com/ggml-org/llama.cpp/releases)
87
+
88
+ ```
89
+ llama-server -hf arcee-ai/Trinity-Mini-GGUF:q4_k_m \
90
+ --temp 0.15 \
91
+ --top-k 50 \
92
+ --top-p 0.75
93
+ --min-p 0.06
94
+ ```
95
+
96
+ ## LM Studio
97
+
98
+ Supported in latest LM Studio runtime
99
+
100
+ Update to latest available, then verify your runtime by:
101
+
102
+ 1. Click "Power User" at the bottom left
103
+ 2. Click the green "Developer" icon at the top left
104
+ 3. Select "LM Runtimes" at the top
105
+ 4. Refresh the list of runtimes and verify that the latest is installed
106
+
107
+ Then, go to Model Search and search for `arcee-ai/Trinity-Mini-GGUF`, download your prefered size, and load it up in the chat
108
+
109
+ ## API
110
+
111
+ Trinity Mini is available today on openrouter:
112
+
113
+ https://openrouter.ai/arcee-ai/trinity-mini
114
+
115
+ ```
116
+ curl -X POST "https://openrouter.ai/v1/chat/completions" \
117
+ -H "Authorization: Bearer $OPENROUTER_API_KEY" \
118
+ -H "Content-Type: application/json" \
119
+ -d '{
120
+ "model": "arcee-ai/trinity-mini",
121
+ "messages": [
122
+ {
123
+ "role": "user",
124
+ "content": "What are some fun things to do in New York?"
125
+ }
126
+ ]
127
+ }'
128
+ ```
129
+
130
+ ## License
131
+
132
+ Trinity-Mini is released under the Apache-2.0 license.
Trinity-Mini-IQ2_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bb2a02c5a8a76586e5e5fd51e85be7edff7cd85fe4ab07c0c34e687bafe77689
3
+ size 8482186080
Trinity-Mini-IQ2_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b8fb6ac308a72fc018565f0a8f3872ac01f0407d3819b6dc85beac99fc0d5fa0
3
+ size 7556293472
Trinity-Mini-IQ2_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:26b3d92dac7697b23f82c2a81f0e3e554dd1cc3cfeee90f40f4a9be33aa1beda
3
+ size 7481099104
Trinity-Mini-IQ2_XXS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e565fc7b9730eb0c505a0316f4b7cb426580d3ce406c43a7c85fff3e87527a83
3
+ size 6555206496
Trinity-Mini-IQ3_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c6632e7be492b70d823633bd278f820359a83e73ce9d8ab73a518f53b14ee63
3
+ size 12100780896
Trinity-Mini-IQ3_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3317e3922528e75c50382f1016807296e2ffed9c7c2fd0916a8bc4a06ef9241
3
+ size 10981753696
Trinity-Mini-IQ3_XXS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ff881a7f798537d7b5f348c280c49f23a1fc8f6d0b9fb425ecc083e03d35686
3
+ size 10542080864
Trinity-Mini-IQ4_NL.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:648944086f877e7799d780e7f695e0d3a020dde9c745752b74541d4def49cebe
3
+ size 14946935648
Trinity-Mini-IQ4_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:55384aa295b494793d18c6caf0ee67a0f50abca09b62aba28f36dda63724c527
3
+ size 14160012128
Trinity-Mini-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0419d3ecf5b221a7f814071e32421cb7c1ec820f1e95fbd41eb848f155cb8db7
3
+ size 9425052512
Trinity-Mini-Q2_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:639f9de7fc4c999379a3d6518aae102afc7448f45b55fa818f2156daed55aca0
3
+ size 9825436512
Trinity-Mini-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fdad900de7982b1c1ffdc90dce55959a5037e3b67d05d057341d48ad5c0f000f
3
+ size 12508676960
Trinity-Mini-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc6a658c42ab13d688953a48f9df5218a12d5a4ec58793b875dd1aa1d7dbfcb3
3
+ size 12104188768
Trinity-Mini-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8eb027c4b04aadedd309e38ece5270c0f2df91dca4219842eaf896ff6b88f28f
3
+ size 11593859936
Trinity-Mini-Q3_K_XL.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9488847da15b834708d1f08be409db1147cd9b02c8bfeb7cad7d43bcef32fd9b
3
+ size 12867421024
Trinity-Mini-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6cf2deb33136427f5591d93ea3adad8e0b5a3a1d95f3dfcda08c7ccea27da0d0
3
+ size 15145640800
Trinity-Mini-Q4_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f4c74c3a720762357ca97d24cbb5ec16ef1d1562c10d664c488f8edea0f0f09
3
+ size 16501908320
Trinity-Mini-Q4_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8d09eeef5143de5589635a3199e7ae68de8a9fe08973cb5784bd2a837d014e44
3
+ size 16240100192
Trinity-Mini-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c2f0f2991923c21baee495340b79feb2a124b1c93b57dea0652f8aa27ba08e8
3
+ size 15935808352
Trinity-Mini-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ecca85005b5a92ad47fe94da8854a5c573cae328b7756609ebeb684e35f7e693
3
+ size 15416173408
Trinity-Mini-Q5_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fbea1da34ebe172acd26a459e9a6d20927131dd4cdb3c47caf5f931036f80b4f
3
+ size 18890113888
Trinity-Mini-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:12152dc55123f03512ec4d74d4d6f69fea204a6a6b31f6864b7bc2803621b1b3
3
+ size 18637071200
Trinity-Mini-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:162c19126ba80c3e197f137eb7a564818f133eea59b4c31ab40ef0903b0c8939
3
+ size 18107999072
Trinity-Mini-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:45a436765975d1acc4c5ccc1cde89ba7c790e5057270ba2b9b85b0b1313aa0a4
3
+ size 21516911456
Trinity-Mini-Q6_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f5b23e2a9b03151e4bad4c2ae375429a7b78018e5eea7ef81317c62f838bb287
3
+ size 21715501920
Trinity-Mini-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7e39913b1f31c4f284605127f0b035abafc3b456efd26a988e53c2a3e12c6601
3
+ size 27788002144