Update architecture to EpicBrelloV1ForCausalLM for unique Epic Systems branding
Browse files- README.md +2 -2
- config.json +2 -2
- model_card.md +3 -3
README.md
CHANGED
|
@@ -43,7 +43,7 @@ tags:
|
|
| 43 |
- **Base Model**: Tencent Hunyuan
|
| 44 |
- **Parameters**: 1.8B (optimized for efficiency)
|
| 45 |
- **Context Window**: 256K tokens
|
| 46 |
-
- **Architecture**:
|
| 47 |
- **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
|
| 48 |
|
| 49 |
## Usage
|
|
@@ -131,7 +131,7 @@ tokenized_chat = tokenizer.apply_chat_template(
|
|
| 131 |
|---------------|-------|
|
| 132 |
| Model Size | 1.8B Parameters |
|
| 133 |
| Context Window | 256K Tokens |
|
| 134 |
-
| Architecture |
|
| 135 |
| Base Model | Tencent Hunyuan |
|
| 136 |
| Creator | Epic Systems |
|
| 137 |
| Engineer | Rehan Temkar |
|
|
|
|
| 43 |
- **Base Model**: Tencent Hunyuan
|
| 44 |
- **Parameters**: 1.8B (optimized for efficiency)
|
| 45 |
- **Context Window**: 256K tokens
|
| 46 |
+
- **Architecture**: EpicBrelloV1ForCausalLM
|
| 47 |
- **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
|
| 48 |
|
| 49 |
## Usage
|
|
|
|
| 131 |
|---------------|-------|
|
| 132 |
| Model Size | 1.8B Parameters |
|
| 133 |
| Context Window | 256K Tokens |
|
| 134 |
+
| Architecture | EpicBrelloV1ForCausalLM |
|
| 135 |
| Base Model | Tencent Hunyuan |
|
| 136 |
| Creator | Epic Systems |
|
| 137 |
| Engineer | Rehan Temkar |
|
config.json
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a3c27766775cbdf0d11e75448039e562ce448eb8dd13d4c2b7e4affe1d2519ff
|
| 3 |
+
size 2195
|
model_card.md
CHANGED
|
@@ -33,14 +33,14 @@ tags:
|
|
| 33 |
- **Base Model**: Tencent Hunyuan
|
| 34 |
- **Parameters**: 1.8B (optimized for efficiency)
|
| 35 |
- **Context Window**: 256K tokens
|
| 36 |
-
- **Architecture**:
|
| 37 |
- **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
|
| 38 |
|
| 39 |
## Model Summary
|
| 40 |
|
| 41 |
| Specification | Value |
|
| 42 |
|---------------|-------|
|
| 43 |
-
| **Architecture** |
|
| 44 |
| **Total Parameters** | 1.8B |
|
| 45 |
| **Context Window** | 256K tokens |
|
| 46 |
| **Hidden Size** | 2048 |
|
|
@@ -205,7 +205,7 @@ tokenized_chat = tokenizer.apply_chat_template(
|
|
| 205 |
## Technical Specifications
|
| 206 |
|
| 207 |
### Architecture Details
|
| 208 |
-
- **Model Type**:
|
| 209 |
- **Attention Mechanism**: Grouped Query Attention (GQA)
|
| 210 |
- **Position Embedding**: Dynamic RoPE with scaling
|
| 211 |
- **Normalization**: RMSNorm with epsilon 1e-05
|
|
|
|
| 33 |
- **Base Model**: Tencent Hunyuan
|
| 34 |
- **Parameters**: 1.8B (optimized for efficiency)
|
| 35 |
- **Context Window**: 256K tokens
|
| 36 |
+
- **Architecture**: EpicBrelloV1ForCausalLM
|
| 37 |
- **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
|
| 38 |
|
| 39 |
## Model Summary
|
| 40 |
|
| 41 |
| Specification | Value |
|
| 42 |
|---------------|-------|
|
| 43 |
+
| **Architecture** | EpicBrelloV1ForCausalLM |
|
| 44 |
| **Total Parameters** | 1.8B |
|
| 45 |
| **Context Window** | 256K tokens |
|
| 46 |
| **Hidden Size** | 2048 |
|
|
|
|
| 205 |
## Technical Specifications
|
| 206 |
|
| 207 |
### Architecture Details
|
| 208 |
+
- **Model Type**: EpicBrelloV1ForCausalLM
|
| 209 |
- **Attention Mechanism**: Grouped Query Attention (GQA)
|
| 210 |
- **Position Embedding**: Dynamic RoPE with scaling
|
| 211 |
- **Normalization**: RMSNorm with epsilon 1e-05
|