Rehaannn commited on
Commit
46c2384
·
1 Parent(s): 7ee0952

Update architecture to EpicBrelloV1ForCausalLM for unique Epic Systems branding

Browse files
Files changed (3) hide show
  1. README.md +2 -2
  2. config.json +2 -2
  3. model_card.md +3 -3
README.md CHANGED
@@ -43,7 +43,7 @@ tags:
43
  - **Base Model**: Tencent Hunyuan
44
  - **Parameters**: 1.8B (optimized for efficiency)
45
  - **Context Window**: 256K tokens
46
- - **Architecture**: HunYuanDenseV1ForCausalLM
47
  - **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
48
 
49
  ## Usage
@@ -131,7 +131,7 @@ tokenized_chat = tokenizer.apply_chat_template(
131
  |---------------|-------|
132
  | Model Size | 1.8B Parameters |
133
  | Context Window | 256K Tokens |
134
- | Architecture | HunYuanDenseV1ForCausalLM |
135
  | Base Model | Tencent Hunyuan |
136
  | Creator | Epic Systems |
137
  | Engineer | Rehan Temkar |
 
43
  - **Base Model**: Tencent Hunyuan
44
  - **Parameters**: 1.8B (optimized for efficiency)
45
  - **Context Window**: 256K tokens
46
+ - **Architecture**: EpicBrelloV1ForCausalLM
47
  - **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
48
 
49
  ## Usage
 
131
  |---------------|-------|
132
  | Model Size | 1.8B Parameters |
133
  | Context Window | 256K Tokens |
134
+ | Architecture | EpicBrelloV1ForCausalLM |
135
  | Base Model | Tencent Hunyuan |
136
  | Creator | Epic Systems |
137
  | Engineer | Rehan Temkar |
config.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e4fe1b93a79776b39af79141cebabd852822850b58463a22eed420247eb7aa49
3
- size 2199
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3c27766775cbdf0d11e75448039e562ce448eb8dd13d4c2b7e4affe1d2519ff
3
+ size 2195
model_card.md CHANGED
@@ -33,14 +33,14 @@ tags:
33
  - **Base Model**: Tencent Hunyuan
34
  - **Parameters**: 1.8B (optimized for efficiency)
35
  - **Context Window**: 256K tokens
36
- - **Architecture**: HunYuanDenseV1ForCausalLM
37
  - **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
38
 
39
  ## Model Summary
40
 
41
  | Specification | Value |
42
  |---------------|-------|
43
- | **Architecture** | HunYuanDenseV1ForCausalLM |
44
  | **Total Parameters** | 1.8B |
45
  | **Context Window** | 256K tokens |
46
  | **Hidden Size** | 2048 |
@@ -205,7 +205,7 @@ tokenized_chat = tokenizer.apply_chat_template(
205
  ## Technical Specifications
206
 
207
  ### Architecture Details
208
- - **Model Type**: HunYuanDenseV1ForCausalLM
209
  - **Attention Mechanism**: Grouped Query Attention (GQA)
210
  - **Position Embedding**: Dynamic RoPE with scaling
211
  - **Normalization**: RMSNorm with epsilon 1e-05
 
33
  - **Base Model**: Tencent Hunyuan
34
  - **Parameters**: 1.8B (optimized for efficiency)
35
  - **Context Window**: 256K tokens
36
+ - **Architecture**: EpicBrelloV1ForCausalLM
37
  - **Specialization**: Reasoning, Mathematics, Programming, Creative Thinking
38
 
39
  ## Model Summary
40
 
41
  | Specification | Value |
42
  |---------------|-------|
43
+ | **Architecture** | EpicBrelloV1ForCausalLM |
44
  | **Total Parameters** | 1.8B |
45
  | **Context Window** | 256K tokens |
46
  | **Hidden Size** | 2048 |
 
205
  ## Technical Specifications
206
 
207
  ### Architecture Details
208
+ - **Model Type**: EpicBrelloV1ForCausalLM
209
  - **Attention Mechanism**: Grouped Query Attention (GQA)
210
  - **Position Embedding**: Dynamic RoPE with scaling
211
  - **Normalization**: RMSNorm with epsilon 1e-05