rajtest/Llama_v4_update_senti

Files changed (6)

README.md CHANGED

@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [unsloth/llama-2-7b-bnb-4bit](https://huggingface.co/unsloth/llama-2-7b-bnb-4bit) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5419
 ## Model description
@@ -47,16 +47,14 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.5118        | 0.9981 | 262  | 0.5076          |
-| 0.4495        | 2.0    | 525  | 0.5086          |
-| 0.3469        | 2.9943 | 786  | 0.5419          |
 ### Framework versions

 This model is a fine-tuned version of [unsloth/llama-2-7b-bnb-4bit](https://huggingface.co/unsloth/llama-2-7b-bnb-4bit) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4471
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.03
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.4447        | 0.9981 | 262  | 0.4471          |
 ### Framework versions
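
As a rough illustration of the scheduler settings listed in this hunk (`lr_scheduler_type: cosine`, `lr_scheduler_warmup_ratio: 0.03`), the sketch below computes the learning-rate multiplier over the 262 steps of the new single-epoch run. The base learning rate is not visible in the hunk, so only the multiplier is shown. The function name `lr_multiplier` is hypothetical, and rounding the warmup length up to a whole step is an assumption about how the trainer converts the ratio into steps.

```python
import math


def lr_multiplier(step: int, total_steps: int, warmup_ratio: float) -> float:
    """Linear warmup followed by a half-cosine decay, as a factor in [0, 1]."""
    # Assumption: the warmup ratio is converted to steps by rounding up.
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to the full learning rate.
        return step / max(1, warmup_steps)
    # Fraction of the post-warmup phase that has elapsed.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * (1.0 + math.cos(math.pi * progress))


TOTAL_STEPS = 262    # final step reported in the new training results table
WARMUP_RATIO = 0.03  # lr_scheduler_warmup_ratio from the model card

for step in (0, 4, 8, 135, 262):
    print(step, round(lr_multiplier(step, TOTAL_STEPS, WARMUP_RATIO), 4))
```

Under these assumptions the warmup lasts 8 steps, after which the multiplier decays from 1 toward 0 at step 262.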

adapter_config.json CHANGED

@@ -21,12 +21,12 @@
   "revision": null,
   "target_modules": [
     "down_proj",
     "q_proj",
-    "up_proj",
     "gate_proj",
     "o_proj",
-    "v_proj",
-    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "revision": null,
   "target_modules": [
     "down_proj",
+    "k_proj",
     "q_proj",
     "gate_proj",
+    "up_proj",
     "o_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
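
The `target_modules` hunk above moves three entries but does not appear to add or remove any. A minimal check, with both lists copied from the two sides of the diff, compares them as sets. The variable names are illustrative, and treating the list as unordered is an assumption about how the adapter loader consumes it.

```python
# Order as it appears on the old side of the diff.
old_target_modules = [
    "down_proj", "q_proj", "up_proj", "gate_proj", "o_proj", "v_proj", "k_proj",
]
# Order as it appears on the new side of the diff.
new_target_modules = [
    "down_proj", "k_proj", "q_proj", "gate_proj", "up_proj", "o_proj", "v_proj",
]

# Compare membership only, ignoring order.
same_modules = set(old_target_modules) == set(new_target_modules)
added = sorted(set(new_target_modules) - set(old_target_modules))
removed = sorted(set(old_target_modules) - set(new_target_modules))

print("same modules:", same_modules)
print("added:", added, "removed:", removed)
```

If the sets are equal, the change to this file is a reordering only, and the set of adapted projection layers is unchanged.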

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:90dc9f8485566a8b998758a8ff5899ca4c6862754155826089f8d23bf7fdac09
 size 159967880

 version https://git-lfs.github.com/spec/v1
+oid sha256:603c3f78985c7c11313954e433e72b13c97d6192d0167ef7e066ec81e19d3d3e
 size 159967880
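
The two Git LFS pointers above differ only in their `oid`, while `size` stays the same. That is consistent with retrained weights stored in an adapter of the same shape, though the pointers alone cannot confirm it. A small sketch that parses both pointers and compares the fields follows. The helper `parse_lfs_pointer` is hypothetical and assumes the simple `key value` line layout shown in the diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:90dc9f8485566a8b998758a8ff5899ca4c6862754155826089f8d23bf7fdac09
size 159967880
""")

new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:603c3f78985c7c11313954e433e72b13c97d6192d0167ef7e066ec81e19d3d3e
size 159967880
""")

content_changed = old_pointer["oid"] != new_pointer["oid"]
size_unchanged = int(old_pointer["size"]) == int(new_pointer["size"])

print("content changed:", content_changed)
print("size unchanged:", size_unchanged)
```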

runs/Aug20_12-20-06_a92550c9e7b0/events.out.tfevents.1724156411.a92550c9e7b0.696.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:dfc0108f373eb44e16cd53ca05a44d92d98522b083612b37ada27a8496e44625
+size 5431

runs/Aug20_12-20-23_a92550c9e7b0/events.out.tfevents.1724156429.a92550c9e7b0.696.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c95a17ff96a5109b81fd99faae85e397aee54f45c3d8693c926d1ba35d3fbcd4
+size 11817

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a65a998c63d86528e0413a88be1ecd9b126b582fe8ee1b42a4c17e72970c75cb
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:9dbef1d548b2b8c3354d1dbb9557a6402c521aac8def9e8ac29ed79140274fec
 size 5176