study-hjt
/

CodeQwen1.5-7B-Chat-GPTQ-Int4

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

study-hjt commited on Apr 26, 2024

Commit

cdfcd16

·

verified ·

1 Parent(s): f04d0bb

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -56,15 +56,15 @@ KeyError: 'qwen2'.
 Here provides a code snippet with `apply_chat_template` to show you how to load the tokenizer and model and how to generate contents.
 ```python
-from modelscope import AutoModelForCausalLM, AutoTokenizer
 device = "cuda" # the device to load the model onto
 model = AutoModelForCausalLM.from_pretrained(
-    "huangjintao/CodeQwen1.5-7B-Chat-GPTQ-Int4",
     torch_dtype="auto",
     device_map="auto"
 )
-tokenizer = AutoTokenizer.from_pretrained("huangjintao/CodeQwen1.5-7B-Chat-GPTQ-Int4")
 prompt = "Write a quicksort algorithm in python."
 messages = [

 Here provides a code snippet with `apply_chat_template` to show you how to load the tokenizer and model and how to generate contents.
 ```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
 device = "cuda" # the device to load the model onto
 model = AutoModelForCausalLM.from_pretrained(
+    "study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4",
     torch_dtype="auto",
     device_map="auto"
 )
+tokenizer = AutoTokenizer.from_pretrained("study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4")
 prompt = "Write a quicksort algorithm in python."
 messages = [