GTimothee
/

sql-code-llama-4bits

Model card Files Files and versions

sql-code-llama-4bits / README.md

GTimothee's picture

Update README.md

68206ce verified about 2 years ago

|

history blame contribute delete

2.38 kB

	---
	library_name: peft
	base_model: codellama/CodeLlama-7b-hf
	---

	# Model Card for Model ID

	4 bit general purpose text-to-SQL model.

	Takes 5677MiB of GPU memory.


	## Model Details

	### Model Description

	Provide the CREATE statement of the target table(s) in the context of your prompt and ask a question to your database. The model outputs a query to answer the question.

	Data used for fine tuning: https://huggingface.co/datasets/b-mc2/sql-create-context

	## Uses

	This model can be coupled with a chat model like llama2-chat to convert the output into a text answer.

	### Direct Use

	```python
	from peft import AutoPeftModelForCausalLM
	from transformers import AutoTokenizer, AutoModelForCausalLM
	import torch


	# load model
	base_model = "GTimothee/sql-code-llama-4bits"
	model = AutoModelForCausalLM.from_pretrained(
	base_model,
	load_in_4bit=True,
	torch_dtype=torch.float16,
	device_map="auto",
	)
	model.eval()

	# load tokenizer
	tokenizer = AutoTokenizer.from_pretrained("codellama/CodeLlama-7b-hf")

	eval_prompt = """You are a powerful text-to-SQL model. Your job is to answer questions about a database. You are given a question and context regarding one or more tables.

	You must output the SQL query that answers the question.
	### Input:
	Which Class has a Frequency MHz larger than 91.5, and a City of license of hyannis, nebraska?

	### Context:
	CREATE TABLE table_name_12 (class VARCHAR, frequency_mhz VARCHAR, city_of_license VARCHAR)

	### Response:
	"""

	model_input = tokenizer(eval_prompt, return_tensors="pt").to("cuda")
	with torch.no_grad():
	print(tokenizer.decode(model.generate(**model_input, max_new_tokens=100)[0], skip_special_tokens=True))
	```

	Outputs:

	```python
	### Response:
	SELECT class FROM table_name_12 WHERE frequency_mhz > 91.5 AND city_of_license = "hyannis, nebraska"
	```


	## Bias, Risks, and Limitations

	- potential security issues if there is a malicious use. If you execute blindly the SQL queries that are being generated by end users you could lose data, leak information etc.
	- may be mistaken depending on the way the prompt has been written.

	### Recommendations

	- Make sure that you check the generated SQL before applying it if the model is used by end users directly.
	- The model works well when used on simple tables and simple queries. If possible, try to break a complex query into multiple simple queries.