|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- wikisql |
|
language: |
|
- en |
|
pipeline_tag: text2text-generation |
|
tags: |
|
- nl2sql |
|
widget: |
|
- text: "question: get people name with age less 25 table: id, name, age" |
|
example_title: "less than" |
|
- text: "question: get people name with age equal 25 table: id, name, age" |
|
example_title: "equal" |
|
--- |
|
|
|
new version: [LarkAI/codet5p-770m_nl2sql_oig](https://huggingface.co./LarkAI/codet5p-770m_nl2sql_oig) |
|
|
|
use oig-sql dataset and support more complex sql parse |
|
|
|
# How to Use |
|
|
|
```python |
|
import torch |
|
from transformers import AutoTokenizer, BartForConditionalGeneration |
|
|
|
device = torch.device('cuda:0') |
|
|
|
tokenizer = AutoTokenizer.from_pretrained("LarkAI/bart_large_nl2sql") |
|
model = BartForConditionalGeneration.from_pretrained("LarkAI/bart_large_nl2sql").to(device) |
|
|
|
text = "question: get people name with age less 25 table: id, name, age" |
|
inputs = tokenizer([text], max_length=1024, return_tensors="pt") |
|
output_ids = model.generate(inputs["input_ids"].to(device), num_beams=self.beams, max_length=128, min_length=8) |
|
response_text = tokenizer.batch_decode(output_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False)[0] |
|
# SELECT name FROM table WHERE age < 25 |
|
``` |
|
|
|
reference: [juierror/flan-t5-text2sql-with-schema](https://huggingface.co./juierror/flan-t5-text2sql-with-schema) - fix this [discussion](https://huggingface.co./juierror/flan-t5-text2sql-with-schema/discussions/5) |
|
|
|
# How to Train |
|
|
|
Quick start: https://github.com/huggingface/transformers/blob/main/examples/pytorch/summarization/README.md |