Transformers documentation

BART

Transformers

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

BART

免責事項: 何か奇妙なものを見つけた場合は、Github 問題を提出し、割り当ててください。 @patrickvonplaten

Overview

Bart モデルは、BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation、翻訳と理解 Mike Lewis、Yinhan Liu、Naman Goyal、Marjan 著ガズビニネジャド、アブデルラフマン・モハメド、オメル・レヴィ、ベス・ストヤノフ、ルーク・ゼトルモイヤー、2019年10月29日。

要約によると、

Bart は、双方向エンコーダ (BERT など) を備えた標準の seq2seq/機械翻訳アーキテクチャを使用します。左から右へのデコーダ (GPT など)。
事前トレーニングタスクには、元の文の順序をランダムにシャッフルし、新しい埋め込みスキームが含まれます。ここで、テキストの範囲は単一のマスクトークンに置き換えられます。
BART は、テキスト生成用に微調整した場合に特に効果的ですが、理解タスクにも適しています。それ RoBERTa のパフォーマンスを GLUE および SQuAD の同等のトレーニングリソースと同等にし、新たな成果を達成します。さまざまな抽象的な対話、質問応答、要約タスクに関する最先端の結果が得られ、成果が得られます。ルージュは最大6枚まで。

チップ：

BART は絶対位置埋め込みを備えたモデルであるため、通常は入力を右側にパディングすることをお勧めします。左。
エンコーダーとデコーダーを備えたシーケンスツーシーケンスモデル。エンコーダには破損したバージョンのトークンが供給され、デコーダには元のトークンが供給されます（ただし、通常のトランスフォーマーデコーダと同様に、将来のワードを隠すためのマスクがあります）。次の変換の構成は、エンコーダーの事前トレーニングタスクに適用されます。
- ランダムなトークンをマスクします (BERT と同様)
- ランダムなトークンを削除します
- k 個のトークンのスパンを 1 つのマスクトークンでマスクします (0 トークンのスパンはマスクトークンの挿入です)
- 文を並べ替えます
- ドキュメントを回転して特定のトークンから開始するようにします

このモデルは sshleifer によって提供されました。著者のコードはここにあります。

Examples

シーケンス間タスク用の BART およびその他のモデルを微調整するための例とスクリプトは、次の場所にあります。 examples/pytorch/summarization/。
Hugging Face datasets を使用して BartForConditionalGeneration をトレーニングする方法の例オブジェクトは、このフォーラムディスカッションで見つけることができます。
抽出されたチェックポイントは、この論文で説明されています。

Implementation Notes

Bart はシーケンスの分類に token_type_ids を使用しません。 BartTokenizer を使用するか、 encode() を使用して適切に分割します。
BartModel のフォワードパスは、渡されなかった場合、decoder_input_ids を作成します。これは、他のモデリング API とは異なります。この機能の一般的な使用例は、マスクの塗りつぶしです。
モデルの予測は、次の場合に元の実装と同一になるように意図されています。 forced_bos_token_id=0。ただし、これは、渡す文字列が次の場合にのみ機能します。 fairseq.encode はスペースで始まります。
generate() は、次のような条件付き生成タスクに使用する必要があります。要約については、その docstring の例を参照してください。
facebook/bart-large-cnn 重みをロードするモデルには mask_token_id がないか、実行できません。マスクを埋めるタスク。

Mask Filling

facebook/bart-base および facebook/bart-large チェックポイントを使用して、マルチトークンマスクを埋めることができます。

from transformers import BartForConditionalGeneration, BartTokenizer

model = BartForConditionalGeneration.from_pretrained("facebook/bart-large", forced_bos_token_id=0)
tok = BartTokenizer.from_pretrained("facebook/bart-large")
example_english_phrase = "UN Chief Says There Is No <mask> in Syria"
batch = tok(example_english_phrase, return_tensors="pt")
generated_ids = model.generate(batch["input_ids"])
assert tok.batch_decode(generated_ids, skip_special_tokens=True) == [
    "UN Chief Says There Is No Plan to Stop Chemical Weapons in Syria"
]

Resources

BART を始めるのに役立つ公式 Hugging Face およびコミュニティ (🌎 で示されている) リソースのリスト。ここに含めるリソースの送信に興味がある場合は、お気軽にプルリクエストを開いてください。審査させていただきます。リソースは、既存のリソースを複製するのではなく、何か新しいものを示すことが理想的です。

Summarization

に関するブログ投稿分散トレーニング: 🤗 Transformers と Amazon SageMaker を使用した要約のための BART/T5 のトレーニング。
方法に関するノートブック blurr を使用して fastai で要約するために BART を微調整する. 🌎 🌎
方法に関するノートブックトレーナークラスを使用して 2 つの言語で要約するために BART を微調整する。 🌎
BartForConditionalGeneration は、このサンプルスクリプトおよびノートブック。
TFBartForConditionalGeneration は、このサンプルスクリプトおよびノートブック。
FlaxBartForConditionalGeneration は、このサンプルスクリプトでサポートされています。
要約 🤗 ハグフェイスコースの章。
要約タスクガイド

Fill-Mask

BartForConditionalGeneration は、このサンプルスクリプトでサポートされており、ノートブック。
TFBartForConditionalGeneration は、このサンプルスクリプトおよびノートブック。
FlaxBartForConditionalGeneration は、このサンプルスクリプトおよびノートブック。
マスクされた言語モデリング 🤗 顔ハグコースの章。
マスクされた言語モデリングタスクガイド

Translation

ヒンディー語から英語への翻訳に Seq2SeqTrainer を使用して mBART を微調整する方法に関するノート。 🌎
BartForConditionalGeneration は、このサンプルスクリプトおよびノートブック。
TFBartForConditionalGeneration は、このサンプルスクリプトおよびノートブック。
翻訳タスクガイド

以下も参照してください。

Transformers

BART

Overview

Examples

Implementation Notes

Mask Filling

Resources

BartConfig

class transformers.BartConfig

BartTokenizer

class transformers.BartTokenizer

build_inputs_with_special_tokens

convert_tokens_to_string

create_token_type_ids_from_sequences

get_special_tokens_mask

BartTokenizerFast

class transformers.BartTokenizerFast

create_token_type_ids_from_sequences

BartModel

class transformers.BartModel

forward

BartForConditionalGeneration

class transformers.BartForConditionalGeneration

forward

BartForSequenceClassification

class transformers.BartForSequenceClassification

forward

BartForQuestionAnswering

class transformers.BartForQuestionAnswering

forward

BartForCausalLM

class transformers.BartForCausalLM

forward

TFBartModel

class transformers.TFBartModel

call

TFBartForConditionalGeneration

class transformers.TFBartForConditionalGeneration

call

TFBartForSequenceClassification

class transformers.TFBartForSequenceClassification

call

FlaxBartModel

class transformers.FlaxBartModel

__call__

encode

decode

FlaxBartForConditionalGeneration

class transformers.FlaxBartForConditionalGeneration

__call__

encode

decode

FlaxBartForSequenceClassification

class transformers.FlaxBartForSequenceClassification

__call__

encode

decode

FlaxBartForQuestionAnswering

class transformers.FlaxBartForQuestionAnswering

__call__

encode

decode

FlaxBartForCausalLM

class transformers.FlaxBartForCausalLM

__call__

call

call

call

call

call