Update README.md
README.md
CHANGED
@@ -251,15 +251,16 @@ outputs = model.generate(input_ids=inputs.to(model.device), max_new_tokens=150)
 #### Training Hyperparameters
 
 The following hyperparameters were used during training:
 
-- **learning_rate:** `3e-4`
-- **train_batch_size:** Effectively adjusted by `per_device_train_batch_size=1` and `gradient_accumulation_steps=4`
-- **eval_batch_size:** Implicitly determined by the evaluation setup (not explicitly defined)
-- **seed:** Not explicitly stated, crucial for ensuring reproducibility
-- **optimizer:** `paged_adamw_8bit`, designed for efficient memory utilization
-- **lr_scheduler_type:** Learning rate adjustments indicate adaptive scheduling, though specific type is not mentioned
-- **training_steps:** `500`
-- **mixed_precision_training:** Not explicitly mentioned; any applied strategy would aim at computational efficiency
+- learning_rate: 3e-4
+- per_device_train_batch_size: 1
+- gradient_accumulation_steps: 4
+- eval_batch_size: Implicitly determined by the evaluation setup
+- seed: Not explicitly stated
+- optimizer: paged_adamw_8bit
+- lr_scheduler_type: Not specified, adaptive adjustments indicated
+- training_steps: 500
+- mixed_precision_training: Not explicitly mentioned
 
 #### Training Results
 
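For readers reconstructing the run, the sketch below shows one way the updated hyperparameter list could map onto `transformers.TrainingArguments`. This is a minimal, assumed setup: the output directory, trainer class, dataset, and model loading are not part of the diff, and only the listed values themselves come from the README.

```python
# Minimal sketch assuming the run used transformers' Trainer / TrainingArguments.
# Only the values from the hyperparameter list above are taken from the README;
# everything else (output_dir, trainer, data) is an assumption for illustration.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="outputs",            # assumed; not stated in the diff
    learning_rate=3e-4,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,   # effective batch size of 4 per device
    max_steps=500,                   # "training_steps: 500"
    optim="paged_adamw_8bit",        # bitsandbytes paged 8-bit AdamW
    # seed, lr_scheduler_type, and mixed precision (fp16/bf16) are not
    # specified in the diff; Trainer defaults (seed=42, linear schedule,
    # full precision) would apply unless set explicitly.
)
```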