Pulsar_7B / README.md
metadata
language:
  - en
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
  - trl
  - dpo
  - uncensored
base_model: MTSAIR/multi_verse_model
library_name: transformers
datasets:
  - grimulkan/theory-of-mind
  - grimulkan/physical-reasoning
  - ResplendentAI/Luna_Alpaca
  - unalignment/toxic-dpo-v0.2
  - kira/math-dpo
  - athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW-v1-SHUFFLED

Uploaded model

  • Developed by: rmdhirr
  • License: apache-2.0
  • Finetuned from model: MTSAIR/multi_verse_model

This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.
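Since the metadata lists `transformers` as the library, loading the model might look roughly like the sketch below. The repository id `rmdhirr/Pulsar_7B` is an assumption pieced together from the developer name and model name on this card, and the helper names `load_pulsar` and `generate` are hypothetical. The card does not specify a prompt template, so the sketch passes the prompt through as plain text.

```python
# Assumption: the repo id is the developer name joined with the model name
# shown on this card; adjust it if the actual repository differs.
MODEL_ID = "rmdhirr/Pulsar_7B"


def load_pulsar(model_id: str = MODEL_ID):
    """Load the tokenizer and model (hypothetical helper, not part of the card)."""
    # Imported lazily so that importing this module stays cheap.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype="auto",   # use the dtype stored in the checkpoint
        device_map="auto",    # requires the `accelerate` package
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a completion for a plain-text prompt (no chat template assumed)."""
    tokenizer, model = load_pulsar()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Explain what a pulsar is in one sentence.")` would download the weights on first use, so expect a multi-gigabyte download for a 7B model.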