base_model: meta-llama/Llama-3.2-1B-Instruct
library_name: peft
Model Card for Model ID
Fine Tuned meta-llama/Llama-3.2-1B-Instruct using a small sample (3000 examples) of https://huggingface.co./datasets/mlabonne/orpo-dpo-mix-40k:
hf (pretrained=meta-llama/Llama-3.2-1B-Instruct,dtype=float), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: auto:4 (64,64,64,64,64)
Tasks | Version | Filter | n-shot | Metric | Value | Stderr | ||
---|---|---|---|---|---|---|---|---|
eq_bench | 2.1 | none | 0 | eqbench | ↑ | 22.8199 | ± | 3.3087 |
none | 0 | percent_parseable | ↑ | 97.6608 | ± | 1.1592 | ||
Model Description | ||||||||
Developed by: Meta AI (assuming the model is related to Meta’s Llama series) |
Funded by [optional]: Meta Platforms, Inc. (or similar organization if applicable)
Shared by [optional]: Hugging Face (if the model is hosted there)
Model type: Language Model (LM), specifically an Instruction-tuned transformer model
Language(s) (NLP): Primarily English (confirm the language capabilities as needed)
License: Apache 2.0 License (check if the original model has a specific license)
Finetuned from model [optional]: Llama-3.2-1B-Instruct (ensure the base model details are accurate)
Model Sources [optional]:
Repository: [Hugging Face model repository link if available] Dataset: https://huggingface.co./datasets/mlabonne/orpo-dpo-mix-40k Paper [optional]: [Link to any relevant papers or documentation about the model or its base architecture]
Demo [optional]: [Link to any demos if available, e.g., Hugging Face Spaces]
Uses Direct Use:
Intended for natural language understanding and generation tasks, including but not limited to question answering, text summarization, and conversational AI applications. Downstream Use [optional]:
Can be applied in various applications such as chatbots, virtual assistants, educational tools, content generation, and other NLP tasks. Out-of-Scope Use:
May not be suitable for applications requiring high-stakes decision-making, sensitive data processing, or contexts where ethical considerations are paramount. Bias, Risks, and Limitations Bias:
As with many language models, the model may inherit biases present in the training data, potentially reflecting cultural, social, or gender biases. Risks:
There is a risk of generating inappropriate or harmful content if not carefully monitored. Users should implement safety checks when deploying this model in applications. Limitations:
The model’s performance may vary based on the input complexity and domain. It may also struggle with understanding nuanced or context-heavy queries. Performance metrics from evaluation tasks should be considered when assessing its utility. Performance Metrics (from your data) Task: eq_bench
Version: 2.1 n-shot: 0 Metric: eqbench Value: 22.8199 Stderr: ± 3.3087 Task: none
n-shot: 0 Metric: percent_parseable Value: 97.6608 Stderr: ± 1.1592 Additional Notes Ensure that you validate all the placeholders with accurate and specific details about the model, its developers, and its applications as needed.