---
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
library_name: peft
datasets:
  - sanaa-11/math-dataset
language:
  - fr
---

# Model Card for LLaMA 3.1 Fine-Tuned Model

## Model Details

### Model Description

- **Developed by:** Sanaa Abril
- **Model type:** Fine-tuned causal language model
- **Language(s) (NLP):** French
- **License:**
- **Finetuned from model:** Meta LLaMA 3.1 8B Instruct (meta-llama/Meta-Llama-3.1-8B-Instruct)


## Uses

### Direct Use

- **Primary application:** Generating math exercises in French for Moroccan students, tailored to a given lesson and difficulty level.
- **Example use case:** Educators can input lesson topics to generate corresponding exercises for classroom use or online learning platforms (illustrated below).
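
For illustration, a hypothetical prompt template in Python (the wording mirrors the example prompt shown later in this card; the lesson, difficulty, and level values are placeholders):

```python
# Hypothetical prompt template; lesson, difficulty, and level are placeholders to fill in
lesson = "les fractions"
difficulty = "moyenne"
level = "2ème année collège"
prompt = (
    f"Fournis un exercice basé sur la vie réelle de difficulté {difficulty} "
    f"de niveau {level} sur {lesson}."
)
```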

### Downstream Use

- **Potential applications:** The model can be extended or adapted to create exercises in other languages or for different educational levels.

### Out-of-Scope Use

- **Not suitable for:** High-stakes assessments; generated exercises may require further validation by subject-matter experts.

## Bias, Risks, and Limitations

- **Bias:** The model may inherit biases from its training data, potentially generating exercises that reflect unintended cultural or linguistic biases.
- **Risks:** It may produce mathematically incorrect exercises or exercises that do not align with the intended curriculum.
- **Limitations:** Accuracy and relevance may degrade for exercises outside the training domain or for advanced mathematical topics not covered during fine-tuning.

### Recommendations

- **For educators:** Review generated exercises for correctness and relevance before using them in a classroom setting.
- **For developers:** Fine-tune the model further or adjust the training data to mitigate biases and improve the quality of the generated content.

## How to Get Started with the Model

Use the following code snippet to load and generate exercises using the model:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

# Base model name
model_name = "meta-llama/Meta-Llama-3.1-8B-Instruct"

# Load the base model
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",               # Adjust based on your environment
    offload_folder="./offload_dir",  # Folder for offloading if necessary
    torch_dtype=torch.float16,       # float16 for better performance on compatible hardware
    revision="main",                 # Specify the correct revision if needed
)

# Load the fine-tuned adapter weights on top of the base model
model = PeftModel.from_pretrained(model, "sanaa-11/mathematic-exercice-generator")

# Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=True)

prompt = "Fournis un exercice basé sur la vie réelle de difficulté moyenne de niveau 2ème année collège sur les fractions."
generated_text = ""

# Generate in several passes, feeding the text produced so far back into the prompt
for _ in range(5):
    inputs = tokenizer(prompt + generated_text, return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs,
        max_length=1065,          # Total length cap (prompt + generation)
        do_sample=True,           # Needed for temperature/top_p to take effect
        temperature=0.7,
        top_p=0.9,
        num_beams=5,
        repetition_penalty=1.2,
        no_repeat_ngram_size=2,
        pad_token_id=tokenizer.eos_token_id,
        early_stopping=False,
    )
    # Decode only the newly generated tokens, not the prompt that was fed in
    new_text = tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    generated_text += new_text
    print(new_text)
```
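
If GPU memory is tight, the base model can also be loaded in 4-bit with bitsandbytes before attaching the adapter. This is a minimal sketch; the NF4 settings below are assumptions in the spirit of the QLoRA training setup described later, not the exact configuration used for this model:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel
import torch

# Assumed 4-bit (NF4) quantization settings
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)
model = PeftModel.from_pretrained(base, "sanaa-11/mathematic-exercice-generator")
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct", use_fast=True)
```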

## Training Details

### Training Data

- **Dataset:** The model was fine-tuned on a custom dataset of about 3.6K rows of math exercises, lesson content, and solutions, written in French and designed for Moroccan students.

### Training Procedure

#### Preprocessing

- **Data cleaning:** Text normalization, tokenization, and padding were applied to prepare the data.
- **Tokenization:** The base model's tokenizer, loaded with Hugging Face's AutoTokenizer, was used to process the text data (see the sketch below).
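
As an illustration, a minimal preprocessing sketch with Hugging Face tokenizers (the column names `lesson`, `exercise`, and `solution` are assumptions; the actual fields of sanaa-11/math-dataset may differ):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct", use_fast=True)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA tokenizers define no pad token by default

def preprocess(example):
    # Hypothetical column names; adapt to the real dataset schema
    text = f"{example['lesson']}\n{example['exercise']}\n{example['solution']}"
    return tokenizer(text, truncation=True, padding="max_length", max_length=1024)
```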

#### Training Hyperparameters

- **Training regime:** The model was fine-tuned with 4-bit quantization and QLoRA to limit GPU and RAM usage, in a Kaggle environment with limited resources (see the configuration sketch below).
- **Batch size:** 1 (with gradient accumulation over 8 steps)
- **Number of epochs:** 8
- **Learning rate:** 5e-5
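
A minimal sketch of a QLoRA setup matching these hyperparameters (the LoRA rank, alpha, dropout, and target modules are assumptions; they were not documented for this model):

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig, TrainingArguments
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# QLoRA-style base model loading: 4-bit NF4 quantization
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Assumed LoRA settings (rank, alpha, dropout, target modules)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Hyperparameters documented above
training_args = TrainingArguments(
    output_dir="./outputs",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    num_train_epochs=8,
    learning_rate=5e-5,
    fp16=True,
    logging_steps=10,
)
```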

## Evaluation

### Testing Data, Factors & Metrics

#### Testing Data

- A separate subset of 10% of the dataset was reserved for evaluation.
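
A minimal sketch of such a split with the datasets library (the split name and seed are assumptions):

```python
from datasets import load_dataset

dataset = load_dataset("sanaa-11/math-dataset", split="train")
splits = dataset.train_test_split(test_size=0.1, seed=42)  # 90% train / 10% held out
train_data, eval_data = splits["train"], splits["test"]
```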

#### Factors

- **Complexity of generated exercises:** Exercises were evaluated on their complexity relative to the intended difficulty level.

#### Metrics

- **Training loss:** The loss measured on the training data during fine-tuning.
- **Validation loss:** The loss measured on the held-out validation split during training.

### Results

- **Training and validation loss:** Over 8 epochs, both training and validation loss decreased steadily, with the largest improvements in the first few epochs. The final validation loss was 0.154888, indicating a good fit to the validation data without significant overfitting.


## Model Examination

- The model demonstrated a consistent reduction in both training and validation loss across the training epochs, suggesting effective learning and generalization from the provided dataset.

## Environmental Impact

### Carbon Emissions

- **Hardware type:** Tesla T4 GPU
- **Hours used:** 12
- **Cloud provider:** Kaggle
- **Carbon emitted:** Not measured; it can be estimated with the Machine Learning Impact calculator of Lacoste et al. (2019), as sketched below.
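
As a rough, illustrative estimate only (the GPU power draw and grid carbon intensity below are assumptions, not measurements):

```python
# Back-of-the-envelope estimate in the spirit of Lacoste et al. (2019); all inputs are assumptions
gpu_power_kw = 0.070       # Tesla T4 TDP is about 70 W
hours = 12                 # Training time reported above
carbon_intensity = 0.4     # Assumed grid average, kg CO2eq per kWh

energy_kwh = gpu_power_kw * hours             # ≈ 0.84 kWh
emissions_kg = energy_kwh * carbon_intensity  # ≈ 0.34 kg CO2eq
print(f"~{energy_kwh:.2f} kWh, ~{emissions_kg:.2f} kg CO2eq")
```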

## Technical Specifications

### Model Architecture and Objective

- The model is based on the LLaMA 3.1 architecture, fine-tuned to generate text in French for educational purposes, specifically math exercises.

### Compute Infrastructure

- The model was trained on Kaggle’s free-tier environment, using a single Tesla T4 GPU.

#### Hardware

- **GPU:** Tesla T4 with 16 GB of GPU memory

#### Software

- **Transformers version:** 4.44.0
- **PEFT version:** 0.12.0

## Citation

**BibTeX:**

```bibtex
@misc{abril2024mathexercisegenerator,
  author    = {Sanaa Abril},
  title     = {Fine-Tuned LLaMA 3.1 for Generating Math Exercises},
  year      = {2024},
  publisher = {Hugging Face},
  note      = {\url{https://huggingface.co./sanaa-11/mathematic-exercice-generator}}
}
```

**APA:**

Abril, S. (2024). *Fine-Tuned LLaMA 3.1 for Generating Math Exercises*. Hugging Face. https://huggingface.co./sanaa-11/mathematic-exercice-generator

### More Information
- For further details or questions, feel free to reach out to the model card authors.

### Model Card Authors
- **Sanaa Abril** - [email protected]

### Framework versions
- **Transformers**: 4.44.0
- **PEFT**: 0.12.0