nikhil07prakash
/

float-7b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

float-7b / README.md

nikhil07prakash's picture

nikhil07prakash

Update README.md

de609e8 verified 11 months ago

|

1.91 kB

	---
	license: mit
	---

	# Model Card for Model ID

	<!-- Provide a quick summary of what the model is/does. -->

	This model is a fully fine-tuned version of the [Llama-7B](https://huggingface.co./huggyllama/llama-7b) model on synthetically generated arithmetic tasks. It was introduced in [this](https://openreview.net/forum?id=8sKcAWOf2D) paper. It is very similar to [Goat-7B](https://github.com/liutiedong/goat), except it was trained without LoRA.

	For inquiries about checkpoints during the fine-tuning process, kindly reach out to [Nikhil](mailto:[email protected]) via email.

	## Model Details

	### Model Description

	<!-- Provide a longer summary of what this model is. -->

	- Developed by: [Nikhil Prakash](https://nix07.github.io/)
	- Model type: Autoregressive Decoder-only Language Model
	- License: MIT License
	- Finetuned from model: [Llama-7B](https://huggingface.co./huggyllama/llama-7b)

	### Model Sources

	<!-- Provide the basic links for the model. -->

	- Repository: [Link](https://github.com/Nix07/finetuning/)
	- Paper : [Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking](https://arxiv.org/abs/2402.14811)

	## How to Get Started with the Model

	Use the code below to get started with the model.

	```python
	from transformers import AutoModel
	model = AutoModel.from_pretrained("nikhil07prakash/float-7b")
	```

	## Citation

	<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->

	BibTeX:

	```python
	@inproceedings{prakash2023fine,
	title={Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking},
	author={Prakash, Nikhil and Shaham, Tamar Rott and Haklay, Tal and Belinkov, Yonatan and Bau, David},
	booktitle={Proceedings of the 2024 International Conference on Learning Representations},
	note={arXiv:2402.14811},
	year={2024}
	}
	```