hezarai
/

crnn-base-fa-64x256

Model card Files Files and versions Community

crnn-base-fa-64x256 / README.md

arxyzan's picture

Update README.md

5e866f6 about 1 year ago

|

775 Bytes

	---
	language:
	- fa
	pipeline_tag: image-to-text
	---

	A CRNN model for Persian OCR. This model is based on a simple CNN + LSTM architecture inspired by [this paper](https://arxiv.org/abs/1507.05717).
	More info about data and training will be provided soon.

	Note that this model is only optimized for scanned documents and supports input characters of up to 32 (For an end-to-end OCR pipeline, use a text detector
	model first to extract text boxes preferrably in word-level and then use this model), but it can be used to be fine-tuned on other domains like license plate
	or handwritten texts.

	#### Usage
	```
	pip install hezar
	```

	```python
	from hezar import Model

	crnn = Model.load("hezarai/crnn-base-fa-64x256")
	texts = crnn.predict(["sample_image.jpg"])
	print(texts)
	```