metadata
license: mit
title: Omniparser-api
sdk: docker
emoji: 😻
colorFrom: red
colorTo: yellow
Omniparser API
The Omniparser API is a versatile and efficient tool designed to parse, process, and analyze various types of documents or datasets using machine learning models.
Features
- Upload and process documents (e.g., images, PDFs).
- Detect objects, text, or patterns within uploaded files.
- Analyze and parse structured or unstructured content.
- Highly configurable thresholds for precision and flexibility.
How to Use
- Upload a Document: Send a file (e.g., an image or PDF) via the
/process/
endpoint. - Adjust Thresholds: Configure
box_threshold
andiou_threshold
for desired accuracy. - Receive Results: Get a JSON response with parsed content and processed outputs.
Endpoints
GET /
: Welcome page for the API.POST /process/
: Upload and process a document with configurable thresholds.
Installation
- Clone the repository:
git clone https://github.com/your-username/omniparser-api.git cd omniparser-api