metadata

license: mit
title: Omniparser-api
sdk: docker
emoji: 😻
colorFrom: red
colorTo: yellow

Omniparser API

The Omniparser API is a versatile and efficient tool designed to parse, process, and analyze various types of documents or datasets using machine learning models.

Features

Upload and process documents (e.g., images, PDFs).
Detect objects, text, or patterns within uploaded files.
Analyze and parse structured or unstructured content.
Highly configurable thresholds for precision and flexibility.

How to Use

Upload a Document: Send a file (e.g., an image or PDF) via the /process/ endpoint.
Adjust Thresholds: Configure box_threshold and iou_threshold for desired accuracy.
Receive Results: Get a JSON response with parsed content and processed outputs.

Endpoints

GET /: Welcome page for the API.
POST /process/: Upload and process a document with configurable thresholds.

Installation

Clone the repository:

git clone https://github.com/your-username/omniparser-api.git
cd omniparser-api