Spaces:
Running
Running
metadata
title: OSINT Tool
emoji: 🏢
colorFrom: gray
colorTo: indigo
sdk: streamlit
sdk_version: 1.42.2
app_file: app.py
pinned: false
license: mit
OSINT Tool & GitHub Repository Analysis
Overview
This project is designed to perform Open Source Intelligence (OSINT) analysis on GitHub repositories and fetch titles from URLs. It also provides functionalities to upload datasets in CSV format for fine-tuning machine learning models. Currently, it supports fine-tuning models like distilbert-base-uncased
for sequence classification tasks.
Features
- GitHub Repository Analysis: Analyze GitHub repositories by entering the repository owner and name.
- URL Title Fetcher: Fetch titles from given URLs.
- Dataset Upload & Model Fine-Tuning: Upload CSV files for fine-tuning models and perform sequence classification tasks.
Prerequisites
Before running the project, make sure you have the following dependencies installed:
- Python 3.6 or higher
- PyTorch (for model fine-tuning)
- Hugging Face Transformers
- Other dependencies listed in
requirements.txt
Badges
Installation
1. Clone the repository
git clone https://github.com/<username>/<repository>.git
cd <repository>
2. Create and activate a virtual environment
For Linux/MacOS:
python3 -m venv venv
source venv/bin/activate
For Windows:
python -m venv venv
.\venv\Scripts\activate
3. Install dependencies
pip install -r requirements.txt
4. Install PyTorch (if not already installed)
pip install torch
Usage
Running the Application
To run the application, simply execute the following command:
python app.py
Features
GitHub Repository Analysis
- Enter the GitHub repository owner and name.
- The application will fetch details and analyze the repository.
URL Title Fetcher
- Enter a URL, and the application will fetch the title of the page.
Dataset Upload & Model Fine-Tuning
- Upload a CSV file (limit 200MB).
- Select the model for fine-tuning (e.g.,
distilbert-base-uncased
). - Fine-tune the model for sequence classification tasks.
Example CSV Format for Fine-Tuning
text,label
"This is an example sentence.",1
"This is another example.",0
Running the Model Fine-Tuning
python fine_tune.py --model distilbert-base-uncased --data dataset.csv
Contributing
Contributions are welcome! Please open an issue or submit a pull request with your suggestions or fixes.
Steps to Contribute
- Fork the repository.
- Clone your fork:
git clone https://github.com/<your-username>/<repository>.git
- Create a branch:
git checkout -b feature/your-feature
- Commit your changes:
git commit -m "Add feature"
- Push to the branch:
git push origin feature/your-feature
- Open a pull request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
- Hugging Face for providing the pre-trained models.
- PyTorch for deep learning frameworks.
- Shields.io for providing awesome badges.