--- title: Llama 3.2 WebGPU emoji: 🦙 colorFrom: green colorTo: pink sdk: static pinned: false license: apache-2.0 models: - onnx-community/Llama-3.2-1B-Instruct-q4f16 short_description: A powerful AI chatbot that runs locally in your browser thumbnail: https://huggingface.co./spaces/webml-community/llama-3.2-webgpu/resolve/main/banner.png --- # Llama-3.2 WebGPU A simple React + Vite application for running [Llama-3.2-1B-Instruct](https://huggingface.co./onnx-community/Llama-3.2-1B-Instruct-q4f16), a powerful small language model, locally in the browser using Transformers.js and WebGPU-acceleration. ## Getting Started Follow the steps below to set up and run the application. ### 1. Clone the Repository Clone the examples repository from GitHub: ```sh git clone https://github.com/huggingface/transformers.js-examples.git ``` ### 2. Navigate to the Project Directory Change your working directory to the `llama-3.2-webgpu` folder: ```sh cd transformers.js-examples/llama-3.2-webgpu ``` ### 3. Install Dependencies Install the necessary dependencies using npm: ```sh npm i ``` ### 4. Run the Development Server Start the development server: ```sh npm run dev ``` The application should now be running locally. Open your browser and go to `http://localhost:5173` to see it in action.