victor (Victor Mustar)

reacted to nicolay-r's post with 👀 about 13 hours ago

Post

1548

📢 For those who start to work with LLM streaming in web, here is a minimalistic example in JS for accessing server hosted by FastAPI via REST:
https://gist.github.com/nicolay-r/840425749cf6d3e397da3d329e894d59

The code above is a revised verison for accessing Replicate API posted earlier
https://huggingface.co./posts/nicolay-r/390307941200307

The key difference from Replicate API:
- using only POST for passing a body with parameters and fetching the reader.

reacted to clem's post with 🔥 about 13 hours ago

Post

1881

We crossed 1B+ tokens routed to inference providers partners on HF, that we released just a few days ago.

Just getting started of course but early users seem to like it & always happy to be able to partner with cool startups in the ecosystem.

Have you been using any integration and how can we make it better?

https://huggingface.co./blog/inference-providers

reacted to sayakpaul's post with 🔥 about 13 hours ago

Post

1379

Inference-time scaling meets Flux.1-Dev (and others) 🔥

Presenting a simple re-implementation of "Inference-time scaling diffusion models beyond denoising steps" by Ma et al.

I did the simplest random search strategy, but results can potentially be improved with better-guided search methods.

Supports Gemini 2 Flash & Qwen2.5 as verifiers for "LLMGrading" 🤗

The steps are simple:

For each round:

1> Starting by sampling 2 starting noises with different seeds.
2> Score the generations w.r.t a metric.
3> Obtain the best generation from the current round.

If you have more compute budget, go to the next search round. Scale the noise pool (2 ** search_round) and repeat 1 - 3.

This constitutes the random search method as done in the paper by Google DeepMind.

Code, more results, and a bunch of other stuff are in the repository. Check it out here: https://github.com/sayakpaul/tt-scale-flux/ 🤗

reacted to Quazim0t0's post with 👍 about 13 hours ago

Post

1131

My first attempt at using SmolAgents:
Quazim0t0/CSVAgent

The video attached was an example for this space.

Based on ZennyKenny's SqlAgent:
ZennyKenny/sqlAgent

You can upload a CSV file and it will automatically populate the table, then you can ask questions about the data.

Grab a sample CSV file here: https://github.com/datablist/sample-csv-files

The questions that can be asked may be limited.

_______________________
Second: Quazim0t0/TXTAgent
Created an Agent that converts a .txt file into a CSV file, then you can ask about the data and also download the CSV file that was generated.

_______________________
Third: Quazim0t0/ReportAgent
Upload Multiple TXT/DOC files to then generate a report from those files.

_______________________
Lastly: Quazim0t0/qResearch
A Research tool that uses DuckDuckGo for Web Searches, Wikipedia and tries to refine the answers in MLA Format.

reacted to prithivMLmods's post with 🚀 about 13 hours ago

Post

4127

The last week of Impression Craft Arts and sketches from strangerzonehf🎨🧑🏻‍🎨

- Collection : strangerzonehf/Flux-Ultimate-LoRA-Collection

Adapters:
+ Ld-Art : strangerzonehf/Ld-Art
+ Animeopix-Flux : strangerzonehf/Animeopix-Flux
+ Flux-Super-Paint-LoRA : strangerzonehf/Flux-Super-Paint-LoRA
+ CinematicShot-Pics-Flux : strangerzonehf/cinematicShot-Pics-Flux
+ Oil-Wall-Art-Flux : strangerzonehf/Oil-Wall-Art-Flux
+ Pixelo-Flux : strangerzonehf/Pixelo-Flux
+ Abstract-Shattered : strangerzonehf/Abstract-Shattered
+ Neon-Impressionism-Flux : strangerzonehf/Neon-Impressionism-Flux
+ NewG-Art : strangerzonehf/NewG-Art

🪧Demo : prithivMLmods/FLUX-LoRA-DLC
🤗Page : https://huggingface.co./strangerzonehf

reacted to ZennyKenny's post with 👍 1 day ago

Post

1799

Okay this is pretty crazy. Snowflake has CortexAI and Uber is already teasing QueryGPT, both of which prominently feature plain text to SQL features to query your database.

I decided to see how hard it would be to put together something similar using 🤗 smolagents. Turns out, it was pretty straightforward. I managed to get it done in London Luton airport this afternoon.

ZennyKenny/sqlAgent

2 replies

·

reacted to Jaward's post with 👀 1 day ago

Post

2732

Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

1 reply

·

reacted to Akjava's post with 🔥 1 day ago

Post

2275

I shared smolagents examples

Akjava/open_Deep-Research-DuckDuckGo
Akjava/open_Deep-Research-DuckDuckGo-Groq

Replacing img-src to "#" in mdconvert.py help reducing tokens
I added translate final answer to my language

reacted to schuler's post with 😎 1 day ago

Post

2883

🔮 GPT-3 implemented in pure Free Pascal!
https://github.com/joaopauloschuler/gpt-3-for-pascal

This implementation follows the GPT-3 Small architecture from the landmark paper "Language Models are Few-Shot Learners":

┌─────────────────────────┐
│     Input Layer       │
├─────────────────────────┤
│ Token & Positional    │
│     Embedding         │
├─────────────────────────┤
│   12x Transformer     │
│      Blocks           │
│  - 12 heads           │
│  - 768 hidden dims    │
│  - 3072 intermediate  │
├─────────────────────────┤
│   Output Layer        │
└─────────────────────────┘

Clean Pascal Implementation

for CntLayer := 1 to {Layers=}12 do
begin
  Result.AddTransformerBlockCAI(
    {Heads=}12, 
    {intermediate dimensions=}4*768, 
    {NoForward=}true, 
    {HasNorm=}true, 
    false
  );
end;

reacted to onekq's post with 👍 4 days ago

Post

1728

R1 is still trending. Here is a collection of works trying to replicate R1.
onekq-ai/r1-reproduction-works-67a93f2fb8b21202c9eedf0b

Players include Huggingface (Open R1), Stanford (simple scaling), Berkeley (Bespoke, Open thoughts, etc.), ServiceNow, etc. I know there is another work from HKUST but couldn't find it on 🤗. Let me know if I miss any teams.

5 replies

·

reacted to AdinaY's post with 🚀 4 days ago

Post

2478

Ovis2 🔥 a multimodal LLM released by Alibaba AIDC team.
AIDC-AI/ovis2-67ab36c7e497429034874464
✨1B/2B/4B/8B/16B/34B
✨Strong CoT for deeper problem solving
✨Multilingual OCR – Expanded beyond English & Chinese, with better data extraction

reacted to mrzjy's post with 👀 4 days ago

Post

1246

A very small project:

Introducing CreativeTinyZero:
mrzjy/Qwen2.5-1.5B-GRPO-Creative-Ad-Generation

Unlike the impressive DeepSeek-R1(-Zero), this project focuses on a pure reinforcement learning (RL) experiment applied to an open-domain task: creative advertisement generation.

Objective:

- To investigate the feasibility of applying R1-like methods to an open-domain task without a verifiable ground-truth reward, while at least demonstrating its potential.
- To explore whether <think> and <answer> rewards can be explicitly designed to provide strong guidance through RL based on human prior knowledge.

Note:
- Our goal is not to induce self-reflective thinking, but to align with human thought processes purely through RL, without any supervised fine-tuning (SFT) on any constructed dataset.

Despite its small size, the resulting 1.5B-GRPO model demonstrates intriguing generative capabilities—though it's still far from perfect.

1 reply

·

reacted to Duskfallcrew's post with 👍 4 days ago

Post

1395

I don't have the stamina to port my articles tonight, i've been dealign with my CPTSD seizures again - but here's a fun update over on Bluesky!
https://bsky.app/profile/duskfallcrew.bsky.social/post/3li4zwdhy5c2q
HF's been my open source home since before i got on Civitai, and while i've largely left Civitai, i can't leave AI yet.
SO if y'all don't mind me trying to rebuild my "empire" one nerd block at a time, i'll keep my content easily accessible, :)

OH PSST New AI/ML Discord i made recently:
(It's also a shill for my main twitch/media/music hub)

Join us on this journey. Welcome to Ktiseos Nyx.

Our Discord:
https://discord.gg/HhBSvM9gBY

Earth & Dusk Media
https://discord.gg/5t2kYxt7An

:3 Cant' wait to hang out, and i've always linked back to HF for my E&D. content in terms of my lora backups and checkpoints!

Y'all who make diffusers versions of my content:
YOU ROCK. Do me a smidge favor: :3 aside from linking back can you maaaaaybe add the new K/N discord on there?

it's my geeky new AI safe space. XD
Also yea, if you've watched the new Beetlejuice movie, you know that i will never quit the ectoplasmic nerd train XD

1 reply

·

reacted to as-cle-bert's post with 🚀 4 days ago

Post

1336

𝐒𝐜𝐢𝐍𝐞𝐰𝐬𝐁𝐨𝐭 - 𝐑𝐞𝐩𝐨𝐫𝐭 𝐝𝐚𝐢𝐥𝐲 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐧𝐞𝐰𝐬 𝐨𝐧 𝐁𝐥𝐮𝐞𝐒𝐤𝐲

GitHub 👉 https://github.com/AstraBert/SciNewsBot
BlueSky 👉 https://bsky.app/profile/sci-news-bot.bsky.social

Hi there HF Community!🤗
I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :)

The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky📰

The cool thing? SciNewsBot is open-source and is cheap to maintain, as it is based on mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it on cloud through a comfy Docker setup🐋

Have fun and spread Science!✨

reacted to m-ric's post with 🚀 4 days ago

Post

2236

For those who haven't come across it yet, here's a handy trick to discuss an entire GitHub repo with an LLM:

=> Just replace "github" with "gitingest" in the url, and you get the whole repo as a single string that you can then paste in your LLMs

reacted to regisss's post with 🔥 4 days ago

Post

1558

Nice paper comparing the fp8 inference efficiency of Nvidia H100 and Intel Gaudi2: An Investigation of FP8 Across Accelerators for LLM Inference (2502.01070)

The conclusion is interesting: "Our findings highlight that the Gaudi 2, by leveraging FP8, achieves higher throughput-to-power efficiency during LLM inference"

One aspect of AI hardware accelerators that is often overlooked is how they consume less energy than GPUs. It's nice to see researchers starting carrying out experiments to measure this!

Gaudi3 results soon...

reacted to nicolay-r's post with 👍 4 days ago

Post

1455

📢 If you're around Replicate AI models and wish to use them in streaming mode via JS, then this snippet might be a quick way to experiment with streaming API usage:
https://gist.github.com/nicolay-r/86fc212086c0955d541244253ec0564b

Why it matters? The original docs has:
🟢 No the relate support for JS rather only Python/HTTP and NodeJS by using the replicate package.
🟢 Mixture of NodeJS and bash curl snippets:
https://replicate.com/docs/topics/predictions/streaming

Special thanks to the reated template for accessing APIs of other vendors like Claude / OpenAI by Simon Willson in the following post:
https://til.simonwillison.net/llms/streaming-llm-apis

Default model: meta-llama/Meta-Llama-3-70B

PS: I am happy and open for your comments related to this solution

reacted to m-ric's post with 👀 4 days ago

Post

2460

𝗚𝗿𝗲𝗮𝘁 𝗳𝗲𝗮𝘁𝘂𝗿𝗲 𝗮𝗹𝗲𝗿𝘁: you can now share agents to the Hub! 🥳🥳

And any agent pushed to Hub get a cool Space interface to directly chat with it.

This was a real technical challenge: for instance, serializing tools to export them meant that you needed to get all the source code for a tool, verify that it was standalone (not relying on external variables), and gathering all the packages required to make it run.

Go try it out! 👉 https://github.com/huggingface/smolagents

2 replies

·

reacted to davanstrien's post with 👍 4 days ago

Post

1253

Made some significant updates to my 🤗 semantic datasets search app. If you love falling into a wiki black hole, you might like this...

librarian-bots/huggingface-datasets-semantic-search

reacted to AdinaY's post with 🔥 5 days ago

Post

3485

InspireMusic 🎵🔥 an open music generation framework by Alibaba FunAudio Lab
Model: FunAudioLLM/InspireMusic-1.5B-Long
Demo: FunAudioLLM/InspireMusic
✨ Music, songs, audio - ALL IN ONE
✨ High quality audio: 24kHz & 48kHz sampling rates
✨ Long-Form Generation: enables extended audio creation
✨ Efficient Fine-Tuning: precision (BF16, FP16, FP32) with user-friendly scripts

1 reply

·

Victor Mustar PRO

AI & ML interests

Recent Activity

Organizations

victor's activity