Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
26.7
TFLOPS
2
Rizki Firmansyah
Rizki-firman
Follow
0 followers
ยท
8 following
https://cdn-uploads.huggingface.co/production/uploads/5f17f0a0925b9863e28ad517/yq2InsCsmbfigMaTdPeUp.png
RizAI
AI & ML interests
None yet
Recent Activity
reacted
to
m-ric
's
post
with ๐ฅ
3 days ago
Today we make the biggest release in smolagents so far: ๐๐ฒ ๐ฒ๐ป๐ฎ๐ฏ๐น๐ฒ ๐๐ถ๐๐ถ๐ผ๐ป ๐บ๐ผ๐ฑ๐ฒ๐น๐, ๐๐ต๐ถ๐ฐ๐ต ๐ฎ๐น๐น๐ผ๐๐ ๐๐ผ ๐ฏ๐๐ถ๐น๐ฑ ๐ฝ๐ผ๐๐ฒ๐ฟ๐ณ๐๐น ๐๐ฒ๐ฏ ๐ฏ๐ฟ๐ผ๐๐๐ถ๐ป๐ด ๐ฎ๐ด๐ฒ๐ป๐๐! ๐ฅณ Our agents can now casually open up a web browser, and navigate on it by scrolling, clicking elements on the webpage, going back, just like a user would. The demo below shows Claude-3.5-Sonnet browsing GitHub for task: "Find how many commits the author of the current top trending repo did over last year." Hi @mlabonne ! Go try it out, it's the most cracked agentic stuff I've seen in a while ๐คฏ (well, along with OpenAI's Operator who beat us by one day) For more detail, read our announcement blog ๐ https://huggingface.co./blog/smolagents-can-see The code for the web browser example is here ๐ https://github.com/huggingface/smolagents/blob/main/examples/vlm_web_browser.py
upvoted
a
paper
24 days ago
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
upvoted
a
paper
24 days ago
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
View all activity
Organizations
None yet
spaces
2
Sort:ย Recently updated
No application file
๐
Firman
No application file
๐
Riz AI
PDM
models
None public yet
datasets
None public yet