28 27 208

Kaizhao Liang PRO

kz919

https://kyleliang919.github.io/

AI & ML interests

Search = AGI?

Recent Activity

reacted to maxiw's post with 🤗 3 days ago

You can now try out computer use models from the hub to automate your local machine with https://github.com/askui/vision-agent. 💻 ``` import time from askui import VisionAgent with VisionAgent() as agent: agent.tools.webbrowser.open_new("http://www.google.com") time.sleep(0.5) agent.click("search field in the center of the screen", model_name="Qwen/Qwen2-VL-7B-Instruct") agent.type("cats") agent.keyboard("enter") time.sleep(0.5) agent.click("text 'Images'", model_name="AskUI/PTA-1") time.sleep(0.5) agent.click("second cat image", model_name="OS-Copilot/OS-Atlas-Base-7B") ``` Currently these models are integrated with Gradio Spaces API. Also planning to add local inference soon! Currently supported: - https://huggingface.co./Qwen/Qwen2-VL-7B-Instruct - https://huggingface.co./Qwen/Qwen2-VL-2B-Instruct - https://huggingface.co./AskUI/PTA-1 - https://huggingface.co./OS-Copilot/OS-Atlas-Base-7B

reacted to maxiw's post with 🚀 3 days ago

reacted to maxiw's post with 👍 3 days ago

View all activity

Organizations

kz919's activity

upvoted a paper 9 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 10 days ago • 25

upvoted an article 10 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

13 days ago

• 679

upvoted an article 12 days ago

Article

Welcome to Inference Providers on the Hub 🔥

13 days ago

• 294

upvoted a paper 22 days ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 6

upvoted a paper about 2 months ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 58

upvoted a paper 2 months ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 19

upvoted a paper 3 months ago

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 15

upvoted a paper 5 months ago

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 93

upvoted a paper 6 months ago

Memory-Efficient LLM Training with Online Subspace Descent

Paper • 2408.12857 • Published Aug 23, 2024 • 14

upvoted an article 6 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 173

upvoted a paper 7 months ago

Longhorn: State Space Models are Amortized Online Learners

Paper • 2407.14207 • Published Jul 19, 2024 • 18

upvoted a paper 8 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 89

upvoted an article 8 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 74

upvoted 3 papers 9 months ago

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

Paper • 2402.04347 • Published Feb 6, 2024 • 14

Towards Modular LLMs by Building and Reusing a Library of LoRAs

Paper • 2405.11157 • Published May 18, 2024 • 28

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

Paper • 2405.07518 • Published May 13, 2024 • 26

upvoted a paper 10 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

upvoted a paper 11 months ago

Efficiently Adapting Pretrained Language Models To New Languages

Paper • 2311.05741 • Published Nov 9, 2023 • 11

upvoted 2 papers 12 months ago

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 24

Training-Free Long-Context Scaling of Large Language Models

Paper • 2402.17463 • Published Feb 27, 2024 • 21