Benhao Tang PRO

benhaotang

AI & ML interests

Physics Master student in theoretical particle physics at Universität Heidelberg, actively looking into the possibilities of integrating AI into future physics research.

Recent Activity

updated a model 1 day ago

benhaotang/llama3.2-1B-physics-finetuned

liked a dataset 6 days ago

driaforall/verifiable-pythonic-function-calling-lite

liked a model 6 days ago

driaforall/Tiny-Agent-a-1.5B

View all activity

Organizations

None yet

benhaotang's activity

updated a model 1 day ago

benhaotang/llama3.2-1B-physics-finetuned

Text Generation • Updated 1 day ago • 85

liked a dataset 6 days ago

driaforall/verifiable-pythonic-function-calling-lite

Viewer • Updated 22 days ago • 16.4k • 252 • 6

liked a model 6 days ago

driaforall/Tiny-Agent-a-1.5B

Text Generation • Updated 11 days ago • 195 • 3

liked a model 7 days ago

homebrewltd/AlphaMaze-v0.2-1.5B

Text Generation • Updated 5 days ago • 1.19k • • 83

liked a Space 8 days ago

1.79k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

replied to their post 9 days ago

OK, grok 3 deep research also failed on my benchmark...

And this is the final solution it gives me:

Use wsl --shutdown before hibernating; if it fails, try net stop LxssManager.

What? How about just tell me if WSL have problem, just do not using WSL... How can this be a solution when there is even an official troubleshooting guide that provide more solutions. This is the even worst than gemini and perplexity, at least they read the official guide, just got lost in github issue threads... Now I really want to know how OpenAI's compares to mine, if I have 200 dollars.

upvoted an article 9 days ago

Article

Building a Real-Time Video Chat with Gemini 2.0, Gradio, and WebRTC 👀👂

•

Jan 13

• 6

reacted to schuler's post with 😎 10 days ago

Post

3371

🔮 GPT-3 implemented in pure Free Pascal!
https://github.com/joaopauloschuler/gpt-3-for-pascal

This implementation follows the GPT-3 Small architecture from the landmark paper "Language Models are Few-Shot Learners":

┌─────────────────────────┐
│     Input Layer       │
├─────────────────────────┤
│ Token & Positional    │
│     Embedding         │
├─────────────────────────┤
│   12x Transformer     │
│      Blocks           │
│  - 12 heads           │
│  - 768 hidden dims    │
│  - 3072 intermediate  │
├─────────────────────────┤
│   Output Layer        │
└─────────────────────────┘

Clean Pascal Implementation

for CntLayer := 1 to {Layers=}12 do
begin
  Result.AddTransformerBlockCAI(
    {Heads=}12, 
    {intermediate dimensions=}4*768, 
    {NoForward=}true, 
    {HasNorm=}true, 
    false
  );
end;

liked a model 11 days ago

agents-course/notebooks

Updated 3 days ago • 156

upvoted an article 11 days ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 790

reacted to louisbrulenaudet's post with 🤗 11 days ago

Post

3051

I am pleased to introduce my first project built upon Hugging Face’s smolagents framework, integrated with Alpaca for financial market analysis automation 🦙🤗

The project implements technical indicators such as the Relative Strength Index (RSI) and Bollinger Bands to provide momentum and volatility analysis. Market data is retrieved through the Alpaca API, enabling access to historical price information across various timeframes.

AI-powered insights are generated using Hugging Face’s inference API, facilitating the analysis of market trends through natural language processing with DuckDuckGo search integration for real-time sentiment analysis based on financial news 🦆

Link to the GitHub project: https://github.com/louisbrulenaudet/agentic-market-tool

posted an update 13 days ago

Post

2386

Try out my updated implementation of forked OpenDeepResearcher(link below) as an OpenAI compatible endpoint, but with full control, can be deployed completely free with Gemini api or completely locally with ollama, or pay-as-you-go in BYOK format, the AI agents will think dynamically based on the difficulties of given research, compatible with any OpenAI compatible configurable clients(Msty, Chatbox, even vscode AI Toolkit playground).

If you don't want to pay OpenAI $200 to use or want to take control of your deep research, check out here:
👉 https://github.com/benhaotang/OpenDeepResearcher-via-searxng

**Personal take**

Based on my testing against Perplexity's and Gemini's implementation with some Physics domain questions, mine is comparable and very competent at finding even the most rare articles or methods.

Also a funny benchmark of mine to test all these searching models, is to trouble shot a WSL2 hanging issue I experienced last year, with prompt:

> wsl2 in windows hangs in background with high vmmem cpu usage once in a while, especially after hibernation, no error logs captured in linux, also unable to shutdown in powershell, provide solutions

the final solution that took me a day last year to find is to patch the kernel with some steps documented in carlfriedrich's repo and wait Microsoft to solve it(it is buried deep in wsl issues). Out of the three, only my Deep Research agent has found this solution, Perplexity and Gemini just focus on other force restart or memory management methods. I am very impressed with how it has this kind of obscure and scarce trouble shooting ability.

**Limitations**

Some caveats to be done later:
- Multi-turn conversation is not yet supported, so no follow-up questions
- System message is only extra writing instructions, don't affect on search
- Small local model may have trouble citing source reliably, I am working on a fix to fact check all citation claims