7 4 14

Sam Rahimi

DeFactOfficial

http://defact.org

samrahimi

AI & ML interests

agents, RAG, search-enabled AI, chatbots, fact checking, AI journalism, multimodal pipelines

Recent Activity

new activity 7 days ago

agentica-org/DeepScaleR-1.5B-Preview:Non- Math Use Cases?

reacted to benhaotang's post with 🔥 13 days ago

Try out my updated implementation of forked OpenDeepResearcher(link below) as an OpenAI compatible endpoint, but with full control, can be deployed completely free with Gemini api or completely locally with ollama, or pay-as-you-go in BYOK format, the AI agents will think dynamically based on the difficulties of given research, compatible with any OpenAI compatible configurable clients(Msty, Chatbox, even vscode AI Toolkit playground). If you don't want to pay OpenAI $200 to use or want to take control of your deep research, check out here: 👉 https://github.com/benhaotang/OpenDeepResearcher-via-searxng **Personal take** Based on my testing against Perplexity's and Gemini's implementation with some Physics domain questions, mine is comparable and very competent at finding even the most rare articles or methods. Also a funny benchmark of mine to test all these searching models, is to trouble shot a WSL2 hanging issue I experienced last year, with prompt: > wsl2 in windows hangs in background with high vmmem cpu usage once in a while, especially after hibernation, no error logs captured in linux, also unable to shutdown in powershell, provide solutions the final solution that took me a day last year to find is to patch the kernel with some steps documented in carlfriedrich's repo and wait Microsoft to solve it(it is buried deep in wsl issues). Out of the three, only my Deep Research agent has found this solution, Perplexity and Gemini just focus on other force restart or memory management methods. I am very impressed with how it has this kind of obscure and scarce trouble shooting ability. **Limitations** Some caveats to be done later: - Multi-turn conversation is not yet supported, so no follow-up questions - System message is only extra writing instructions, don't affect on search - Small local model may have trouble citing source reliably, I am working on a fix to fact check all citation claims

new activity 2 months ago

deepseek-ai/DeepSeek-V3:minimum vram?

View all activity

Organizations

None yet

DeFactOfficial's activity

New activity in agentica-org/DeepScaleR-1.5B-Preview 7 days ago

Non- Math Use Cases?

#18 opened 7 days ago by

DeFactOfficial

reacted to benhaotang's post with 🔥 13 days ago

Post

2386

Try out my updated implementation of forked OpenDeepResearcher(link below) as an OpenAI compatible endpoint, but with full control, can be deployed completely free with Gemini api or completely locally with ollama, or pay-as-you-go in BYOK format, the AI agents will think dynamically based on the difficulties of given research, compatible with any OpenAI compatible configurable clients(Msty, Chatbox, even vscode AI Toolkit playground).

If you don't want to pay OpenAI $200 to use or want to take control of your deep research, check out here:
👉 https://github.com/benhaotang/OpenDeepResearcher-via-searxng

**Personal take**

Based on my testing against Perplexity's and Gemini's implementation with some Physics domain questions, mine is comparable and very competent at finding even the most rare articles or methods.

Also a funny benchmark of mine to test all these searching models, is to trouble shot a WSL2 hanging issue I experienced last year, with prompt:

> wsl2 in windows hangs in background with high vmmem cpu usage once in a while, especially after hibernation, no error logs captured in linux, also unable to shutdown in powershell, provide solutions

the final solution that took me a day last year to find is to patch the kernel with some steps documented in carlfriedrich's repo and wait Microsoft to solve it(it is buried deep in wsl issues). Out of the three, only my Deep Research agent has found this solution, Perplexity and Gemini just focus on other force restart or memory management methods. I am very impressed with how it has this kind of obscure and scarce trouble shooting ability.

**Limitations**

Some caveats to be done later:
- Multi-turn conversation is not yet supported, so no follow-up questions
- System message is only extra writing instructions, don't affect on search
- Small local model may have trouble citing source reliably, I am working on a fix to fact check all citation claims

1 reply

New activity in deepseek-ai/DeepSeek-V3 2 months ago

minimum vram?

#9 opened 2 months ago by

CHNtentes

New activity in huggingchat/chat-ui 3 months ago

Advanced overview of the (Dynamic Prompt OR Dynamic links) function

#442 opened 10 months ago by

philosopher-from-god

upvoted a collection 4 months ago

API LMs

Collection

216 items • Updated 24 days ago • 7

liked a Space 4 months ago

182

OmniParser

😻

Convert GUI screen to structured elements

updated a Space 4 months ago

Reasoning Lab

🖼

Try your favorite models with advanced reasoning

replied to ajibawa-2023's post 4 months ago

Is the data synthetic? Scraped? Mix?

reacted to ajibawa-2023's post with ❤️ 4 months ago

Post

3169

New Dataset: Software-Architecture
Link: ajibawa-2023/Software-Architecture

I am releasing a Large Dataset covering topics related to Software-Architecture. This dataset consists of around 450,000 lines of data in jsonl.

I have included following topics:

Architectural Frameworks

Architectural Patterns for Reliability

Architectural Patterns for Scalability

Architectural Patterns

Architectural Quality Attributes

Architectural Testing

Architectural Views

Architectural Decision-Making

Advanced Research

Cloud-Based Architectures

Component-Based Architecture

Data Architecture

Emerging Trends

Event-Driven Architecture

Evolvability and Maintainability

Microservices and Monolithic

Microservices Architecture

Security Architecture

Service-Oriented Architecture

Software Design Principles

and Many More!

This dataset is useful in LLM development. Also those who are working on developing Software development related LLMs then this dataset can be useful.

This dataset is very useful to Researchers as well.

4 replies

updated a Space 4 months ago

MMAPI Service

📢

Generate images and speech from text input

updated a dataset 4 months ago

DeFactOfficial/diffusion-models-hf-hub

Updated Oct 29, 2024 • 54

New activity in Nymbo/FLUX.1-Schnell-Serverless 4 months ago

Parameter Format for Serverless Inference (Text to Image)

#1 opened 5 months ago by

DeFactOfficial

reacted to MonsterMMORPG's post with 🔥 4 months ago

Post

3689

Stability AI published their most power newest model Stable Diffusion 3.5 Large. This model unlike FLUX is full model not distilled and has huge potential. I have done extensive research and publishing all of it in this video regarding how to use SD 3.5 Large with the best settings. Moreover, I am sharing how to use FLUX DEV with the best possible configuration as well. Moreover, I am making a huge comparison between SD 3.5 and FLUX and you are going to learn who is the winner.

https://youtu.be/-zOKhoO9a5s

62 Prompts tested on all experiments to find best Sampler + Scheduler for Stable Diffusion 3.5 Large and SD 3.5 Large vs FLUX DEV > https://youtu.be/-zOKhoO9a5s

FLUX Dev vs SD 3.5 Large fully compared.

SD 3.5 Large FP16 vs Scaled FP8 fully compared.

T5 XXL FP8 vs Scaled FP8 vs FP16 fully compared.

FLUX FP16 vs Scaled FP8 fully compared.

Also how to install SwarmUI on Windows, Massed Compute and RunPod shown in the tutorial.

I have shown how to use FLUX and SD 3.5 Large in details as well.