
Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

new activity about 11 hours ago

openbmb/MiniCPM-o-2_6:It's such an amazing model, but to support multiple languages...

reacted to rubenroy's post with 🔥 1 day ago

🎉 Fully released my newest models trained on my GammaCorpus dataset, Zurich 7B & 14B and Geneva 12B. Here are the model collections:

Zurich: https://huggingface.co./collections/rubenroy/zurich-679b21284e207e2844bc025d
Geneva: https://huggingface.co./collections/rubenroy/geneva-679e33a55d1576319b0d9cd4

If you would like to test them, feel free to visit their spaces:
https://huggingface.co./spaces/rubenroy/Geneva-12B
https://huggingface.co./spaces/rubenroy/Zurich-14B
https://huggingface.co./spaces/rubenroy/Zurich-7B

reacted to chansung's post with 👍 1 day ago

A brief summary of the o3-mini

The OpenAI o3-mini model is a significant improvement over the o1-mini, reaching o1 performance levels. While generally good, its performance isn't universally better than previous models (o1, o1-prev.) or GPT-4o across all benchmarks. This means workflows should be re-evaluated with each model upgrade.

The o3-mini has "low," "medium," and "high" versions, with "low" being the base model used for benchmarking. It's speculated that the higher versions simply involve more processing. A fair comparison with other models like Gemini 2.0 Thinking or DeepSeek-R1 would likely need to use the "low" version and a similar "think more" mechanism.

The system card is recommended reading due to its comprehensive benchmark data.

https://openai.com/index/openai-o3-mini/


Organizations

None yet

YaTharThShaRma999's activity

New activity in openbmb/MiniCPM-o-2_6 about 11 hours ago

It's such an amazing model, but to support multiple languages...

#23 opened 12 days ago by

Jongsim

liked a Space 3 days ago

Running on Zero

💡

Lumina Image 2.0

Generate images with Lumina Image 2.0

New activity in coqui/XTTS-v2 3 days ago

Paid or free for commercial usage?

#106 opened 4 days ago by

banank1989

updated a model 3 days ago

YaTharThShaRma999/ToucanTTS

Updated 3 days ago

liked a model 3 days ago

m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • Updated 4 days ago • 16.4k • 312

reacted to sayakpaul's post with 🚀 4 days ago

Post

1647

We have been cooking a couple of fine-tuning runs on CogVideoX with finetrainers, smol datasets, and LoRA to generate cool video effects like crushing, dissolving, etc.

We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:

* Models and datasets: https://huggingface.co./finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py

1 reply

New activity in stabilityai/stable-diffusion-3.5-medium 4 days ago

Huge memory consumption with SD3.5-medium

#18 opened 2 months ago by

oddball516

upvoted a paper 4 days ago

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Paper • 2501.15654 • Published 8 days ago • 9

reacted to sayakpaul's post with 🔥 4 days ago

Post

1647

1 reply

reacted to victor's post with 😎🚀🔥 6 days ago

Post

2851

Finally, an open-source AI that turns your lyrics into full songs is here—meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot