Yatharth  Sharma's picture

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

None yet

YaTharThShaRma999's activity

reacted to rubenroy's post with πŸ”₯ 1 day ago
reacted to chansung's post with πŸ‘ 1 day ago
view post
Post
2398
A brief summary of the o3-mini

The OpenAI o3-mini model is a significant improvement over the o1-mini, reaching o1 performance levels. While generally good, its performance isn't universally better than previous models (o1, o1-prev.) or GPT-4o across all benchmarks. This means workflows should be re-evaluated with each model upgrade.

The o3-mini has "low," "medium," and "high" versions, with "low" being the base model used for benchmarking. It's speculated that the higher versions simply involve more processing. A fair comparison with other models like Gemini 2.0 Thinking or DeepSeek-R1 would likely need to use the "low" version and a similar "think more" mechanism.

The system card is recommended reading due to its comprehensive benchmark data.

https://openai.com/index/openai-o3-mini/
New activity in coqui/XTTS-v2 3 days ago
reacted to sayakpaul's post with πŸš€ 4 days ago
view post
Post
1647
We have been cooking a couple of fine-tuning runs on CogVideoX with finetrainers, smol datasets, and LoRA to generate cool video effects like crushing, dissolving, etc.

We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:

* Models and datasets: https://huggingface.co./finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py
  • 1 reply
Β·
reacted to sayakpaul's post with πŸ”₯ 4 days ago
view post
Post
1647
We have been cooking a couple of fine-tuning runs on CogVideoX with finetrainers, smol datasets, and LoRA to generate cool video effects like crushing, dissolving, etc.

We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:

* Models and datasets: https://huggingface.co./finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py
  • 1 reply
Β·
reacted to victor's post with πŸ˜ŽπŸš€πŸ”₯ 6 days ago
view post
Post
2851
Finally, an open-source AI that turns your lyrics into full songs is hereβ€”meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot
reacted to clem's post with β€οΈπŸ€—πŸ”₯ 7 days ago
view post
Post
6780
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!