East China Normal University

university

AI & ML interests

None defined yet.

Recent Activity

not-lain 
posted an update 7 days ago
We now have more than 2,000 public AI models using ModelHubMixin 🤗
not-lain 
posted an update 12 days ago
Published a new blog post 📖
In this blog post I walk through the Transformer architecture, emphasizing how tensor shapes propagate through each layer.
🔗 https://huggingface.co./blog/not-lain/tensor-dims
Some interesting takeaways:
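As a loose illustration of the shape bookkeeping the post walks through, here is a minimal PyTorch sketch of a single attention block; the dimensions and layer choices below are arbitrary picks of mine, not taken from the blog post:

import torch
from torch import nn

# arbitrary illustrative dimensions: batch=2, seq_len=8, d_model=64, heads=4
batch, seq_len, d_model, n_heads = 2, 8, 64, 4
x = torch.randn(batch, seq_len, d_model)  # (2, 8, 64)

attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(), nn.Linear(4 * d_model, d_model))

attn_out, attn_weights = attn(x, x, x)
print(attn_out.shape)       # torch.Size([2, 8, 64]) -- attention preserves (batch, seq_len, d_model)
print(attn_weights.shape)   # torch.Size([2, 8, 8])  -- one score per (query, key) pair, averaged over heads
print(ffn(attn_out).shape)  # torch.Size([2, 8, 64]) -- the FFN expands to 4*d_model internally, then projects back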
Sri-Vigneshwar-DJ 
posted an update 14 days ago
Check out phi-4 from Microsoft, dropped a day ago... If you ❤️ the Phi series, then here is the GGUF: Sri-Vigneshwar-DJ/phi-4-GGUF. phi-4 is a highly efficient 14B open LLM that beats much larger models at math and reasoning; check out the evaluations on the Open LLM Leaderboard.

Technical paper: https://arxiv.org/pdf/2412.08905; the data synthesis approach is interesting.
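For reference, one way to pull a quantized file from that GGUF repo is with huggingface_hub; the exact filename below is a placeholder, since I have not checked which quantization variants the repo actually ships:

from huggingface_hub import hf_hub_download

# The filename is hypothetical -- check the repo's file listing for the
# actual quantization variants (Q4_K_M, Q8_0, etc.) it provides.
gguf_path = hf_hub_download(
    repo_id="Sri-Vigneshwar-DJ/phi-4-GGUF",
    filename="phi-4-Q4_K_M.gguf",
)
print(gguf_path)  # local cache path; load it with llama.cpp or another GGUF-compatible runtime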
Sri-Vigneshwar-DJ 
posted an update 17 days ago
Just sharing a thought: I started using DeepSeek V3 a lot, and an idea struck me about agents "orchestrating during inference" on a test-time compute model like DeepSeek V3 or the O1 series.

Agents (instructions + function calls + memory) execute during inference, and based on the output, a decision is made whether to scale up the time spent reasoning or to perform other tasks.
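A rough sketch of that orchestration idea in plain Python: the call_model stub, the confidence field, and the 0.7 threshold are hypothetical stand-ins of mine, not an existing DeepSeek or OpenAI API.

def call_model(prompt: str) -> dict:
    # stand-in for an inference call to a test-time-compute model
    return {"text": "...", "tool_call": None, "confidence": 0.9}

def run_agent(task: str, tools: dict, max_rounds: int = 3) -> str:
    memory: list[str] = []                           # accumulated tool results
    output = {"text": ""}
    for _ in range(max_rounds):
        output = call_model(task + "\n" + "\n".join(memory))
        if output["tool_call"]:                      # the model requested a function call
            name, args = output["tool_call"]
            memory.append(str(tools[name](**args)))  # feed the tool result back as memory
        elif output["confidence"] < 0.7:             # low confidence -> scale up reasoning time
            task = "Think step by step and re-check your answer:\n" + task
        else:
            break                                    # confident enough -> stop spending compute
    return output["text"]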
Sri-Vigneshwar-DJ 
posted an update 19 days ago
Combining smolagents with Anthropic’s best practices simplifies building powerful AI agents:

1. Code-Based Agents: Write actions as Python code, reducing steps by 30%.
2. Prompt Chaining: Break tasks into sequential subtasks with validation gates.
3. Routing: Classify inputs and direct them to specialized handlers.
4. Fallback: Handle tasks even if classification fails.

https://huggingface.co./blog/Sri-Vigneshwar-DJ/building-effective-agents-with-anthropics-best-pra
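As a plain-Python sketch of points 3 and 4 above (routing plus fallback), here is roughly what that pattern looks like; the keyword classifier and the handlers are hypothetical, and this is not the smolagents API itself:

def classify(query: str) -> str | None:
    # stand-in classifier; in practice this would typically be an LLM call
    if "refund" in query.lower():
        return "billing"
    if "error" in query.lower():
        return "support"
    return None  # classification failed

HANDLERS = {
    "billing": lambda q: f"[billing agent] handling: {q}",
    "support": lambda q: f"[support agent] handling: {q}",
}

def route(query: str) -> str:
    label = classify(query)
    handler = HANDLERS.get(label)
    if handler is None:  # fallback: no confident route, use a general handler
        return f"[general agent] handling: {query}"
    return handler(query)

print(route("I want a refund for my order"))  # routed to the billing handler
print(route("Tell me a joke"))                # falls back to the general handler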
not-lain 
posted an update 2 months ago
Ever wondered how you can make an API call to a visual question answering model without sending an image URL? 👀

You can do that by converting your local image to base64 and sending it to the API.

Recently I made some changes to my library "loadimg" that make converting images to base64 a breeze.
🔗 https://github.com/not-lain/loadimg

API request example 🛠️:
from loadimg import load_img
from huggingface_hub import InferenceClient

# convert a local path, URL, PIL image, or numpy array to base64
my_b64_img = load_img(imgPath_url_pillow_or_numpy, output_type="base64")

client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "Describe this image in one sentence."
            },
            {
                "type": "image_url",
                "image_url": {
                    "url": my_b64_img  # base64 allows using images without uploading them to the web
                }
            }
        ]
    }
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=messages,
    max_tokens=500,
    stream=True,
)

for chunk in stream:
    print(chunk.choices[0].delta.content, end="")
not-lain 
posted an update 6 months ago
I am now a Hugging Face fellow 🥳
not-lain 
posted an update 7 months ago
I have finished writing a blog post about building an image-based retrieval system. This is one of the first-ever approaches to building such a pipeline using only open-source models/libraries 🤗

You can check out the blog post at https://huggingface.co./blog/not-lain/image-retriever and the associated Space at not-lain/image-retriever.

✨ If you want to request another blog post, let me know down below or reach out to me through any of my social media.

📖 Happy reading!
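The blog post is the full walkthrough; as a loose sketch of what an open-source image-retrieval pipeline can look like, here is an example using a CLIP checkpoint via sentence-transformers. The model choice and the file names are mine, not necessarily what the post uses.

from sentence_transformers import SentenceTransformer, util
from PIL import Image

# open-source CLIP checkpoint; the blog post may use a different model
model = SentenceTransformer("clip-ViT-B-32")

# hypothetical local files standing in for an image collection
corpus_paths = ["cat.jpg", "dog.jpg", "car.jpg"]
corpus_emb = model.encode([Image.open(p) for p in corpus_paths], convert_to_tensor=True)

# CLIP embeds text and images in the same space, so we can retrieve images from a text query
query_emb = model.encode("a photo of a cat", convert_to_tensor=True)
scores = util.cos_sim(query_emb, corpus_emb)[0]
best = int(scores.argmax())
print(corpus_paths[best], float(scores[best]))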
not-lain 
posted an update 7 months ago
Hello beautiful people.
I wanted to thank everyone who read my blog post, and I am glad to share that we have reached 11,000 readers 🥳
I couldn't have done this without you, so once again, thanks a lot everyone for the support 💖
If you haven't already, you can read my blog post at: https://huggingface.co./blog/not-lain/rag-chatbot-using-llama3
not-lain 
posted an update 8 months ago
It is with great pleasure that I inform you that Hugging Face's ModelHubMixin has reached 200+ models on the Hub 🥳

ModelHubMixin is a class developed by HF to easily integrate AI models with the Hub, and it comes with 3 methods (a short example follows at the end of this post):
* save_pretrained
* from_pretrained
* push_to_hub

Shoutout to @nielsr, @Wauplin, and everyone else at HF for their awesome work 🤗

If you are not familiar with ModelHubMixin and you are looking for extra resources, you might consider:
* docs: https://huggingface.co./docs/huggingface_hub/main/en/package_reference/mixins
* 🔗 blog about training models with the trainer API and using ModelHubMixin: https://huggingface.co./blog/not-lain/trainer-api-and-mixin-classes
* 🔗 GitHub repo with pip integration: https://github.com/not-lain/PyTorchModelHubMixin-template
* 🔗 basic guide: https://huggingface.co./posts/not-lain/884273241241808
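Here is what the three methods look like in practice on any mixin-based model; MyModel is a placeholder for a class that inherits the mixin (a full definition appears in the next post), and the repo id below is made up:

# MyModel is assumed to subclass nn.Module and PyTorchModelHubMixin,
# as shown in the walkthrough in the next post.
model = MyModel(3, 1)

model.save_pretrained("my-awesome-model")               # save weights + config to a local folder
model.push_to_hub("your-username/my-awesome-model")     # upload to the Hub (placeholder repo id)
reloaded = MyModel.from_pretrained("my-awesome-model")  # rebuild from the local folder (or a repo id)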
not-lain 
posted an update 8 months ago
If you're a researcher or developing your own model 👀 you might want to take a look at Hugging Face's ModelHubMixin classes.
They are used to seamlessly integrate your AI model with the Hub and to save/load your model easily 🚀

1️⃣ make sure you're using the appropriate library version
pip install -qU "huggingface_hub>=0.22"

2️⃣ inherit from the appropriate class
from huggingface_hub import PyTorchModelHubMixin
from torch import nn

class MyModel(nn.Module, PyTorchModelHubMixin):
  def __init__(self, a, b):
    super().__init__()
    self.layer = nn.Linear(a, b)
  def forward(self, inputs):
    return self.layer(inputs)

3️⃣ create an instance of the model
first_model = MyModel(3, 1)

4️⃣ push the model to the hub (or use the save_pretrained method to save locally)
first_model.push_to_hub("not-lain/test")

5️⃣ load and initialize the model from the hub using the original class
pretrained_model = MyModel.from_pretrained("not-lain/test")