Evgeniy Hristoforu
ehristoforu
AI & ML interests
Diffusers, LLM and others ML.
Recent Activity
replied to
their
post
4 days ago
Introducing our first standalone model ā FluentlyLM Prinum
Introducing the first standalone model from Project Fluently LM! We worked on it for several months, used different approaches and eventually found the optimal one.
General characteristics:
- Model type: Causal language models (QwenForCausalLM, LM Transformer)
- Number of parameters: 32.5B
- Number of parameters (not embedded): 31.0B
- Number of layers: 64
- Context: 131,072 tokens
- Language(s) (NLP): English, French, Spanish, Russian, Chinese, Japanese, Persian (officially supported)
- License: MIT
Creation strategy:
The basis of the strategy is shown in Pic. 2.
We used Axolotl & Unsloth for SFT-finetuning with PEFT LoRA (rank=64, alpha=64) and Mergekit for SLERP and TIES mergers.
Evolution:
š 12th place in the Open LLM Leaderboard (https://huggingface.co./spaces/open-llm-leaderboard/open_llm_leaderboard#) (21.02.2025)
Detailed results and comparisons are presented in Pic. 3.
Links:
- Model: https://huggingface.co./fluently-lm/FluentlyLM-Prinum
- GGUF version: https://huggingface.co./mradermacher/FluentlyLM-Prinum-GGUF
- Demo on ZeroGPU: https://huggingface.co./spaces/ehristoforu/FluentlyLM-Prinum-demo
reacted
to
their
post
with š„
4 days ago
Introducing our first standalone model ā FluentlyLM Prinum
Introducing the first standalone model from Project Fluently LM! We worked on it for several months, used different approaches and eventually found the optimal one.
General characteristics:
- Model type: Causal language models (QwenForCausalLM, LM Transformer)
- Number of parameters: 32.5B
- Number of parameters (not embedded): 31.0B
- Number of layers: 64
- Context: 131,072 tokens
- Language(s) (NLP): English, French, Spanish, Russian, Chinese, Japanese, Persian (officially supported)
- License: MIT
Creation strategy:
The basis of the strategy is shown in Pic. 2.
We used Axolotl & Unsloth for SFT-finetuning with PEFT LoRA (rank=64, alpha=64) and Mergekit for SLERP and TIES mergers.
Evolution:
š 12th place in the Open LLM Leaderboard (https://huggingface.co./spaces/open-llm-leaderboard/open_llm_leaderboard#) (21.02.2025)
Detailed results and comparisons are presented in Pic. 3.
Links:
- Model: https://huggingface.co./fluently-lm/FluentlyLM-Prinum
- GGUF version: https://huggingface.co./mradermacher/FluentlyLM-Prinum-GGUF
- Demo on ZeroGPU: https://huggingface.co./spaces/ehristoforu/FluentlyLM-Prinum-demo
posted
an
update
4 days ago
Introducing our first standalone model ā FluentlyLM Prinum
Introducing the first standalone model from Project Fluently LM! We worked on it for several months, used different approaches and eventually found the optimal one.
General characteristics:
- Model type: Causal language models (QwenForCausalLM, LM Transformer)
- Number of parameters: 32.5B
- Number of parameters (not embedded): 31.0B
- Number of layers: 64
- Context: 131,072 tokens
- Language(s) (NLP): English, French, Spanish, Russian, Chinese, Japanese, Persian (officially supported)
- License: MIT
Creation strategy:
The basis of the strategy is shown in Pic. 2.
We used Axolotl & Unsloth for SFT-finetuning with PEFT LoRA (rank=64, alpha=64) and Mergekit for SLERP and TIES mergers.
Evolution:
š 12th place in the Open LLM Leaderboard (https://huggingface.co./spaces/open-llm-leaderboard/open_llm_leaderboard#) (21.02.2025)
Detailed results and comparisons are presented in Pic. 3.
Links:
- Model: https://huggingface.co./fluently-lm/FluentlyLM-Prinum
- GGUF version: https://huggingface.co./mradermacher/FluentlyLM-Prinum-GGUF
- Demo on ZeroGPU: https://huggingface.co./spaces/ehristoforu/FluentlyLM-Prinum-demo
Organizations
ehristoforu's activity
Adding Evaluation Results
#1 opened 8 days ago
by
ehristoforu

š© Report: Not working
3
#1106 opened 11 days ago
by
ehristoforu

! Eval error in Llama3.3 70b based model
9
#1089 opened 23 days ago
by
ehristoforu

no Eval for L3.3 70b based model
4
#1091 opened 21 days ago
by
Steelskull

š© Report: Not working
3
#1084 opened 27 days ago
by
ehristoforu

Model results missing
3
#1075 opened about 1 month ago
by
maldv

Adding Evaluation Results
#1 opened 25 days ago
by
ehristoforu

Adding Evaluation Results
#1 opened 25 days ago
by
ehristoforu

š© Report: Not working
4
#1086 opened 25 days ago
by
ehristoforu

š© Report: Not working
11
#1082 opened 30 days ago
by
ehristoforu

Qwen2.5-32B merge eval error
3
#1078 opened about 1 month ago
by
ehristoforu

Model Request: CohereForAI/c4ai-command-r7b-12-2024
#1 opened about 2 months ago
by
ehristoforu

[Feature] Remove "model voting"
3
#1059 opened about 2 months ago
by
T145

ā Too long a process of evolution
1
#1058 opened about 2 months ago
by
ehristoforu

š© Report: Not working
1
#2 opened about 2 months ago
by
ehristoforu

ŠŠ“Šµ Š¾Š±ŠµŃŠ°Š½Š½ŃŠ¹ Š“Š°ŃŠ°ŃŠµŃ?
5
#1 opened about 2 months ago
by
ehristoforu

Adding `safetensors` variant of this model
#1 opened 2 months ago
by
SFconvertbot

š© Report: Legal issue(s)
1
#34 opened 3 months ago
by
ehristoforu

Adding Evaluation Results
#1 opened 3 months ago
by
leaderboard-pr-bot

Adding Evaluation Results
#1 opened 3 months ago
by
leaderboard-pr-bot
