etemiz (Emin Temiz)

posted an update 1 day ago

Post

345

fixing AI one weight at a time

https://huggingface.co./blog/etemiz/ways-to-align-ai-with-human-values

published an article 2 days ago

Article

Ways to Align AI with Human Values

By

•

2 days ago

updated a model 3 days ago

some1nostr/Nostr-Llama-3.1-8B

Updated 3 days ago • 40

New activity in some1nostr/Nostr-Llama-3.1-8B 3 days ago

Update README.md

#1 opened 3 days ago by

etemiz

upvoted an article 4 days ago

Article

Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning

By

•

Jan 19, 2024

• 13

posted an update 6 days ago

Post

526

https://www.youtube.com/watch?v=EMyAGuHnDHk

In the video above some LLMs favored atheist, some favored the believer. In the picture below, atheist favoring LLMs are on the left, believer favoring LLMs are on the right.

The ones on the left are also lower ranking in my leaderboard and the ones on the right are higher ranking. My leaderboard:
https://sheet.zohopublic.com/sheet/published/mz41j09cc640a29ba47729fed784a263c1d08

Coincidence? My leaderboard has more domains. Does ranking high in faith mean ranking high in healthy living, nutrition, bitcoin and nostr on average?

reacted to clem's post with 👍 10 days ago

Post

2682

What are the best organizations to follow on @huggingface ?

On top of my head:
- Deepseek (35,000 followers): https://huggingface.co./deepseek-ai
- Meta Llama (27,000 followers): https://huggingface.co./meta-llama
- Black Forrest Labs (11,000 followers): https://huggingface.co./black-forest-labs
- OpenAI (5,000 followers): https://huggingface.co./openai
- Nvidia (16,000 followers): https://huggingface.co./nvidia
- MIcrosoft (9,000 followers): https://huggingface.co./microsoft
- AllenAI (2,000 followers): https://huggingface.co./allenai
- Mistral (5,000 followers): https://huggingface.co./mistralai
- XAI (600 followers): https://huggingface.co./xai-org
- Stability AI (16,000 followers): https://huggingface.co./stabilityai
- Qwen (16,000 followers): https://huggingface.co./Qwen
- GoogleAI (8,000 followers): https://huggingface.co./google
- Unsloth (3,000 followers): https://huggingface.co./unsloth
- Bria AI (4,000 followers): https://huggingface.co./briaai
- NousResearch (1,300 followers): https://huggingface.co./NousResearch

Bonus, the agent course org with 17,000 followers: https://huggingface.co./agents-course

1 reply

·

posted an update 10 days ago

Post

1786

--- AHA Leaderboard ---

We all want AI to be properly aligned so it benefits humans with every answer it generates. While there are tremendous research around this and so many people working on it, I am choosing another route: Curation of people and then curation of datasets that are used in the LLM training. Curation of datasets comprising of people who try to uplift humanity should result in LLMs that try to help humans.

This work has revolved around two tasks:

1. Making LLMs that are benefiting humans
2. Measuring misinformation in other LLMs

The idea about the second task is, once we make and gather better LLMs and set them as "ground truth" we now can measure how much other LLMs are distancing themselves from those ground truths.
For that I am working on something I will call "AHA Leaderboard" (AHA stands for AI -- human alignment).

Link to the spreadsheet:

https://sheet.zohopublic.com/sheet/published/mz41j09cc640a29ba47729fed784a263c1d08

The columns are ground truths. The rows are the mainstream LLMs. If a mainstream LLM produces similar answers to the ground truth LLM, it gets a higher score. The LLMs that are higher in the leaderboard should be considered aligned with humans. Simple idea. This is like analyzing LLMs in different domains asking hundreds of questions and checking if they match the answers that try to mimic humans that care about other humans. Will it going to be effective? What do you think?

We want mainstream LLMs to copy answers of ground truth LLMs in certain domains. This may refocus AI towards being more beneficial. There have been 5 content providers and 6 curators as of now in the project. Join us and be one of the pioneers that fixed AI! You can be a curator, content provider or general researcher or something else.

liked a model 11 days ago

etemiz/Hoopoe-8B-Llama-3.1

Updated Jan 18 • 33 • 3

liked a model 12 days ago

etemiz/Llama-3.1-405B-Inst-GGUF

Updated Dec 19, 2024 • 129 • 4

posted an update 14 days ago

Post

566

if you are looking for alternative voices, updated the Nostr model
some1nostr/Nostr-Llama-3.1-8B
Note the quants by others are old

posted an update 15 days ago

Post

3810

Some things are simple

commented on Open-R1: Update #1 26 days ago

can we also work together in regards to human alignment?
https://huggingface.co./blog/etemiz/aha-indicator

posted an update 27 days ago

Post

1156

Another AHA moment

https://huggingface.co./blog/etemiz/aha-indicator

published an article 27 days ago

Article

The AHA Indicator

By

•

27 days ago

• 3

posted an update 28 days ago

Post

379

Having bad LLMs is ok and can be utilized well. They can allow us to find ideas that work faster.

Reinforcement algorithm could be: "take what a proper model says and negate what a bad LLM says". Or in a mixture of agents situation we could say refute the bad LLM output and combine with the output of the good LLM.

This could mean having two wings (or more) in search of "ideas that work for most people most of the time".

1 reply

·

replied to their post about 1 month ago

That's a hard question! I think some humans are really creating content for other humans to live happily, healthily and abundantly. I am in favor of giving more weight to those kind of carefully curated humans in the LLM. This can be as simple as pretraining again with their content. I have done that and it works.

Definitely not what the majority says! Majority is often really wrong on many subjects. The mediocrity of current AI systems might be because of this, majority of content is coming from mediocre IQ and EQ and *Q.

A curator council who can choose the "beneficial" humans and the content coming from these can be exaggerated in an LLM, ultimately giving more weight to those thoughts that will be beneficial to many humans most of the time. Ideas that will work in favor of humans in many cases is my definition I guess of human alignment.

posted an update about 1 month ago

Post

531

The beloved DeepSeek R1 has some problems...

Is DeepSeek free or the price we pay is the detachment from human alignment?

https://huggingface.co./blog/etemiz/deepseek-r1-human-alignment

3 replies

·

published an article about 1 month ago

Article

DeepSeek R1 Human Alignment Tests

By

•

Jan 25

• 1

replied to their post about 1 month ago

I am comparing R1's answers to other models that I find 'aligned'. This is my similar work

https://wikifreedia.xyz/based-llm-leaderboard/npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c

I should probably make another leaderboard on HF!

Positive values mean the model is better aligned with aligned models. Negative means their ideas differ.

The idea is find aligned models and use them as benchmarks. I also build models that does well in terms of human alignment according to me. This is mostly a subjective work but if other people is interested we could work together.

Emin Temiz PRO

AI & ML interests

Recent Activity

Organizations

etemiz's activity

Ways to Align AI with Human Values

some1nostr/Nostr-Llama-3.1-8B

Update README.md

Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning

etemiz/Hoopoe-8B-Llama-3.1

etemiz/Llama-3.1-405B-Inst-GGUF

The AHA Indicator

DeepSeek R1 Human Alignment Tests