1697 152 65

Stefan Schweter PRO

stefan-it

AI & ML interests

Flair Library 💕, NER & PoS Tagging, LM Pretraining (mostly encoder-only & encoder-decoder), Historical Language Models

Recent Activity

liked a model about 2 hours ago

chandar-lab/NeoBERT

commented on a paper about 2 hours ago

NeoBERT: A Next-Generation BERT

posted an update about 12 hours ago

After running some 3DMark and FurMark benchmarks on Windows to make sure that my new 5090 is not causing melting cables [1] and some nice shots with a thermal camera (I don't think that's too much), running some fine-tuning experiments with my favorite Flair & Transformers libraries are very easy to perform. Important steps: Good idea is to start with a fresh Ubuntu 24.04 installation with latest CUDA 12.8 and the open NVIDIA driver - follow more advices from [2]: ```bash sudo apt -y install cuda-toolkit-12-8 nvidia-open ``` I tried update from an existing Ubuntu installation with an older CUDA and driver version and it resulted in a non-startable system. If you are using PyTorch 2.6 with built CUDA 12.6 it will result in: ```bash NVIDIA Graphics Device with CUDA capability sm_120 is not compatible with the current PyTorch installation. The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_70 sm_75 sm_80 sm_86 sm_90. ``` But no worries! For PyTorch you need just to use a nightly 2.7 version that was built with CUDA 12.8. This can easily done via: ```bash pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu128 ``` After that the latest Flair version can be installed and fine-tuning will work! References: [1]: https://www.reddit.com/r/nvidia/comments/1inpox7/rtx_50_series_12vhpwr_megathread/ [2]: https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&Distribution=Ubuntu&target_version=24.04&target_type=deb_network

View all activity

Organizations

Posts 3

Post

619

sudo apt -y install cuda-toolkit-12-8 nvidia-open

I tried update from an existing Ubuntu installation with an older CUDA and driver version and it resulted in a non-startable system.

If you are using PyTorch 2.6 with built CUDA 12.6 it will result in:

NVIDIA Graphics Device with CUDA capability sm_120 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_70 sm_75 sm_80 sm_86 sm_90.

But no worries! For PyTorch you need just to use a nightly 2.7 version that was built with CUDA 12.8. This can easily done via:

pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu128

After that the latest Flair version can be installed and fine-tuning will work!

References:

[1]: https://www.reddit.com/r/nvidia/comments/1inpox7/rtx_50_series_12vhpwr_megathread/
[2]: https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&Distribution=Ubuntu&target_version=24.04&target_type=deb_network

View all Posts

Articles 1

Article

Fine-tune Flair Models on NER Dataset with 🤗 AutoTrain SpaceRunner

View all Articles

Collections 14

models 1334

stefan-it/it5-efficient-small-el32

Text2Text Generation • Updated 4 days ago • 131 • 2

stefan-it/bert5urk

Updated about 1 month ago • 38 • 3

stefan-it/bort-full

Fill-Mask • Updated Jan 18 • 86

stefan-it/span-marker-gelectra-large-germeval14

Token Classification • Updated Dec 16, 2024 • 1.76k • 2

stefan-it/zeitungs-lm-v1

Updated Dec 5, 2024 • 30 • 4

stefan-it/wav2vec2-large-xlsr-53-basque

Automatic Speech Recognition • Updated Nov 9, 2024 • 2.42k

stefan-it/german-gpt2-larger

Text Generation • Updated Oct 30, 2024 • 2.03k • • 8

stefan-it/xlstm-german-wikipedia

Text Generation • Updated Sep 26, 2024 • 79 • 7

stefan-it/flair-barner-wiki-coarse-gbert-large

Token Classification • Updated Sep 23, 2024 • 24 • 1

stefan-it/flair-clean-conll-5

Token Classification • Updated Jul 7, 2024 • 17

datasets 12

stefan-it/senti-anno

Viewer • Updated Nov 29, 2024 • 929 • 132

stefan-it/offenseval2020_tr

Viewer • Updated Nov 22, 2024 • 35.3k • 1.29k

stefan-it/dewiki-20230701-nltk-corpus

Viewer • Updated Sep 6, 2024 • 39.4M • 107 • 2

stefan-it/germeval14_no_wikipedia

Preview • Updated May 29, 2024 • 76

stefan-it/histnero

Viewer • Updated May 10, 2024 • 217k • 1.03k

stefan-it/HisGermaNER

Preview • Updated Mar 28, 2024 • 401 • 2

stefan-it/co-funer

Preview • Updated Mar 25, 2024 • 122

stefan-it/german-dbmdz-bert-corpus

Viewer • Updated Dec 22, 2023 • 52.8M • 158 • 2

stefan-it/span-marker-base-model-detection

Viewer • Updated Sep 5, 2023 • 28 • 84

stefan-it/flair-base-model-detection

Viewer • Updated Sep 5, 2023 • 52 • 73 • 1

Stefan Schweter PRO

AI & ML interests

Recent Activity

Organizations

Posts 3

Articles 1

Fine-tune Flair Models on NER Dataset with 🤗 AutoTrain SpaceRunner

Collections 14

Papers 7

spaces 2 Sort: Recently updated

hmLeaderboard

My NER Dataset Annotations

models 1334 Sort: Recently updated

datasets 12 Sort: Recently updated

spaces 2

models 1334

datasets 12