Edward Neuhaus's picture

Edward Neuhaus

Pretergeek

·

https://ko-fi.com/pretergeek

pretergeek

AI & ML interests

NLP, ML, LLMs, AI Ethics, Privacy in AI

Recent Activity

liked a dataset about 18 hours ago

lmms-lab/LLaVA-OneVision-Data

liked a Space about 18 hours ago

opencompass/MMBench

upvoted a collection about 18 hours ago

The Big Benchmarks Collection

View all activity

Organizations

None yet

Pretergeek's activity

upvoted a collection about 18 hours ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 192

upvoted a paper 24 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 26 days ago • 252

upvoted 2 collections 2 months ago

Useful Spaces

13 items • Updated Nov 29, 2024 • 1

OpenChat-3.5-0106 with Extended Context

1 item • Updated Nov 29, 2024 • 1

upvoted a collection 3 months ago

Vision Language Models Papers 🖼️💬📝

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 35

upvoted a paper 3 months ago

Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

Paper • 2409.20537 • Published Sep 30, 2024 • 13

upvoted a collection 3 months ago

RL/Alignment

197 items • Updated Jun 18, 2024 • 23

upvoted a paper 3 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 52

upvoted an article 3 months ago

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

By

•

Oct 21, 2024

• 19

upvoted 2 papers 4 months ago

RoFormer: Enhanced Transformer with Rotary Position Embedding

Paper • 2104.09864 • Published Apr 20, 2021 • 11

Large Language Models Must Be Taught to Know What They Don't Know

Paper • 2406.08391 • Published Jun 12, 2024 • 1

upvoted 3 papers 5 months ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29, 2024 • 53

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 88

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 52

upvoted a paper 6 months ago

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands

Paper • 2408.11048 • Published Aug 20, 2024 • 4

upvoted 2 collections 6 months ago

Emotional Intelligence Datasets

9 items • Updated Nov 2, 2024 • 4

OpenChat-3.5-0106 with Additional Layers

Upscaled models using the Block Expansion method. Unlike the more common DUP Scaling, BE doesn't require fine-tuning to recover lost performance. • 7 items • Updated Nov 29, 2024 • 2

upvoted 3 papers 6 months ago

A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA

Paper • 2312.03732 • Published Nov 28, 2023 • 8

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 26

Rotary Position Embedding for Vision Transformer

Paper • 2403.13298 • Published Mar 20, 2024 • 4