The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15 • 22
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20 • 85
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 26
Evaluating Text-to-Visual Generation with Image-to-Text Generation Paper • 2404.01291 • Published Apr 1 • 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 138
Preference Datasets for KTO Collection This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Jul 30 • 14
NER in Spanish Collection Models fine-tuned for NER in Spanish using the SpanMarker framework with different encoders and datasets. • 3 items • Updated Sep 2 • 4
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 150
Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation Paper • 2403.06164 • Published Mar 10 • 2