Ayami I's picture

50 1

Ayami I

Ayakinokiki

·

AI & ML interests

None yet

Organizations

None yet

Ayakinokiki's activity

upvoted a paper 24 days ago

Efficient Detection of Toxic Prompts in Large Language Models

Paper • 2408.11727 • Published 30 days ago • 11

upvoted 2 papers 2 months ago

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17 • 48

Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

Paper • 2407.10058 • Published Jul 14 • 29

upvoted 2 papers 5 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 68

Scaling Instructable Agents Across Many Simulated Worlds

Paper • 2404.10179 • Published Mar 13 • 25

upvoted 3 papers 6 months ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30 • 40

TC4D: Trajectory-Conditioned Text-to-4D Generation

Paper • 2403.17920 • Published Mar 26 • 15

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20 • 76

upvoted 3 papers 7 months ago

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7 • 40

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5 • 56

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects

Paper • 2402.09052 • Published Feb 14 • 16

upvoted 3 papers 8 months ago

Large-scale Reinforcement Learning for Diffusion Models

Paper • 2401.12244 • Published Jan 20 • 28

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Paper • 2401.12070 • Published Jan 22 • 42

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10 • 63

upvoted a paper 10 months ago

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Paper • 2312.00849 • Published Dec 1, 2023 • 8

upvoted 9 papers 11 months ago

RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation

Paper • 2311.01455 • Published Nov 2, 2023 • 28

Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 18

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

Eureka: Human-Level Reward Design via Coding Large Language Models

Paper • 2310.12931 • Published Oct 19, 2023 • 26

Safe RLHF: Safe Reinforcement Learning from Human Feedback

Paper • 2310.12773 • Published Oct 19, 2023 • 28

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Paper • 2310.11954 • Published Oct 18, 2023 • 24

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

Paper • 2310.11784 • Published Oct 18, 2023 • 10

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 74

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 96

upvoted 2 papers 12 months ago

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 32

Exploring Large Language Models' Cognitive Moral Development through Defining Issues Test

Paper • 2309.13356 • Published Sep 23, 2023 • 36

upvoted 24 papers about 1 year ago

Efficient RLHF: Reducing the Memory Usage of PPO

Paper • 2309.00754 • Published Sep 1, 2023 • 13

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Paper • 2308.05374 • Published Aug 10, 2023 • 27

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Paper • 2308.03526 • Published Aug 7, 2023 • 25

From Sparse to Soft Mixtures of Experts

Paper • 2308.00951 • Published Aug 2, 2023 • 20

Unified Model for Image, Video, Audio and Language Tasks

Paper • 2307.16184 • Published Jul 30, 2023 • 14

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Paper • 2307.16368 • Published Jul 31, 2023 • 11

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Paper • 2307.15818 • Published Jul 28, 2023 • 27

The Hydra Effect: Emergent Self-repair in Language Model Computations

Paper • 2307.15771 • Published Jul 28, 2023 • 18

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Paper • 2307.16789 • Published Jul 31, 2023 • 97

Scaling TransNormer to 175 Billion Parameters

Paper • 2307.14995 • Published Jul 27, 2023 • 21

PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback

Paper • 2307.14936 • Published Jul 27, 2023 • 42

Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition

Paper • 2307.14535 • Published Jul 26, 2023 • 13

Measuring Faithfulness in Chain-of-Thought Reasoning

Paper • 2307.13702 • Published Jul 17, 2023 • 27

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

Paper • 2307.12856 • Published Jul 24, 2023 • 35

3D-LLM: Injecting the 3D World into Large Language Models

Paper • 2307.12981 • Published Jul 24, 2023 • 35

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla

Paper • 2307.09458 • Published Jul 18, 2023 • 10

Learning to Retrieve In-Context Examples for Large Language Models

Paper • 2307.07164 • Published Jul 14, 2023 • 21

Copy Is All You Need

Paper • 2307.06962 • Published Jul 13, 2023 • 33

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

Paper • 2307.07047 • Published Jul 13, 2023 • 15

Kosmos-2: Grounding Multimodal Large Language Models to the World

Paper • 2306.14824 • Published Jun 26, 2023 • 34

Guiding Language Models of Code with Global Context using Monitors

Paper • 2306.10763 • Published Jun 19, 2023 • 7

GLIMMER: generalized late-interaction memory reranker

Paper • 2306.10231 • Published Jun 17, 2023 • 7

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 12

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

Paper • 2306.10968 • Published Jun 19, 2023 • 7