AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models Paper • 2406.10900 • Published Jun 16 • 11
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld Paper • 2311.16714 • Published Nov 28, 2023 • 1
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning Paper • 2310.11716 • Published Oct 18, 2023 • 5
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models Paper • 2310.14566 • Published Oct 23, 2023 • 25
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models Paper • 2306.03082 • Published Jun 5, 2023 • 5