Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation 15 days ago • 11
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 36
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 • 14
Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER Paper • 2407.01272 • Published 4 days ago • 6 • 1
Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity Paper • 2406.17720 • Published 10 days ago • 7 • 1
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published 8 days ago • 12 • 2
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published 16 days ago • 15 • 2
Large Scale Transfer Learning for Tabular Data via Language Modeling Paper • 2406.12031 • Published 18 days ago • 6 • 1
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published 19 days ago • 10 • 1
An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Models are Task-specific Classifiers Paper • 2403.02839 • Published Mar 5 • 1 • 1
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 30 • 8
Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning Paper • 2307.03692 • Published Jul 5, 2023 • 24 • 4
Generative AI for Synthetic Data Generation: Methods, Challenges and the Future Paper • 2403.04190 • Published Mar 7 • 2
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 107 • 11
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models Paper • 2403.16187 • Published Mar 24 • 2
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model Paper • 2403.08350 • Published Mar 13 • 2
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model Paper • 2404.10306 • Published Apr 16 • 2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation Paper • 2403.11808 • Published Mar 18 • 3
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation Paper • 2403.09192 • Published Mar 14 • 2
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Paper • 2403.14608 • Published Mar 21 • 2
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts Paper • 2404.15159 • Published Apr 22 • 2
BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models Paper • 2404.02827 • Published Apr 3 • 2
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension Paper • 2404.17991 • Published Apr 27 • 5
GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning Paper • 2404.09163 • Published Apr 14 • 2
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Paper • 2404.16816 • Published Apr 25 • 1 • 2
Optimizing Language Model's Reasoning Abilities with Weak Supervision Paper • 2405.04086 • Published May 7 • 1 • 3