Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2402.07827

multilingual vision models

Some papers I read for understanding vision models and also adding multilingual capabilities to them

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 84
Visual Instruction Tuning

Paper • 2304.08485 • Published Apr 17, 2023 • 12
Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 36
PALO: A Polyglot Large Multimodal Model for 5B People

Paper • 2402.14818 • Published Feb 22 • 23

LLM Fine-Tuning

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Paper • 2402.14848 • Published Feb 19 • 18
Instruction-tuned Language Models are Better Knowledge Learners

Paper • 2402.12847 • Published Feb 20 • 24
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

Paper • 2402.01739 • Published Jan 29 • 26
Rethinking Interpretability in the Era of Large Language Models

Paper • 2402.01761 • Published Jan 30 • 21
Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6 • 109
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 94
How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15 • 38
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15 • 17
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15 • 35

ggerganov/whisper.cpp

Updated Dec 10, 2023 • 701
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45
CohereForAI/aya-101

Text2Text Generation • Updated Mar 31 • 5.45k • 602

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 94
Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15 • 51

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12 • 45
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

Paper • 2104.08663 • Published Apr 17, 2021 • 3
Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 70
Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15 • 51

Non-English Embeddings and Models

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 28
Contrastive Language-Image Pre-training for the Italian Language

Paper • 2108.08688 • Published Aug 19, 2021 • 2
IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation

Paper • 2203.03759 • Published Mar 7, 2022 • 5
Spanish Pre-trained BERT Model and Evaluation Data

Paper • 2308.02976 • Published Aug 6, 2023 • 3

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs