Hui Sun's picture

Hui Sun

CocoSun

·

AI & ML interests

None yet

Organizations

CocoSun's activity

upvoted a collection 19 days ago

Florence

9 items • Updated 24 days ago • 136

upvoted a collection 20 days ago

MobileNetV4 pretrained weights

Weights for MobileNet-V4 pretrained in timm • 13 items • Updated 14 days ago • 9

upvoted a paper 25 days ago

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published 26 days ago • 47

upvoted 3 collections 26 days ago

The best SDXL models

The best diffusion models (checkpoints) based on SDXL • 7 items • Updated 29 days ago • 4

The best SD1.5 models

The best diffusion models (checkpoints) based on SD1.5 • 6 items • Updated Jun 6 • 3

🎡 Demo MLLMs

6 items • Updated 26 days ago • 1

upvoted a collection 27 days ago

🎠 Demo Diffusions

6 items • Updated 26 days ago • 1

upvoted an article 27 days ago

Article

Making sense of this mess

Jun 7

• 14

upvoted a paper 27 days ago

An Image is Worth 32 Tokens for Reconstruction and Generation

Paper • 2406.07550 • Published 28 days ago • 53

upvoted an article about 1 month ago

Article

MTEB: Massive Text Embedding Benchmark

Oct 19, 2022

• 20

upvoted a paper about 1 month ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29 • 43

upvoted 2 articles about 1 month ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28

• 116

Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages

May 24

• 20

upvoted an article about 2 months ago

Article

Let's talk about LLM evaluation

By

•

May 23

• 93

upvoted a collection about 2 months ago

PaliGemma FT Models

108 items • Updated 12 days ago • 24

upvoted 2 articles about 2 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 163

Article

Hugging Face x LangChain : A new partner package in LangChain

May 14

• 87

upvoted an article 2 months ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

16 days ago

• 48

upvoted 2 collections 3 months ago

OpenELM Pretrained Models

4 items • Updated 20 days ago • 43

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated May 31 • 362

upvoted an article 3 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 82

upvoted 4 papers 3 months ago

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Paper • 2404.08197 • Published Apr 12 • 26

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27 • 21

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27 • 48

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26 • 26

upvoted 4 papers 4 months ago

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Paper • 2403.10516 • Published Mar 15 • 16

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

Paper • 2402.01832 • Published Feb 2 • 4

Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Paper • 2403.07750 • Published Mar 12 • 19

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27 • 88

upvoted a collection 5 months ago

Sora Reference Papers

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Feb 20 • 51

upvoted a paper 5 months ago

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset

Paper • 2402.05937 • Published Feb 8 • 8

upvoted 3 papers 6 months ago

CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Paper • 2401.12208 • Published Jan 22 • 20

Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16 • 35

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 132

upvoted 4 papers 7 months ago

Visual In-Context Prompting

Paper • 2311.13601 • Published Nov 22, 2023 • 14

ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs

Paper • 2311.13600 • Published Nov 22, 2023 • 41

Towards Accurate Differential Diagnosis with Large Language Models

Paper • 2312.00164 • Published Nov 30, 2023 • 8

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Paper • 2311.16079 • Published Nov 27, 2023 • 19

upvoted a paper 8 months ago

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 44

upvoted a paper 10 months ago

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 33

upvoted a paper 12 months ago

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 30

upvoted a paper about 1 year ago

Dynamic-Resolution Model Learning for Object Pile Manipulation

Paper • 2306.16700 • Published Jun 29, 2023 • 5