Thomas Anderson's picture

9 35

Thomas Anderson

farpluto

·

AI & ML interests

None yet

Organizations

None yet

farpluto's activity

upvoted 2 collections about 2 months ago

TriLMs-Unpacked

TriLMs unpacked to FP16 - compatible with any implementation supporting LLaMa architecture in huggingface's transformers format. • 9 items • Updated Jul 9 • 4

Common Corpus

The largest public domain dataset for training LLMs. • 27 items • Updated Jul 17 • 111

upvoted an article about 2 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 16

upvoted 4 collections about 2 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 11 days ago • 339

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 4 days ago • 584

InternVL 2.0

Expanding Performance Boundaries of Open-Source MLLM • 16 items • Updated Aug 10 • 73

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 173

upvoted a collection 3 months ago

Core ML Gallery Models

7 items • Updated Jun 19 • 30

upvoted a collection 5 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 11 days ago • 467