Tonic (Joseph Pollack)

upvoted an article 8 days ago

Article

Welcome Gemma 2 - Google's new open LLM

12 days ago

• 87

upvoted a collection 8 days ago

LLM Compiler

Collection

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated 12 days ago • 137

upvoted a collection 12 days ago

Gemma 2 Release

Collection

10 items • Updated 12 days ago • 114

upvoted a collection 13 days ago

Probably DPO datasets

Collection

A collection of datasets that probably support DPO • 146 items • Updated 13 days ago • 8

upvoted 2 papers 17 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5 • 67

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Paper • 2312.08935 • Published Dec 14, 2023 • 4

upvoted an article 17 days ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 59

upvoted a paper 18 days ago

4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities

Paper • 2406.09406 • Published 26 days ago • 12

upvoted a collection 18 days ago

Instruction Pre-Training

Collection

8 items • Updated 18 days ago • 24

upvoted a collection 20 days ago

Florence

Collection

9 items • Updated 24 days ago • 136

upvoted a paper 21 days ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 74

upvoted a collection 22 days ago

DeepSeekCoder-V2

Collection

4 items • Updated 25 days ago • 57

upvoted a collection 25 days ago

SciRIFF

Collection

Data and models to enhance instruction-following for scientific literature understanding. • 9 items • Updated 26 days ago • 4

upvoted a paper 26 days ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published 27 days ago • 48

upvoted a paper 27 days ago

CRAG -- Comprehensive RAG Benchmark

Paper • 2406.04744 • Published Jun 7 • 38

upvoted a paper 28 days ago

AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets

Paper • 2404.05623 • Published Apr 8 • 3

upvoted 4 collections about 1 month ago

upvoted 2 papers about 1 month ago

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24 • 52

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27 • 50

upvoted a paper about 2 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16 • 113

upvoted a collection about 2 months ago

RecurrentGemma Release

Collection

8 items • Updated 12 days ago • 37

upvoted a paper about 2 months ago

Anchor-based Large Language Models

Paper • 2402.07616 • Published Feb 12 • 3

upvoted a collection about 2 months ago

MAmmoTH2

Collection

Scaling up instruction data from the web to build better LLMs • 11 items • Updated May 26 • 7

upvoted a paper about 2 months ago

To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency

Paper • 2304.02721 • Published Apr 5, 2023 • 3

upvoted 5 collections about 2 months ago

CommonCanvas

Collection

Collection of models trained on the CommonCatalogue datasets • 8 items • Updated May 16 • 6

Video-LLaVA 1.0 Model

Collection

A collection of Video-LLaVA 1.0 models • 3 items • Updated May 23 • 4

CommonCatalog

Collection

Common Catalog, a dataset with Creative Commons licensed images and machine-generated caption pairs • 8 items • Updated May 16 • 13

MADLAD-400

Collection

Models and spaces for MADLAD-400: A Multilingual And Document-Level Large Audited Dataset • 8 items • Updated Nov 14, 2023 • 5

Chronos Models & Datasets

Collection

Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 8 items • Updated 12 days ago • 26

upvoted an article 2 months ago

Article

Speech Synthesis, Recognition, and More With SpeechT5

Feb 8, 2023

• 2

upvoted a collection 2 months ago

Speaker Diarization Datasets

Collection

A collection of speaker diarization datasets compatible with Diarizers. • 6 items • Updated May 29 • 1

upvoted 2 papers 2 months ago

End-to-end speaker segmentation for overlap-aware resegmentation

Paper • 2104.04045 • Published Apr 8, 2021 • 1

Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation

Paper • 2210.13248 • Published Oct 24, 2022 • 1

upvoted an article 2 months ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

10 days ago

• 29

upvoted 3 collections 2 months ago

Granite Code Models

Collection

A series of code models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 20 items • Updated 10 days ago • 145

llama 3 self-align experiments

Collection

Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co/blog/sc2-instruct • 4 items • Updated May 9 • 6

Community Tools

Collection

Cool HF tools that I and others at HF work on that I regularly use • 4 items • Updated May 21 • 3

upvoted 2 papers 2 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30 • 45

Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval

Paper • 2311.05800 • Published Nov 10, 2023 • 3

upvoted a collection 2 months ago

🦢SWIM-IR Dataset

Collection

29 million Synthetic Wikipedia-based Multilingual Retrieval Training Pairs. • 4 items • Updated Apr 28 • 7

upvoted 2 papers 2 months ago

PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

Paper • 2305.02547 • Published May 4, 2023 • 7

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 58

upvoted 2 collections 2 months ago

Top LLM

Collection

Collection of top open-source LLMs • 4 items • Updated May 6 • 7

📀 Dataset comparison models

Collection

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated 27 days ago • 26

upvoted a paper 2 months ago

Generalizable Face Landmarking Guided by Conditional Face Warping

Paper • 2404.12322 • Published Apr 18 • 1

upvoted a collection 3 months ago

Caduceus

Collection

https://caduceus-dna.github.io/ • 8 items • Updated Apr 19 • 9

upvoted 4 papers 3 months ago

ChemLLM: A Chemical Large Language Model

Paper • 2402.06852 • Published Feb 10 • 25

Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning

Paper • 2404.12897 • Published Apr 19 • 1

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Paper • 2404.13686 • Published Apr 21 • 26

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Paper • 2402.07319 • Published Feb 11 • 13

upvoted 2 collections 3 months ago

Antidote Project

Collection

Data and models generated within the Antidote Project (https://univ-cotedazur.eu/antidote) • 20 items • Updated May 6 • 5

LLM

Collection

14 items • Updated Apr 24 • 1

upvoted 4 papers 3 months ago

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

Paper • 2206.15076 • Published Jun 30, 2022 • 3

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20 • 18

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 45

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8 • 155

upvoted a collection 3 months ago

Models - Fintech

Collection

6 items • Updated Apr 17 • 3
