satyamt (Satyam)

upvoted a collection 7 days ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 11 days ago • 219

upvoted an article about 1 month ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5

• 112

upvoted a paper about 1 month ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29 • 51

upvoted an article about 2 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 201

upvoted a paper about 2 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 51

upvoted a collection 2 months ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 136

upvoted an article 2 months ago

Article

Constitutional AI with Open LLMs

Feb 1

• 11

upvoted a collection 2 months ago

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 35

upvoted an article 2 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 26

upvoted an article 3 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 171

upvoted 2 papers 4 months ago

GenQA: Generating Millions of Instructions from a Handful of Prompts

Paper • 2406.10323 • Published Jun 14 • 5

Show, Don't Tell: Aligning Language Models with Demonstrated Feedback

Paper • 2406.00888 • Published Jun 2 • 30

upvoted a collection 4 months ago

sentence-transformers-from-synthetic-data

Collection

Example of using distilabel to generate synthetic triplets data for fine-tuning a Sentence Transformer model • 4 items • Updated Jun 21 • 21

upvoted a paper 4 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23 • 35

upvoted an article 5 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 61

upvoted a paper 5 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 13

upvoted an article 5 months ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

Jun 23

• 59

upvoted an article 6 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 102

upvoted a paper 6 months ago

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14 • 43

upvoted a collection 6 months ago

EasyContext

Collection

https://github.com/jzhang38/EasyContext • 7 items • Updated Apr 19 • 4

upvoted 2 papers 6 months ago

Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2 • 34

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2 • 43

upvoted a collection 6 months ago

Eurus

Collection

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Apr 15 • 24

upvoted a paper 6 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 103

upvoted 9 papers 7 months ago

upvoted a collection 7 months ago

Caduceus

Collection

https://caduceus-dna.github.io/ • 8 items • Updated 30 days ago • 9

upvoted a paper 7 months ago

Common 7B Language Models Already Possess Strong Math Capabilities

Paper • 2403.04706 • Published Mar 7 • 16

upvoted 2 collections 7 months ago

UDOP

Collection

UDOP is a general multimodal model for document AI • 4 items • Updated Jul 11 • 22

OpenCodeInterpreter

Collection

18 items • Updated Mar 3 • 82

upvoted 2 papers 7 months ago

RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models

Paper • 2401.09432 • Published Dec 17, 2023 • 2

RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models

Paper • 2312.16132 • Published Dec 26, 2023 • 2

upvoted a paper 8 months ago

Instruction-tuned Language Models are Better Knowledge Learners

Paper • 2402.12847 • Published Feb 20 • 24

upvoted a collection 8 months ago

DPO

Collection

5 items • Updated Dec 1, 2023 • 1

upvoted 2 papers 8 months ago

How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15 • 38

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 22

upvoted a collection 8 months ago

datasets-SPIN

Collection

Generated synthetic data used to finetune SPIN. • 8 items • Updated Feb 9 • 11

upvoted 4 papers 8 months ago

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Paper • 2210.14986 • Published Oct 26, 2022 • 4

In-Context Principle Learning from Mistakes

Paper • 2402.05403 • Published Feb 8 • 14

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5 • 67

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Paper • 2401.16467 • Published Jan 29 • 8

upvoted a paper 9 months ago

Convergent Learning: Do different neural networks learn the same representations?

Paper • 1511.07543 • Published Nov 24, 2015 • 2

upvoted a collection 9 months ago

Medical Merges

Collection

Playful merges that try to improve small medical LMs by merging them with models with higher reasoning capabilities. • 35 items • Updated Mar 5 • 2

upvoted a paper 9 months ago

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Paper • 2306.02858 • Published Jun 5, 2023 • 18

upvoted a collection 9 months ago

AIM

Collection

AIM: Autoregressive Image Models • 5 items • Updated 3 days ago • 48

upvoted 2 papers 9 months ago

ReFT: Reasoning with Reinforced Fine-Tuning

Paper • 2401.08967 • Published Jan 17 • 27

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 75

upvoted 4 collections 9 months ago

Preference Datasets for DPO

Collection

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Jul 30 • 28

Hermes 2

Collection

Nous' Flagship LLM Series • 23 items • Updated Aug 15 • 101

Comparing DPO with IPO and KTO

Collection

A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 9 • 31

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 212

upvoted 2 papers 9 months ago

Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

Paper • 2401.00448 • Published Dec 31, 2023 • 27

Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

Paper • 2312.17120 • Published Dec 28, 2023 • 25

Satyam

AI & ML interests

Organizations

satyamt's activity

ColPali: Efficient Document Retrieval with Vision Language Models 👀

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Constitutional AI with Open LLMs

Serverless Inference with Hugging Face and NVIDIA NIMs

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

seemore: Implement a Vision Language Model from Scratch

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare