dhruva-sarma (Dhruvajyoti Sarma)

upvoted 2 papers 2 days ago

Scaling Retrieval-Based Language Models with a Trillion-Token Datastore

Paper • 2407.12854 • Published 13 days ago • 26

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published 4 days ago • 37

upvoted a paper 6 days ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published 10 days ago • 106

upvoted an article 6 days ago

Article

SmolLM - blazingly fast and remarkably powerful

7 days ago

• 152

upvoted an article 10 days ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

12 days ago

• 68

upvoted a paper 20 days ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published 24 days ago • 84

upvoted 3 papers 24 days ago

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published 26 days ago • 12

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published 26 days ago • 47

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published 28 days ago • 65

upvoted 2 papers 27 days ago

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published about 1 month ago • 57

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Paper • 2406.12624 • Published Jun 18 • 35

upvoted 5 papers about 1 month ago

HARE: HumAn pRiors, a key to small language model Efficiency

Paper • 2406.11410 • Published Jun 17 • 38

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14 • 20

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Paper • 2406.11811 • Published Jun 17 • 15

Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17 • 13

From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published Jun 18 • 20

upvoted a collection about 1 month ago

Florence

Collection

9 items • Updated 11 days ago • 138

upvoted 2 articles about 1 month ago

Article

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Jan 29

• 8

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12

• 77

upvoted a paper about 1 month ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 62

upvoted an article about 2 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 238

upvoted a collection about 2 months ago

sentence-transformers-from-synthetic-data

Collection

Example of using distilabel to generate synthetic triplets data for fine-tuning a Sentence Transformer model • 4 items • Updated about 1 month ago • 20

upvoted an article about 2 months ago

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By

•

29 days ago

• 30

upvoted 4 papers about 2 months ago

Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion

Paper • 2405.09874 • Published May 16 • 15

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Paper • 2405.19893 • Published May 30 • 26

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 78

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control

Paper • 2405.12970 • Published May 21 • 22

upvoted an article 2 months ago

Article

Evaluating Language Model Bias with 🤗 Evaluate

Oct 24, 2022

• 2

upvoted a collection 2 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 23 items • Updated 11 days ago • 372

upvoted 2 papers 2 months ago

ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models

Paper • 2405.09220 • Published May 15 • 23

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 62

upvoted an article 2 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 102

upvoted 2 papers 2 months ago

Customizing Text-to-Image Models with a Single Image Pair

Paper • 2405.01536 • Published May 2 • 17

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 116

upvoted 4 papers 3 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 109

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30 • 102

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 67

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 124

upvoted an article 3 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22

• 76

upvoted 11 papers 3 months ago

How Far Can We Go with Practical Function-Level Program Repair?

Paper • 2404.12833 • Published Apr 19 • 6

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Paper • 2404.13013 • Published Apr 19 • 28

OpenBezoar: Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data

Paper • 2404.12195 • Published Apr 18 • 11

YaART: Yet Another ART Rendering Technology

Paper • 2404.05666 • Published Apr 8 • 14

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 58

upvoted 5 papers 4 months ago

Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm

Paper • 2403.11781 • Published Mar 18 • 17

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15 • 56

Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Paper • 2403.12015 • Published Mar 18 • 60

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15 • 65

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6 • 72

upvoted 5 papers 9 months ago

ConvNets Match Vision Transformers at Scale

Paper • 2310.16764 • Published Oct 25, 2023 • 18

3D-GPT: Procedural 3D Modeling with Large Language Models

Paper • 2310.12945 • Published Oct 19, 2023 • 52

Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams

Paper • 2310.08678 • Published Oct 12, 2023 • 11

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 32

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Paper • 2308.06721 • Published Aug 13, 2023 • 26

Dhruvajyoti Sarma

AI & ML interests

Organizations

dhruva-sarma's activity

SmolLM - blazingly fast and remarkably powerful

How NuminaMath Won the 1st AIMO Progress Prize

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Uncensor any LLM with abliteration

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Evaluating Language Model Bias with 🤗 Evaluate

License to Call: Introducing Transformers Agents 2.0

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent