KingNish (Nishith Jain)

upvoted a paper 2 days ago

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published 16 days ago • 35

upvoted a paper 5 days ago

Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models

Paper • 2409.17539 • Published 9 days ago • 1

upvoted a collection 7 days ago

Emu3

Collection

3 items • Updated 8 days ago • 47

upvoted 2 articles 8 days ago

Article

In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite

By

•

Jul 12

• 9

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

• 26

upvoted an article 9 days ago

Article

Llama can now see and run on your device - welcome Llama 3.2

10 days ago

• 136

upvoted a collection 9 days ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 9 days ago • 322

upvoted an article 9 days ago

Article

RAG chatbot using llama3

By

•

Jul 7

• 73

upvoted an article 10 days ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 169

upvoted 2 collections 10 days ago

MagpieLM

Collection

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 12 days ago • 13

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 16 days ago • 201

upvoted a collection 11 days ago

Paper-to-Read

Collection

5 items • Updated 11 days ago • 2

upvoted a paper 11 days ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published 14 days ago • 45

upvoted a collection 11 days ago

RealFlux (Flux)

Collection

2 items • Updated 11 days ago • 15

upvoted a paper 11 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 16 days ago • 120

upvoted 3 collections 11 days ago

upvoted a paper 11 days ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published 14 days ago • 65

upvoted 2 collections 13 days ago

Realistic Vision (SD1.5)

Collection

8 items • Updated Dec 4, 2023 • 33

RealVisXL (SDXL)

Collection

14 items • Updated Sep 2 • 63

upvoted 2 papers 15 days ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 15 days ago • 127

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published 17 days ago • 80

upvoted a collection 15 days ago

Collection Zero & Demo

Collection

Image Gen - Text -to-Image • 22 items • Updated 26 days ago • 10

upvoted a paper 16 days ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published 16 days ago • 69

upvoted 3 collections 16 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 16 days ago • 220

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 14 items • Updated 9 days ago • 69

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 9 items • Updated 12 days ago • 34

upvoted a paper 18 days ago

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20 • 12

upvoted an article 18 days ago

Article

Introducing Community Tools on HuggingChat

19 days ago

• 26

upvoted an article 21 days ago

Article

"Diffusers Image Fill" guide

By

•

21 days ago

• 31

upvoted a paper 22 days ago

Agent Workflow Memory

Paper • 2409.07429 • Published 23 days ago • 27

upvoted a collection 22 days ago

Agents

Collection

8 items • Updated 7 days ago • 1

upvoted a paper about 1 month ago

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Paper • 2407.10718 • Published Jul 15 • 17

upvoted 2 collections about 1 month ago

Video generation models (Image-to-Video)

Collection

4 items • Updated Aug 27 • 1

Video Generation models

Collection

The domain of video generation is booming. Here are the list of selected Open Access video generation (T2V) models. • 14 items • Updated Aug 27 • 12

upvoted 6 articles about 1 month ago

Article

quanto: a pytorch quantization toolkit

Mar 18

• 28

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

By

•

Aug 26

• 35

Article

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

Jun 19

• 11

Article

Student Ambassador Program's call for applications is open!

May 13, 2022

• 2

Article

Announcing the Hugging Face Fellowship Program

May 17, 2022

• 5

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 98

upvoted a collection about 1 month ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated 1 day ago • 54

upvoted a paper about 2 months ago

Imagen 3

Paper • 2408.07009 • Published Aug 13 • 60

upvoted 2 articles about 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 78

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 52

upvoted 2 articles 2 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1

• 45

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 197

upvoted a collection 2 months ago

Top LLM

Collection

Collection of TOP Open Source LLM, Sort by Best on top • 6 items • Updated Jul 26 • 11

upvoted 2 articles 3 months ago

Article

Train a Llama model from scratch

By

•

Jul 29

• 42

Article

Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices

By

•

Jul 19

• 2

upvoted a paper 3 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

upvoted an article 3 months ago

Article

Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI

By

•

Jul 12

• 3

upvoted a paper 3 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 142

upvoted an article 3 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12

• 87

upvoted an article 4 months ago

Article

Thoughts on LoRA Training #1

By

•

Jun 18

• 31

upvoted a paper 4 months ago

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7 • 40

upvoted 3 articles 4 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 335

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 221

Article

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

Jun 5

• 17

Nishith Jain

AI & ML interests

Articles

How OpenGPT 4o works

HelpingAI 9B: Cutting Edge Emotionally Intelligent AI

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

Organizations

KingNish's activity

In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite

Assisted Generation: a new direction toward low-latency text generation

Llama can now see and run on your device - welcome Llama 3.2

RAG chatbot using llama3

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Introducing Community Tools on HuggingChat

"Diffusers Image Fill" guide

quanto: a pytorch quantization toolkit

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

Student Ambassador Program's call for applications is open!

Announcing the Hugging Face Fellowship Program

Welcome FalconMamba: The first strong attention-free 7B model

XetHub is joining Hugging Face!

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Our Transformers Code Agent beats the GAIA benchmark!

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Train a Llama model from scratch

Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices

Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI

🧨 Diffusers welcomes Stable Diffusion 3

Thoughts on LoRA Training #1

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs