DmitryRyumin (Dmitry Ryumin)

upvoted a paper 3 days ago

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published 5 days ago • 12

upvoted a paper 8 days ago

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Paper • 2408.06019 • Published Aug 12 • 13

upvoted a paper 10 days ago

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Paper • 2409.18124 • Published 10 days ago • 23

upvoted a collection 11 days ago

Llama 3.2 All Versions

Collection

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 20 items • Updated about 19 hours ago • 31

upvoted an article 11 days ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

19 days ago

• 146

upvoted 2 articles 12 days ago

Article

Exploring the Daily Papers Page on Hugging Face

14 days ago

• 25

Article

XetHub is joining Hugging Face!

Aug 8

• 78

upvoted a collection 18 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 18 days ago • 226

upvoted 5 papers about 1 month ago

The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

Paper • 2408.12503 • Published Aug 22 • 21

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22 • 61

upvoted a collection about 2 months ago

Jamba-1.5

Collection

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22 • 76

upvoted an article about 2 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19

• 72

upvoted a paper about 2 months ago

Transformer Language Models without Positional Encodings Still Learn Positional Information

Paper • 2203.16634 • Published Mar 30, 2022 • 5

upvoted a collection about 2 months ago

Qwen2-Audio

Collection

Audio-language model series based on Qwen2 • 4 items • Updated 19 days ago • 41

upvoted a paper about 2 months ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 54

upvoted an article about 2 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 98

upvoted a collection about 2 months ago

🦅 🐍 FalconMamba 7B

Collection

This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 13 items • Updated 19 days ago • 25

upvoted a collection 3 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 11 days ago • 587

upvoted 2 papers 3 months ago

YouTube-SL-25: A Large-Scale, Open-Domain Multilingual Sign Language Parallel Corpus

Paper • 2407.11144 • Published Jul 15 • 7

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Paper • 2407.03168 • Published Jul 3 • 2

upvoted an article 4 months ago

Article

HelpingAI 9B: Cutting Edge Emotionally Intelligent AI

By

•

May 31

• 3

upvoted a paper 4 months ago

Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer

Paper • 2405.17405 • Published May 27 • 14

upvoted 2 papers 5 months ago

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Paper • 2405.12981 • Published May 21 • 28

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 150

upvoted a collection 5 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 18 days ago • 474

upvoted a paper 5 months ago

SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization

Paper • 2405.11582 • Published May 19 • 12

upvoted a collection 5 months ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 136

upvoted a paper 5 months ago

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published Apr 30 • 71

upvoted a collection 5 months ago

🤔 Facial Expressions Recognition

Collection

Embrace the future of Facial Expressions Recognition with the latest AI-powered technologies! 🚀 • 4 items • Updated Jun 11 • 6

upvoted an article 5 months ago

Article

Custom architectures with HuggingFace 🤗

By

•

Apr 22

• 21

upvoted a collection 6 months ago

Russian speaking 7B models

Collection

There is some my 7B models good speak and understand Russian language. Approved by some data-set my own tests. Will be link to github repo soon...🪬 • 7 items • Updated May 17 • 4

upvoted an article 6 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 221

upvoted a collection 6 months ago

🤗 Big Five Personality Traits

Collection

The latest AI technologies usher in a new era of Big Five personality assessment 🚀 • 4 items • Updated May 1 • 2

upvoted an article 6 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 161

upvoted a collection 6 months ago

🤖 LLM Spaces

Collection

A collection of applications demonstrating large language models (LLMs) 🚀 • 17 items • Updated May 30 • 6

upvoted 5 papers 6 months ago

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

Paper • 2404.04421 • Published Apr 5 • 16

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

Paper • 2404.02152 • Published Apr 2 • 3

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Paper • 2404.01543 • Published Apr 2 • 3

Adversarial AutoMixup

Paper • 2312.11954 • Published Dec 19, 2023 • 2

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27 • 51

upvoted 2 papers 7 months ago

Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision

Paper • 2403.12687 • Published Mar 19 • 3

VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Paper • 2403.08764 • Published Mar 13 • 34

upvoted 3 collections 7 months ago

🎭 Avatars

Collection

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 67 items • Updated 8 days ago • 73

🖼️ Image Enhancement

Collection

Embrace the future of Image Enhancement with the latest AI-powered technologies! 🚀 • 1 item • Updated May 1 • 5

🔊 Speech Enhancement

Collection

Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! 🚀 • 8 items • Updated May 1 • 9

upvoted a paper 7 months ago

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7 • 40

upvoted a collection 8 months ago

OpenCodeInterpreter

Collection

18 items • Updated Mar 3 • 82

upvoted 3 papers 8 months ago

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 94

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21 • 45

Vision-Based Hand Gesture Customization from a Single Demonstration

Paper • 2402.08420 • Published Feb 13 • 7

upvoted a collection 12 months ago

ICCV 2023 Demos

Collection

Demos for ICCV 2023 papers • 38 items • Updated Oct 5, 2023 • 10

upvoted a paper about 1 year ago

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 82

Dmitry Ryumin

AI & ML interests

Organizations

DmitryRyumin's activity

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Exploring the Daily Papers Page on Hugging Face

XetHub is joining Hugging Face!

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Welcome FalconMamba: The first strong attention-free 7B model

HelpingAI 9B: Cutting Edge Emotionally Intelligent AI

Custom architectures with HuggingFace 🤗

Fine-tune Llama 3 with ORPO

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community