view article Article EU Training Data Transparency: A Proposal for a Sufficiently Detailed Summary 📑📚🖼️🇪🇺 By yjernite • 6 days ago • 7
HuggingFace's Transformers: State-of-the-art Natural Language Processing Paper • 1910.03771 • Published Oct 9, 2019 • 16
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • about 3 hours ago • 25
view article Article Build Agentic Workflow using OpenAGI and HuggingFace models By lucifertrj • 13 days ago • 6
view article Article Financial Analysis with Langchain and CrewAI Agents By herooooooooo • 9 days ago • 4
view article Article Enhancing Image Model Dreambooth Training Through Effective Captioning: Key Observations By alvdansen • 20 days ago • 11
view article Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation By davanstrien • 19 days ago • 11
view article Article Unveiling CIVICS: A New Dataset for Examining Cultural Values in Language Models By giadap • 20 days ago • 7
view article Article Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖 By m-ric • 19 days ago • 25
view article Article Low Latency CPU Based Educational Value Classifier With Generic Educational Value By kenhktsui • 26 days ago • 7
view article Article An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct By leonardlin • 28 days ago • 41
SimCLRv1 PyTorch Weights Collection Official PyTorch converted weights of SimCLRv1 • 4 items • Updated 27 days ago • 1
view article Article Introducing the Hugging Face Embedding Container for Amazon SageMaker Jun 7 • 11
view article Article Orquestrando Small Language Models (SLM) usando JavaScript e a API de Inferência do Hugging Face By rrg92 • Jun 4 • 1
view article Article Orchestrating Small Language Models (SLM) using JavaScript and the Hugging Face Inference API By rrg92 • Jun 4 • 1
view article Article FaceChain-FACT: Open-source 10-second portrait generation, reusing massive LoRa styles, a base-model-friendly portrait application. By haoyufirst • May 31 • 1
view article Article Formatting Datasets for Chat Template Compatibility By nroggendorff • 10 days ago • 6
view article Article Fine Tuning TinyLlama for Text Generation with TRL By nroggendorff • 10 days ago • 3
view article Article FiftyOne Computer Vision Datasets Come to the Hugging Face Hub By jamarks • Jun 3 • 11
view article Article Orchestration of Experts: The First-Principle Multi-Model System By alirezamsh • May 30 • 14
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting Paper • 2405.18424 • Published May 28 • 7
VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections Paper • 2405.17991 • Published May 28 • 9
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models Paper • 2405.18377 • Published May 28 • 16
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning Paper • 2405.18386 • Published May 28 • 17
view article Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3 • 21
view article Article Journey With Me Into The Mind of Large Language Models: Interesting Findings in AnthropicAI's Scaling Monosemanticity paper. By Jaward • May 22 • 2
view article Article Synthetic dataset generation techniques: generating custom sentence similarity data By davanstrien • May 23 • 13
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paper • 2403.11207 • Published Mar 17 • 14
view article Article Enjoy the Power of Phi-3 with ONNX Runtime on your device By Emma-N • May 22 • 22