To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published 16 days ago • 35
Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models Paper • 2409.17539 • Published 9 days ago • 1
Article In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite By whitphx • Jul 12 • 9
Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 26
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 9 days ago • 322
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24 • 169
MagpieLM Collection Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 12 days ago • 13
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 16 days ago • 201
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models Paper • 2409.13592 • Published 14 days ago • 45
Qwen2-Audio Collection Audio-language model series based on Qwen2 • 4 items • Updated 16 days ago • 41
Qwen2-VL Collection Vision-language model series based on Qwen2 • 15 items • Updated 16 days ago • 129
Imagine yourself: Tuning-Free Personalized Image Generation Paper • 2409.13346 • Published 14 days ago • 65
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 15 days ago • 127
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published 16 days ago • 69
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 16 days ago • 220
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 14 items • Updated 9 days ago • 69
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 9 items • Updated 12 days ago • 34
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Paper • 2402.12875 • Published Feb 20 • 12
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning Paper • 2407.10718 • Published Jul 15 • 17
Video Generation models Collection The domain of video generation is booming. Here is a list of selected open-access video generation (T2V) models. • 14 items • Updated Aug 27 • 12
Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • Aug 26 • 35
Article Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19 • 11
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated 1 day ago • 54
Top LLM Collection A collection of top open-source LLMs, sorted with the best on top • 6 items • Updated Jul 26 • 11
Article Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • Jul 19 • 2
Article Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI By Abhaykoul • Jul 12 • 3
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Paper • 2403.04692 • Published Mar 7 • 40
Article Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs Jun 5 • 17