view post Post 510 Reply Unsocial Intelligence: an Investigation of the Assumptions of AGI DiscourseI don't agree with some of the assertions made here, but it is an interesting paper and a good overview. https://arxiv.org/abs/2401.13142
view post Post 1497 Reply What We Learned from a Year of Building with LLMsIt's a nice perspective outlined in here. “When a measure becomes a target, it ceases to be a good measure.”— Goodhart’s Lawhttps://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/
Performance LLMs - Base Models Qwen/Qwen1.5-0.5B Text Generation • Updated Apr 5 • 56.7k • 132 stabilityai/stablelm-2-1_6b Text Generation • Updated about 1 month ago • 69.3k • 173 openbmb/MiniCPM-2B-128k Text Generation • Updated May 24 • 1.03k • 33 stabilityai/stablelm-3b-4e1t Text Generation • Updated Mar 7 • 23.6k • 307
Performance LLMs - Fine tuned KnutJaegersberg/Qwen2-Deita-500m Text Generation • Updated 29 days ago • 435 • 3 KnutJaegersberg/Deita-2b Text Generation • Updated Mar 4 • 676 • 2 microsoft/Phi-3-mini-128k-instruct Text Generation • Updated 4 days ago • 2.07M • 1.44k NousResearch/Hermes-2-Pro-Mistral-7B Text Generation • Updated about 16 hours ago • 40.3k • 476