Article: Releasing Common Corpus: the largest public domain dataset for training LLMs • By Pclanglais • Mar 20
Collection: Common Corpus • The largest public domain dataset for training LLMs. • 27 items • Updated Jul 17