The largest public domain dataset for training LLMs.
PleIAs
company
AI & ML interests
Open Science LLMs
Organization Card
About org cards
PleIAs is a French startup training LLMs with an open science approach.
For more information, visit our website : https://pleias.fr/
Contact us : contact@pleias.fr
Collections
1
models
7
PleIAs/Estienne
Token Classification
•
Updated
•
28
PleIAs/French-TV-Headline-Classification
Text Classification
•
Updated
•
3
PleIAs/ocr-error-classifier
Updated
PleIAs/Pleias-Topic-Detection
Text2Text Generation
•
Updated
•
27
PleIAs/editorial-segmentation-common-corpus
Token Classification
•
Updated
•
8
PleIAs/Classifier-Common-Corpus
Text Classification
•
Updated
•
2
PleIAs/OCRonos
Text Generation
•
Updated
•
7
•
1
datasets
33
PleIAs/post-ocr
Updated
•
2
PleIAs/English-PD
Preview
•
Updated
•
4
PleIAs/Italian-PD
Preview
•
Updated
•
12
•
7
PleIAs/French-PD-diverse
Viewer
•
Updated
•
302k
•
1
PleIAs/Latin-PD
Viewer
•
Updated
•
95.8k
•
17
•
1
PleIAs/NewZealand-PD-Newspapers-XML
Preview
•
Updated
•
5
PleIAs/YouTube-Commons
Updated
•
116
•
287
PleIAs/Spanish-PD-Books
Preview
•
Updated
•
2
•
3
PleIAs/NewZealand-PD-Newspapers
Viewer
•
Updated
•
1.77M
•
1
PleIAs/German-PD
Viewer
•
Updated
•
9.23k
•
4
•
10