Luca Soldaini
soldni
AI & ML interests
question answering, information retrieval, scientific document processing
Organizations
soldni's activity
What is the total # tokens after sampling proportion? 1.7T or 1.65T
2
#36 opened about 2 months ago
by
ivanzhouyq
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6313f5f4c093ff968e0ec6c8/LVTpwU-pXVDhnJcEbDAEx.jpeg)
v1_7 update
#28 opened 3 months ago
by
kylel
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1584459920518-noauth.jpeg)
Does allenai/c4 and the subset C4 in allenai/dolma is the same dataset?
4
#10 opened 4 months ago
by
speiqin
Can't download two files
1
#19 opened 6 months ago
by
mrgorjan
Prompting to OLMo
2
#8 opened 6 months ago
by
herambpatil2004
Update README.md
#10 opened 9 months ago
by
Muennighoff
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png)
Add download instructions
#8 opened 9 months ago
by
Muennighoff
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png)
Fix size
#9 opened 9 months ago
by
Muennighoff
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png)
sample for analysis?
1
#1 opened 11 months ago
by
KnutJaegersberg
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1669551186189-63732ebbbd81fae2b3aaf3fb.jpeg)
Semantic Scholar API metadata for this dataset?
2
#1 opened about 1 year ago
by
MicPie
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1614280013027-noauth.jpeg)
How to generate one token after the other with Scibert?
1
#4 opened about 1 year ago
by
junoriosity
Update README.md
#6 opened over 1 year ago
by
thefuzz
updated dataset_infos for version 0.3.0
#2 opened almost 2 years ago
by
soldni
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f04d8c45d08220171a0ad32/o-9-ZITCjP5k1UAFzG7yz.png)
updated to dataset v 0.3 + added test split
2
#1 opened almost 2 years ago
by
soldni
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f04d8c45d08220171a0ad32/o-9-ZITCjP5k1UAFzG7yz.png)
Update config.json
1
#3 opened almost 2 years ago
by
johngiorgi
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660092421231-5ef3a3e6518622264685b0da.png)
Licensing
1
#1 opened almost 2 years ago
by
Jacobw