Ai2 Dolma: 3T token open corpus for language model pretraining (2023)
1 point
by tosh
1 hour ago
| 0 comments
| allenai.org