The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Advances in Neural Information Processing Systems 35 (NeurIPS 2022), vol. 35, pp. 31809-31826. Curran Associates, Inc., December 2022.
