Niklas Deckers, und Martin Potthast. WARC-DL: Scalable Web Archive Processing for Deep Learning. 2022. [PUMA: Archive Deep Learning Processing Scalable WARC-DL Web] URL