Niklas Deckers and Martin Potthast. WARC-DL: Scalable Web Archive Processing for Deep Learning. 2022.