Artikel in einem Konferenzbericht,

CopyCat: Near-duplicates within and between the ClueWeb and the common crawl

, , , , , , und .
Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, NY, USA, ACM, (Juli 2021)

Metadaten

Tags

    Nutzer

    • @scadsfct

    Kommentare und Rezensionen