Quarks & Bits

Minhash

A collection of 1 post

SHINGLING + MINHASH: BASIC NEAR DUPLICATE DOCUMENT DETECTION

Introduction

Picture it… New York City, 2014: two documents walk into a bar. We are tasked with determining whether the documents are duplicates of each other or merely near duplicates. How would we do this if we weren't allowed to actually read the documents? How
Jul 4, 2014
Quarks & Bits © 2023