Quarks & Bits
  • Home
Subscribe
Tagged

Locality Sensitive Hashing

A collection of 1 post

Minhash

SHINGLING + MINHASH: BASIC NEAR DUPLICATE DOCUMENT DETECTION

Introduction Picture it…New York City 2014, two documents walk into a bar. We are given the task to determine if the documents are duplicates of each other or if they are just near duplicates. How would we do this if we weren't allowed to actually read the documents? How

  • z0mb13
z0mb13 Jul 4, 2014 • 5 min read
Quarks & Bits © 2022
Powered by Ghost