Introduction
Picture it…New York City 2014, two documents walk into a bar. We are given the
task to determine if the documents are duplicates of each other or if they are
just near duplicates. How would we do this if we weren't allowed to actually
read the documents? How