The Untold Story of NeXT...
Learning From The Past I really enjoy learning about the history of computing and scientific discoveries as the stories behind the work are often even more interesting than the innovations themselves. I think that it is important as technical professionals in industry and academia to understand how technology and the
THE CONFERENCE...FOR YOU KNOW SEARCH: ELASTICON 2015
It is very fitting that my first blog post in a long while is about Elasticsearch and some highlights of what I learned at ElasticOn 2015. I have been using Elasticsearch professionally to varying degrees for almost two years now (note, I’ve also used another Lucene backed product called
SHINGLING + MINHASH: BASIC NEAR DUPLICATE DOCUMENT DETECTION
Introduction Picture it…New York City 2014, two documents walk into a bar. We are given the task to determine if the documents are duplicates of each other or if they are just near duplicates. How would we do this if we weren't allowed to actually read the documents? How
A CURIOUS TALE: TWITTER STREAMING API + GOOGLE APP ENGINE
Introducing OpenLSH A few months ago I attended a Boston Data Mining meetup where a cool, fellow techie J Singh did a presentation on an algorithm called Locality Sensitive Hashing. During his presentation J expressed an interest in developing an open source library that implements LSH and few weeks later