EditDistanceMatcher - NodeJS script for doing edit distance 1 matching

05 February 2011   0 comments   Javascript



I needed a very basic spell-correcting string matcher in my current NodeJS project, so I wrote a simple class called EditDistanceMatcher. It compares a string against a list of known words and matches any word that is one edit away. With it you can do things like Google search's "Did you mean: poop?" when you search for pop.

Note that this code doesn't take the popularity of the correct words into account. For example, "pop" might appear much more often than "poop", so a popularity-aware corrector would suggest "pop" if you enter "poup"; this one makes no such ranking. Anyway, this simple snippet from the unit tests shows how it works:

     /* The match() method */
     var edm = new EditDistanceMatcher(["peter"]);
     // edm.match returns an array and remember,
     // in javascript ['peter'] == ['peter'] => false
     test.equal(edm.match("petter").length, 1);
     test.equal(edm.match("petter")[0], 'peter');
     test.equal(edm.match("junk").length, 0);

     /* the is_matched() method */
     var edm = new EditDistanceMatcher(["peter"]);
     test.equal(typeof edm.is_matched('petter'), 'boolean');
     test.equal(typeof edm.is_matched('junk'), 'boolean');
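
The tests above only show the class from the outside. As a rough sketch of how a matcher like this could work (this is not the original EditDistanceMatcher source), the code below checks each known word against the input. The names `isOneEditAway` and `OneEditMatcher` are made up for illustration. The sketch also assumes that an "edit" means a single insertion, deletion, or substitution (swapped neighbouring letters are not counted as one edit) and that an exact match does not count as a match.

```javascript
// Hypothetical helper: true when `a` and `b` differ by exactly one
// insertion, deletion, or substitution of a single character.
function isOneEditAway(a, b) {
  if (Math.abs(a.length - b.length) > 1) return false;
  // `shorter` is the shorter string (or either one if lengths are equal).
  var shorter = a.length <= b.length ? a : b;
  var longer = a.length <= b.length ? b : a;
  var i = 0, j = 0, edits = 0;
  while (i < shorter.length && j < longer.length) {
    if (shorter[i] === longer[j]) {
      i++;
      j++;
      continue;
    }
    edits++;
    if (edits > 1) return false;
    // Equal lengths: treat the mismatch as a substitution and move both.
    // Different lengths: skip the extra character in `longer` only.
    if (shorter.length === longer.length) i++;
    j++;
  }
  // A leftover character at the end of `longer` is one more edit.
  edits += longer.length - j;
  return edits === 1;
}

// Hypothetical matcher with the same method names as in the tests above.
function OneEditMatcher(words) {
  this.words = words;
}

// Returns an array of every known word one edit away from `candidate`.
OneEditMatcher.prototype.match = function (candidate) {
  return this.words.filter(function (word) {
    return isOneEditAway(word, candidate);
  });
};

// Returns a boolean: is at least one known word one edit away?
OneEditMatcher.prototype.is_matched = function (candidate) {
  return this.match(candidate).length > 0;
};

var matcher = new OneEditMatcher(["peter"]);
console.log(matcher.match("petter"));    // [ 'peter' ]
console.log(matcher.is_matched("junk")); // false
```

This compares the input against every word in turn, which is fine for a short word list. For a large dictionary, generating all one-edit variants of the input and looking them up in a set would likely scale better.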

The most basic use case is a quiz where you want to accept some spelling mistakes, e.g. "What's the capital of Sweden?" "STOKHOLM" "Correct!"
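
To make the quiz idea concrete, here is a small standalone sketch. `withinOneEdit` and `acceptAnswer` are hypothetical names, not part of EditDistanceMatcher, and the sketch assumes that answers should be compared case-insensitively, with surrounding whitespace ignored, and that an exactly correct answer is accepted too.

```javascript
// Hypothetical helper: true when the strings are equal or are one
// insertion, deletion, or substitution apart.
function withinOneEdit(a, b) {
  if (a === b) return true;
  if (Math.abs(a.length - b.length) > 1) return false;
  // Find the first position where the two strings differ.
  var i = 0;
  while (i < a.length && i < b.length && a[i] === b[i]) i++;
  // After skipping one character on one or both sides, the rest must agree.
  return a.slice(i + 1) === b.slice(i + 1) || // one character substituted
         a.slice(i + 1) === b.slice(i) ||     // one extra character in a
         a.slice(i) === b.slice(i + 1);       // one extra character in b
}

// Hypothetical quiz check: normalize both strings, then tolerate one typo.
function acceptAnswer(given, correct) {
  return withinOneEdit(given.trim().toLowerCase(),
                       correct.trim().toLowerCase());
}

console.log(acceptAnswer("STOKHOLM", "Stockholm")); // true
console.log(acceptAnswer("Oslo", "Stockholm"));     // false
```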

For those who aren't lazy, this NodeJS code can easily be used in a browser by simply removing the exports lines.



Note! I wrote this in an airport lounge, so I'm sure it can be improved a lot.

