EditDistanceMatcher - NodeJS script for doing edit distance 1 matching

05 February 2011   0 comments   Javascript

https://gist.github.com/812443

Mind That Age!

This blog post is 7 years old! Most likely, its content is outdated. Especially if it's technical.

I needed a very basic spell correction string matcher in my current NodeJS project so I wrote a simple class called EditDistanceMatcher that compares a string against another string and matches if it's 1 edit distance away. With it you can do things like Google search's "Did you mean: poop?" when you search for pop.

Note, this code doesn't check popularity of correct words (e.g. "pop" might appear much more often than "poop" so it'll suggest "pop" if you enter "poup"). Anyway this simple snippet from the unit tests will reveal how it works:

     /* The match() method */
     var edm = new EditDistanceMatcher(["peter"]);
     // edm.match returns an array and remember,
     // in javascript ['peter'] == ['peter'] => false
     test.equal(edm.match("petter").length, 1);
     test.equal(edm.match("petter")[0], 'peter');
     test.equal(edm.match("junk").length, 0);

     /* the is_matched() method */
     var edm = new EditDistanceMatcher(["peter"]);
     test.equal(typeof edm.is_matched('petter'), 'boolean');
     test.equal(typeof edm.is_matched('junk'), 'boolean');
     test.ok(edm.is_matched("petter"));
     test.ok(!edm.is_matched("junk"));

The most basic use case is if you have a quiz and you want to accept some spelling mistakes. "What's the capital of Sweden?; STOKHOLM; Correct!"

For the unlazy this NodeJS code can very easily be used in a browser by simply removing the exports stuff.

edit_distance.js

tests/test_edit_distance.js

Note! I wrote this in an airport lounge so I'm sure it can be improved lots more.

Comments

Your email will never ever be published


Related posts

Previous:
DoneCal on MumbaiMirror 03 February 2011
Next:
DoneCal homepage now able to do 10,000 requests/second 13 February 2011
Related by Keyword:
How's My WiFi? 08 December 2017
Python slow-down of exception handling or condition checking 14 May 2015
"Did you mean this domain?" Auto-correction for the browser's address bar 05 April 2013
How I stopped worrying about IO blocking Tornado 18 September 2012
Slides about Kwissle from yesterdays London Python Dojo 08 July 2011
Related by Text:
Be very careful with your add_header in Nginx! You might make your site insecure 11 February 2018
jQuery and Highslide JS 08 January 2008
I'm back! Peterbe.com has been renewed 05 June 2005
Anti-McCain propaganda videos 12 August 2008
I'm Prolog 01 May 2007