Thinking Allowed

medical / technology / education / art / flub

showing posts for 'similarity'

Using the right tools for the job

Since this blog has been up I've fiddled with some text analysis stuff by analysing the text and making recommendations for similar blog entries. Did it all in PHP and MySQL just to understand how the algorithms work. Eventually it started to take about 5 hours to: tokenise and stemming the textcalculate...