Sorensen–Dice coefficient: Difference between revisions

m
typo, formatting
m (fix links, formatting, typos, grammar)
m (typo, formatting)
Line 15:
di if ff fe er
 
Different implementationimplementations may do slightly different transforms. For our purposes, fold case so that all characters are the same case, split words, and ignore white-space, but keep punctuation.
 
The phrase "Don't Panic!" will be tokenized to the bi-grams:
Line 35:
;Task
* Use the list of Rosetta Code task and draft task names as your "dictionary" to search.
* Using that dictionary, search for the mangled task names: ''''Primordial primes'''', ''''Sunkist-Giuliani formula'''', ''''Sieve of Euripides'''', ''''Chowder numbers''''.
* Show the search term and the coefficient / match for the five closest, most similar matches.
 
10,333

edits