Sorensen–Dice coefficient: Difference between revisions
Thundergnat (talk | contribs) m (typo, formatting) |
Thundergnat (talk | contribs) m (yet more typos) |
SDI = 2 × |A ∩ B| / |A ⊎ B|

The Sørensen–Dice coefficient is a "percent similarity" between the two populations, ranging from 0.0 to 1.0.
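As a rough illustration of the formula, here is a minimal Python sketch. It assumes that the two populations are multisets of adjacent-character bigrams, taken word by word from lowercased strings; that tokenization and the names `bigrams` and `sdi` are illustrative choices, not part of the definition.

```python
from collections import Counter


def bigrams(text):
    """Multiset of adjacent character pairs, taken word by word (assumed tokenization)."""
    counts = Counter()
    for word in text.lower().split():
        counts.update(word[i:i + 2] for i in range(len(word) - 1))
    return counts


def sdi(a, b):
    """Sørensen–Dice coefficient of two strings, between 0.0 and 1.0."""
    x, y = bigrams(a), bigrams(b)
    # Size of the multiset sum: every bigram occurrence from both sides.
    total = sum(x.values()) + sum(y.values())
    if total == 0:
        return 0.0  # neither string yields a bigram; treated as "no similarity" here
    # Counter & Counter keeps the minimum count per bigram (multiset intersection).
    shared = sum((x & y).values())
    return 2 * shared / total


print(sdi("night", "nacht"))  # one shared bigram ("ht") out of eight in total
```

Using `Counter` rather than `set` keeps repeated bigrams, which matches the multiset sum in the denominator.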
SDI ''can'' be used for spellchecking, but it is not very good at it, especially for short words. Where it really shines is fuzzy matching of short phrases such as book or movie titles. It may not return exactly what you are looking for, but it often gets remarkably close, even with some pretty poor inputs.
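A small sketch of that kind of fuzzy title lookup follows, again in Python. The title list, the misspelled query, and the helper names `dice` and `best_matches` are all hypothetical, and the word-by-word bigram tokenization is an assumption rather than a requirement.

```python
from collections import Counter


def dice(a, b):
    """Sørensen–Dice coefficient over per-word character bigrams (assumed tokenization)."""
    def grams(s):
        return Counter(w[i:i + 2] for w in s.lower().split() for i in range(len(w) - 1))

    x, y = grams(a), grams(b)
    total = sum(x.values()) + sum(y.values())
    return 2 * sum((x & y).values()) / total if total else 0.0


def best_matches(query, candidates, limit=3):
    """Return up to `limit` (score, candidate) pairs, highest score first."""
    scored = sorted(((dice(query, c), c) for c in candidates), reverse=True)
    return scored[:limit]


# Hypothetical catalogue and a badly typed query.
titles = ["The Hobbit", "The Hunger Games", "Brave New World", "The Great Gatsby"]
for score, title in best_matches("teh hobit", titles):
    print(f"{score:.3f}  {title}")
```

Here the query shares four of its six bigrams with "The Hobbit" and none with the other titles, so that title ranks first despite both words being misspelled.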