[mythtv-commits] Ticket #13514: ttvdb does not choose the best show when searching
MythTV
noreply at mythtv.org
Wed Nov 20 15:49:18 UTC 2019
#13514: ttvdb does not choose the best show when searching
----------------------------------+------------------------------
Reporter: mspieth | Owner: mspieth
Type: Bug Report - General | Status: assigned
Priority: minor | Milestone: needs_triage
Component: MythTV - General | Version: Unspecified
Severity: medium | Resolution:
Keywords: | Ticket locked: 0
----------------------------------+------------------------------
Comment (by Gary Buhrmaster):
Replying to [comment:2 mspieth]:
> Is it better to use this or fuzzywuzzy?
>
> Opinions wanted.
Well, since you asked....
Do you have enough samples of the ttvdb (poor) choices to result in a
meaningful comparison of the various algorithms to determine which is
statistically better(*) for MythTV? Those might include Jaccard
Similarity, Cosine Similarity, Levenshtein Distance. If you have enough
samples, you could use something like the textdistance library (which
provides for the choice of many algorithms) as the the test platform for
obtaining your statistics.
fuzzywuzzy is an implementation of Levenshtein distance, as I recall, as
is python-Levenshtein, so in addition to the existing internal matching,
they should be expected to all produce essentially equivalent results (and
obviously the internal function is already in place, which has a different
advantage).
I suppose that requiring textdistance, with an optional user override of
the algorithm, provides the most flexibility, but I have not researched
how widely textdistance has been packaged in the supported distros, and if
(for example) Levenshtein is always good, using the internal functions
might be good enough.
(*) The general problem of natural language searching is not yet a
completely solved problem, of course, so "better" is as good as it gets.
--
Ticket URL: <https://code.mythtv.org/trac/ticket/13514#comment:3>
MythTV <http://www.mythtv.org>
MythTV Media Center
More information about the mythtv-commits
mailing list