[mythtv-commits] Ticket #13514: ttvdb does not choose the best show when searching

MythTV noreply at mythtv.org
Wed Nov 20 15:49:18 UTC 2019


#13514: ttvdb does not choose the best show when searching
----------------------------------+------------------------------
 Reporter:  mspieth               |          Owner:  mspieth
     Type:  Bug Report - General  |         Status:  assigned
 Priority:  minor                 |      Milestone:  needs_triage
Component:  MythTV - General      |        Version:  Unspecified
 Severity:  medium                |     Resolution:
 Keywords:                        |  Ticket locked:  0
----------------------------------+------------------------------

Comment (by Gary Buhrmaster):

 Replying to [comment:2 mspieth]:
 > Is it better to use this or fuzzywuzzy?
 >
 > Opinions wanted.

 Well, since you asked....

 Do you have enough samples of the ttvdb (poor) choices to result in a
 meaningful comparison of the various algorithms to determine which is
 statistically better(*) for MythTV?  Those might include Jaccard
 Similarity, Cosine Similarity, Levenshtein Distance.  If you have enough
 samples, you could use something like the textdistance library (which
 provides for the choice of many algorithms) as the test platform for
 obtaining your statistics.
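
 A minimal sketch of such a comparison follows. It uses small pure-Python
 versions of the three measures rather than textdistance so that it runs
 without extra packages. The sample titles, the word-level tokenization, and
 all helper names (best_match, accuracy, SAMPLES) are made up for
 illustration and are not taken from the ttvdb grabber, so the resulting
 numbers say nothing about MythTV itself.

```python
from collections import Counter
from math import sqrt


def levenshtein_similarity(a, b):
    """Normalized Levenshtein similarity in [0, 1] over characters."""
    if not a and not b:
        return 1.0
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return 1.0 - prev[-1] / max(len(a), len(b))


def jaccard_similarity(a, b):
    """Jaccard similarity over sets of whitespace-separated words."""
    sa, sb = set(a.split()), set(b.split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def cosine_similarity(a, b):
    """Cosine similarity over word-count vectors."""
    ca, cb = Counter(a.split()), Counter(b.split())
    dot = sum(ca[w] * cb[w] for w in ca)
    norm = (sqrt(sum(v * v for v in ca.values()))
            * sqrt(sum(v * v for v in cb.values())))
    return dot / norm if norm else 0.0


ALGORITHMS = {
    "levenshtein": levenshtein_similarity,
    "jaccard": jaccard_similarity,
    "cosine": cosine_similarity,
}


def best_match(query, candidates, score):
    """Return the candidate with the highest score (first one wins ties)."""
    q = query.lower()
    return max(candidates, key=lambda c: score(q, c.lower()))


def accuracy(samples, score):
    """Fraction of samples for which the scorer picks the expected title."""
    hits = sum(best_match(q, cands, score) == expected
               for q, cands, expected in samples)
    return hits / len(samples)


# Hypothetical samples: (search string, candidate titles, expected choice).
SAMPLES = [
    ("Doctor Who",
     ["Doctor Who (2005)", "Doctor Who", "Doctor Who Confidential"],
     "Doctor Who"),
    ("the office us",
     ["The Office (US)", "The Office"],
     "The Office (US)"),
]

if __name__ == "__main__":
    for name, score in ALGORITHMS.items():
        print(f"{name:12s} {accuracy(SAMPLES, score):.2f}")
```

 With a real collection of searches where ttvdb chose poorly, the same loop
 would give a per-algorithm hit rate to compare.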

 fuzzywuzzy is, as I recall, an implementation of Levenshtein distance, as
 is python-Levenshtein, so both of them and the existing internal matching
 should be expected to produce essentially equivalent results (and the
 internal function is already in place, which is an advantage of its own).

 I suppose that requiring textdistance, with an optional user override of
 the algorithm, provides the most flexibility, but I have not researched
 how widely textdistance has been packaged in the supported distros, and if
 (for example) Levenshtein is always good, using the internal functions
 might be good enough.
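
 A rough sketch of the override idea, with textdistance treated as optional
 rather than required so that the internal matcher remains the fallback.
 The names get_scorer and internal_similarity are hypothetical, and
 difflib's SequenceMatcher is only a stand-in for the existing internal
 function, not the actual MythTV code.

```python
from difflib import SequenceMatcher

try:
    import textdistance  # optional third-party dependency
except ImportError:
    textdistance = None


def internal_similarity(a, b):
    """Stand-in for the existing internal matcher (an assumption)."""
    return SequenceMatcher(None, a, b).ratio()


def get_scorer(algorithm=None):
    """Return a similarity function, honoring a user override if possible.

    `algorithm` is the name of a textdistance algorithm object, such as
    "levenshtein", "jaccard" or "cosine". If textdistance is missing or
    the name is unknown, fall back to the internal matcher.
    """
    if algorithm and textdistance is not None:
        algo = getattr(textdistance, algorithm, None)
        if algo is not None and hasattr(algo, "normalized_similarity"):
            return algo.normalized_similarity
    return internal_similarity


if __name__ == "__main__":
    score = get_scorer("jaccard")  # user override, if available
    print(score("doctor who", "doctor who (2005)"))
```

 How the user would supply the algorithm name (a setting, a command line
 option) is left open here.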


 (*) The general problem of natural language searching is not yet a
 completely solved problem, of course, so "better" is as good as it gets.

-- 
Ticket URL: <https://code.mythtv.org/trac/ticket/13514#comment:3>
MythTV <http://www.mythtv.org>
MythTV Media Center