[mythtv-users] Log output of user jobs via MythLog class?

Sam Jacobs samlists at ijacobs.co.uk
Wed May 7 22:07:49 UTC 2014


On 7 May 2014 at 09:31:32, Karl Dietz (dekarl at spaetfruehstuecken.org) wrote:
> On 07.05.2014 10:10, Sam Jacobs wrote:
> > You’re right—it was the “Sitcom” at the beginning of the subtitle that was throwing  
> > me off. I wonder why myth decided to remove *that* from the description but not the 
> > rest of the detected subtitle (for me the detected subtitle always gets removed 
> > from the description). 
>  
> MythTV didn't decide anything, the broadcaster sends a short and a long
> text. 

That’s interesting, thanks. I thought that short text simply referred to the programme title. Probably long text isn’t used in the UK, then.


> Some broadcasters do send an episode title (or some nonsense for
> movies, not the tagline, but "Movie 2014 DE/AU" written differently
> every time) but a shorter description instead.
> One common thing with this broadcaster is to put a "genreword" *or* a
> "genre sentence related to the series" in front of the short description.
> Then start the long description with a generic series description
> followed by a long episode description.

Then it’s perhaps inappropriate for MythTV to use the short text for the subtitle field for data from these broadcasters. Based on what you’ve written, I would personally suggest removing the part of the beginning of the long text that matches the end of the short text, then concatenating the two strings and using them for the description. In the examples Nicolas has provided, there is clearly nothing in the short text or the long that could reasonably be the subtitle of the programme.

At the least, somebody should look at the distribution of word counts in the data[1][2] and implement a sensible word limit—automatically using a string of 22 words doesn’t seem sensible, to me.

Sam


[1] I’ve been using this SQL to look at the distribution of word counts in my program table, where video sources 1 and 3 are fed from EIT:

SELECT IF(LENGTH(subtitle) > 0, LENGTH(subtitle) - LENGTH(REPLACE(subtitle, ' ', ''))+1, 0) subtitle_len, count(*) FROM program WHERE chanid IN (select chanid from channel where (sourceid=1 or sourceid=3)) GROUP BY subtitle_len;

If *all* of one’s video sources are EIT fed, one can remove the WHERE clause:

SELECT IF(LENGTH(subtitle) > 0, LENGTH(subtitle) - LENGTH(REPLACE(subtitle, ' ', ''))+1, 0) subtitle_len, count(*) FROM program GROUP BY subtitle_len;


[2] My data, from a UK EIT fed program table:

subtitle	programme
length		count
--------	---------
0		55630
1		4198
2		7853
3		8718
4		4639
5		2721
6		1919
7		2090
8		1040
9		1358


--  
Sam Jacobs on MythTV 0.27, UK Freeview SD & Freesat HD, EIT-only EPG
Mac mini, MBE+FE: OS X Mavericks, bundled FE, macports BE
Acer Aspire L310, SBE: Debian testing, Elgato EyeTV for DTT Stick (Hauppauge Nova-T USB Stick in disguise!) & DVBSky S960 DVB-S2 USB

Sent with Airmail


More information about the mythtv-users mailing list