[mythtv-users] Incorrect characters in the EPG and program name

Karl Dietz dekarl at spaetfruehstuecken.org
Mon Jan 14 08:27:56 UTC 2013


Hi Steve,

replying to myself with an example of what we need to fix the guide
until the broadcaster fixes their SI generator. (While creating the
example I noticed that the broadcaster has fixed their guide, so there
is still hope :)

------------------------------------------------------------
SECT-Packet: 00000032   PID: 18 (0x0012), Length: 475 (0x01db)
Time received: Mon 2013-01-14  09:15:31.707
------------------------------------------------------------
   0000:  4e f1 d8 40 2a dd 00 01  22 03 21 14 01 4e d1 75
          N..@*...".!..N.u
   0010:  db f2 07 55 00 00 50 00  81 bd 4d 2f 44 45 55 15
          ...U..P...M/DEU.
   0020:  05 44 65 72 20 6d 61 73  6b 69 65 72 74 65 20 52
          .Der maskierte R

Notice the 05 that starts the last line, this is the first byte of
the string and signals Latin Alphabet 5 aka ISO 8859-9.


   0030:  e4 75 62 65 72 15 05 44  65 72 20 6d 61 73 6b 69
          .uber..Der maski
   0040:  65 72 74 65 20 52 e4 75  62 65 72 4e fe 01 44 45
          erte R.uberN..DE

Before the encoding was signaled I had to guess from e4 between 'R' and
'uber' that this must be an a-umlaut as thats how you spell Räuber.
Looking up the various candidate encodings that map a-umlaut to e4 I
could narrow the encoding down. By looking up more characters (and using
other hints, like the encoding of other channels on the same multiplex)
I could narrow it down to only one remaining candidate and added a fixup
for that channel.

Guess table from table id...
EIT-decoding....
Table_ID: 78 (0x4e)  [= Event Information Table (EIT) - actual 
Service_ID: xxxxx (0xxxxx)  [=  --> refers to PMT program_number]
Transport_stream_ID: 8707 (0x2203)
Original_network_ID: 8468 (0x2114)  [= German Digital Terrestrial

These three values are used to identify the service. With a DVB-C
provider that inserts its own guide its possible that keying off just
the Original_network_ID is enough.


     ISO639_2_language_code:  DEU
   event_name_length: 21 (0x15)
   event_name: "Der maskierte R?uber" -- Charset: Latin alphabet no. 5
   text_length: 21 (0x15)
   text_char: "Der maskierte R?uber"  -- Charset: Latin alphabet no. 5

Here it was saying just "Latin alphabet" before they fixed it.

Regards,
Karl


More information about the mythtv-users mailing list