[mythtv-users] Immir tv_grab_au again!

Eyal Lebedinsky eyal at eyal.emu.id.au
Sun Jun 18 01:34:49 UTC 2006


first last wrote:
>>Code point \u0011 is not a valid character in XML at 
>>/usr/lib/perl5/vendor_perl/5.8.8/XMLTV.pm line 1757
> 
> It seems some of the javascript decoding routines that they are
> sending with the the obfuscated data are buggy - they decode the text
> to non-printable characters.  I modified get_content_base() to refetch
> the URL if the decoded page contained bogus characters, the next fetch
> usually works although occasionally two or three refetches are required.
> It works but it's not very elegant.

patch?

> The other problem:
> 
>>sub-title element: stripping trailing whitespace from value 'Part 1 '
>>for sub-title at /usr/share/perl5/XMLTV.pm line 1725, <> line 2.
>>bad data inside programme element, not writing
> 
> I fixed by "$title2 =~ s/\s+$//;" in get_closeup_details().  Not sure
> why the same code in XMLTV.pm doesn't seem to work.

This surely improved the situation, I now get all programs in good shape.

tv_grab_au 1.56: grabbing 7 days into /tmp/mythKBP9NF
Warning: parsed data incomplete for pid=14919466, url=http://tvguide.ninemsn.com.au/cu/default.asp?pid=14919466
td[0] = <12:00 am>
td[1] = <>
td[2] = <12:00 am>
td[3] = <>
td[4] = <( mins)>
td[5] = <Genre:>
td[6] = <>
td[7] = <>
td[8] = <>
td[9] = <>
td[10] = <Discuss this program>
td[11] = <( mins)>
td[12] = <Genre:>
td[13] = <>
td[14] = <>
td[15] = <>
td[16] = <>
td[17] = <Discuss this program>
body: q{HTML::Element=HASH(0xa2260c4)->as_text}
tv_grab_au: downloads = 7 guide pages, 1585 detail pages
1 failed detail pages, 0 stop times adjusted
tv_grab_au: finished in 690 seconds

-- 
Eyal Lebedinsky (eyal at eyal.emu.id.au) <http://samba.org/eyal/>
	attach .zip as .dat


More information about the mythtv-users mailing list