[mythtv-users] Australian TV grabber (tv_grab_au)
Michael Smith
myth at immir.com
Fri Apr 15 00:31:15 UTC 2005
Hi all.
I'm stripping down and rewriting the Nine MSN tv_grab_au script (that I
got from this list, posted in Nov 2004) mostly from scratch following
the style/features of the other grab scripts which are already in the
myth distribution, with the obvious goal of getting this script
incorporated too.
As such, I'd like to hear any suggestions for features that people find
lacking in the current version.
What I've got so far:
(*) Configuration code that automatically probes the NineMSN site for
regions and services and queries the site for appropriate ids and
channel lists. (tv_grab_au --configure)
(*) A channels file which is an initial attempt at following the ideas
of RFC 2838, plus code that handles channels not found in the
configuration file (hard to know what to do with those). Resulting
channels are of the form abc2.abc.gov.au, sydney.nine.com.au etc, and
the mapping can be seen using tv_grab_au --list-channels.
<http://www.rfc-editor.org/rfc/rfc2838.txt>
(*) Simplified parsing code for the guide and details pages, using the
"industry-standard" HTML::TreeBuilder parsing code (in line with the
existing myth tv grabbers). This should make it easier to adapt to any
changes in HTML code on the listing site.
Here are the things I'm thinking of looking into now:
(*) People with Foxtel (or some other commercial TV service) presumably
want both the free channels and the Foxtel channels downloaded, right?
What are you doing now? Two versions of the script with different
hard-coded configurations (one for free, one for Foxtel)?? If anyone
cares about this, I'd propose incorporating a multiple source system
into the script from the ground up.
(*) Parsing ourguide.com.au for the channels that it lists. It doesn't
appear to have any commercial services listed, and it is missing the
digital-only channels (e.g., ABC2, SBS News). It does, however, have
enough information to rip titles, start/end times, and short
descriptions for many programs right from the guide page.
(*) Configuration parameters and/or a probabilistic/adaptive method of
finding programs whose descriptions don't change often enough to query
the details page every bloody time (e.g., news programs, morning
programs, etc).
Obviously some of these features are designed to lower the load on Nine
MSN (or whichever site we need to switch to down the track).
*NON-FUNCTIONING* experimental code may be perused for those interested
-- I'll have a working version shortly and post a message when that is done.
<http://immir.com/tv_grab_au> the grabber
<http://immir.com/channels> channels configuration file
<http://immir.com/tv_grab_au.conf> example config file
Any comments?
Cheers,
Michael.
More information about the mythtv-users
mailing list