[mythtv] imdb.pl

Tue Feb 20 01:20:43 UTC 2007

> -----Original Message-----
> From: mythtv-dev-bounces at mythtv.org [mailto:mythtv-dev-bounces at mythtv.org] On
> Behalf Of Anduin Withers
> Sent: Tuesday, 20 February 2007 12:01 PM
> To: 'Development of mythtv'
> Subject: Re: [mythtv] imdb.pl
> 
> > its worked for me for pretty much the last 6 months without any problems.
> >
> > you may find the search/parsing routines of use, specifically
> > search_imdb_online(), imdb_scalar_parser(),imdb_list_parser() and
> > get_imdb_movie_online() routines.
> 
> That code is going to need the same updates, it is searching relative to <b>
> tags, imdb.pl did something similar (though without a tag parser). Many of
> those <b> tags are now <h5>.

indeed they are. i was looking at something in my schedule for today and was
about to say you were wrong, but its the caching that meant what i was looking
at was correct and imdb must have changed their site in the last week or so.

> Most imdb.pl alternatives have their drawbacks. IMDBPy is huge, others have
> more exotic perl dependencies.

the imdb script i referred you to originally used the .txt.gz offline imdb
datafiles and parsed those also, but was really just far more complicated than
it needed to be.

you'll nose that the comments up the top of
http://svn.whuffy.com/index.cgi/browser/postprocessors/imdb_augment_data pretty
much say exactly what you just said: 

	# based roughly on a few existing IMDB XMLTV modules and IMDB CPAN
	# modules  but doesn't actually use them due to the large number of
	# interdependencies they drag in.
	#     much credit goes to Michael Stepanov for his excellent IMDB::Film
	#     module and the regex's used to match data from IMDb pages

cheers,

lincoln.