IMDb List - 100 Movie Limit
Opened this issue · 11 comments
Is this limited to the first 100 movies on an IMDb list? For example, I am searching this list, and it is matching 21 movies, and prompting to add 79 to Radarr. Is there a way to get around this limit?
I tend to agree. It seems it is only capable of grabbing the first 100 movies (so page1) of a list that has more items.
I hope you can fix this. The other scripts that you refer to in your readme can grab more, but always require input, while yours can be automated which I like a lot.
Thanks for this very helpful script!
Same when adding to a collection based on genre. Processes the first 100 only
This fixes the issue for everything, sorry I am new so just posting here for anyone.
In imdb_tools.py at line 22 it should be changed to the below (initiate title_ids, define a range, I have just set to 10 pages as it will except out after that, minor change to the request.get URL to include page number and then extend the empty title_ids instead). I have tested this up to about 480 items on a list. I was also able to verify it added 200+ to radarr without issue.
title_ids = []
for i in range(1,10):
try:
r = requests.get(imdb_url + '?page={}'.format(i), headers={'Accept-Language': library_language})
except requests.exceptions.MissingSchema:
return
tree = html.fromstring(r.content)
title_ids.extend(tree.xpath("//div[contains(@class, 'lister-item-image')]"
"//a/img//@data-tconst"))
if title_ids:
Hi @spekta-23, I tried your fix but I keep getting Syntax errors.
Traceback (most recent call last):
File "plex_auto_collections.py", line 8, in
import image_server
File "/home/user/Plex-Auto-Collections-master/image_server.py", line 4, in
from config_tools import ImageServer
File "/home/user/Plex-Auto-Collections-master/config_tools.py", line 12, in
from plex_tools import get_actor_rkey
File "/home/user/Plex-Auto-Collections-master/plex_tools.py", line 6, in
import imdb_tools
File "/home/user/Plex-Auto-Collections-master/imdb_tools.py", line 22
title_ids = []
^
SyntaxError: invalid syntax
Hi @spekta-23, I tried your fix but I keep getting Syntax errors.
Traceback (most recent call last):
File "plex_auto_collections.py", line 8, in
import image_server
File "/home/user/Plex-Auto-Collections-master/image_server.py", line 4, in
from config_tools import ImageServer
File "/home/user/Plex-Auto-Collections-master/config_tools.py", line 12, in
from plex_tools import get_actor_rkey
File "/home/user/Plex-Auto-Collections-master/plex_tools.py", line 6, in
import imdb_tools
File "/home/user/Plex-Auto-Collections-master/imdb_tools.py", line 22
title_ids = []
^
SyntaxError: invalid syntax
Getting the same thing. Did you find a fix for this?
This fixes the issue for everything, sorry I am new so just posting here for anyone.
In imdb_tools.py at line 22 it should be changed to the below (initiate title_ids, define a range, I have just set to 10 pages as it will except out after that, minor change to the request.get URL to include page number and then extend the empty title_ids instead). I have tested this up to about 480 items on a list. I was also able to verify it added 200+ to radarr without issue.
title_ids = [] for i in range(1,10): try: r = requests.get(imdb_url + '?page={}'.format(i), headers={'Accept-Language': library_language}) except requests.exceptions.MissingSchema: return tree = html.fromstring(r.content) title_ids.extend(tree.xpath("//div[contains(@class, 'lister-item-image')]" "//a/img//@data-tconst")) if title_ids:
Can you post you entire imdb_tools.py?
import re
import requests
from lxml import html
from tmdbv3api import TMDb
from tmdbv3api import Movie
from tmdbv3api import Collection
from tmdbv3api import Person
import config_tools
def imdb_get_movies(config_path, plex, data):
tmdb = TMDb()
movie = Movie()
tmdb.api_key = config_tools.TMDB(config_path).apikey
imdb_url = data
if imdb_url[-1:] == " ":
imdb_url = imdb_url[:-1]
imdb_map = {}
library_language = plex.Library.language
title_ids = []
for i in range(1,10):
try:
r = requests.get(imdb_url + '?page={}'.format(i), headers={'Accept-Language': library_language})
except requests.exceptions.MissingSchema:
return
tree = html.fromstring(r.content)
title_ids.extend(tree.xpath("//div[contains(@class, 'lister-item-image')]"
"//a/img//@data-tconst"))
if title_ids:
for m in plex.Library.all():
if 'themoviedb://' in m.guid:
if not tmdb.api_key == "None":
tmdb_id = m.guid.split('themoviedb://')[1].split('?')[0]
tmdbapi = movie.details(tmdb_id)
imdb_id = tmdbapi.imdb_id
else:
imdb_id = None
elif 'imdb://' in m.guid:
imdb_id = m.guid.split('imdb://')[1].split('?')[0]
else:
imdb_id = None
if imdb_id and imdb_id in title_ids:
imdb_map[imdb_id] = m
else:
imdb_map[m.ratingKey] = m
matched_imbd_movies = []
missing_imdb_movies = []
for imdb_id in title_ids:
movie = imdb_map.pop(imdb_id, None)
if movie:
matched_imbd_movies.append(plex.Server.fetchItem(movie.ratingKey))
else:
missing_imdb_movies.append(imdb_id)
return matched_imbd_movies, missing_imdb_movies
def tmdb_get_movies(config_path, plex, data):
try:
tmdb_id = re.search('.*?(\d+)', data)
tmdb_id = tmdb_id.group(1)
except AttributeError: # Bad URL Provided
return
t_movie = Movie()
tmdb = Collection()
tmdb.api_key = config_tools.TMDB(config_path).apikey # Set TMDb api key for Collection
if tmdb.api_key == "None":
raise KeyError("Invalid TMDb API Key")
t_movie.api_key = tmdb.api_key # Copy same api key to Movie
t_col = tmdb.details(tmdb_id)
t_movs = []
for tmovie in t_col.parts:
t_movs.append(tmovie['id'])
# Create dictionary of movies and their guid
# GUIDs reference from which source Plex has pulled the metadata
p_m_map = {}
p_movies = plex.Library.all()
for m in p_movies:
guid = m.guid
if "themoviedb://" in guid:
guid = guid.split('themoviedb://')[1].split('?')[0]
elif "imdb://" in guid:
guid = guid.split('imdb://')[1].split('?')[0]
else:
guid = "None"
p_m_map[m] = guid
matched = []
missing = []
# We want to search for a match first to limit TMDb API calls
# Too many rapid calls can cause a momentary block
# If needed in future maybe add a delay after x calls to let the limit reset
for mid in t_movs: # For each TMBd ID in TMBd Collection
match = False
for m in p_m_map: # For each movie in Plex
if "tt" not in p_m_map[m] is not "None": # If the Plex movie's guid does not start with tt
if int(p_m_map[m]) == int(mid):
match = True
break
if not match:
imdb_id = t_movie.details(mid).entries['imdb_id']
for m in p_m_map:
if "tt" in p_m_map[m]:
if p_m_map[m] == imdb_id:
match = True
break
if match:
matched.append(m)
else:
missing.append(t_movie.details(mid).entries['imdb_id'])
return matched, missing
def tmdb_get_summary(config_path, data, type):
collection = Collection()
person = Person()
collection.api_key = config_tools.TMDB(config_path).apikey
person.api_key = collection.api_key
collection.language = config_tools.TMDB(config_path).language
person.language = collection.language
if type == "overview":
return collection.details(data).overview
elif type == "biography":
return person.details(data).biography
Let me know if that takes care of it for you, dont forget to use Python3.6 I had issues with that myself
Let me know if that takes care of it for you, dont forget to use Python3.6 I had issues with that myself
Would you be willing to upload your imdb_tools.py file? The formatting didn't translate over github.
Thank you!!!
Had a similar problem and managed to get @spekta-23 code to work.
Just a note though I do use MZA's fork, but I doubt this script is very different. I uploaded the imd_tools.py here
I know this is late but this issue has been fixed in the fork mza921/Plex-Auto-Collections as well as many other improvements and bug fixes