It is currently March 28th, 2024, 3:14 pm



Post new topic Reply to topic  [ 12 posts ] 
Author Message
 Post subject: [imdb.com]
PostPosted: December 24th, 2019, 10:32 pm 

Joined: March 30th, 2015, 12:31 pm
Posts: 667
URL: https://www.imdb.com/search/title/?title_type=movie

Error message: Failed to find message

Problem: Failed to find message

i tried to add "p.text-muted" "div.subtext" but multi page leech giving error.

only i need to collect the movie name and would be nice if can grab the genre and imdb rating with votes.


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: December 28th, 2019, 8:08 am 
Site Admin
User avatar

Joined: March 10th, 2011, 11:14 pm
Posts: 12629
Location: Earth
Message selector should be taken from the post ("movie" in this case) page, for example: https://www.imdb.com/title/tt8623904

It really depends on what you want to leech. For the basic summary "div.plot_summary" would be fine.

Here are all selectors:

You just need to add custom selectors: http://leecher.themasoftware.com/faq.php?expand=faq206

Go to "WEBSITES" -> select that website -> "Selectors" tab -> add these selectors:

Subject:
Code:
h1


Message:
Code:
div.plot_summary


Subject with URL:
Code:
h3.lister-item-header

_________________
themaPoster | themaCreator | themaManager | themaLeecher | themaRegister


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: December 28th, 2019, 7:09 pm 

Joined: March 30th, 2015, 12:31 pm
Posts: 667
page number help. after 9950 IMDB giving different page number. if you click next after below link

Code:
https://www.imdb.com/search/title/?title_type=movie&start=9950&ref_=adv_nxt


this is for 10000 page list. will TL able to grab such page link for multipage fetch
Code:
https://www.imdb.com/search/title/?title_type=movie&after=WzE0NDcwLCJ0dDI3Mjg3ODQiLDEwMDAwXQ%3D%3D&ref_=adv_nxt


=====================
multipage has option "don't use subject with url" if select that TL add
movie name (Year)

but "Pages" option don't have such option. is there anyway to add year with title for "Pages" leech?


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: December 29th, 2019, 10:57 am 
Site Admin
User avatar

Joined: March 10th, 2011, 11:14 pm
Posts: 12629
Location: Earth
No, it won't be possible to leech those further pages with "Multi-Pages".

There is exactly the same option for "Pages" as well in "Websites" -> "Pages" -> "Other settings" tab (above the page "URL" field).

_________________
themaPoster | themaCreator | themaManager | themaLeecher | themaRegister


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: January 7th, 2020, 11:17 pm 

Joined: March 30th, 2015, 12:31 pm
Posts: 667
works all perfect and for increment should i use "50" or 51?


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: January 8th, 2020, 9:40 am 
Site Admin
User avatar

Joined: March 10th, 2011, 11:14 pm
Posts: 12629
Location: Earth
Use 50. They adjust the page results automatically. It will work better.

_________________
themaPoster | themaCreator | themaManager | themaLeecher | themaRegister


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: February 19th, 2022, 4:06 pm 

Joined: March 30th, 2015, 12:31 pm
Posts: 667
for recent imdb template changes, message selector is not working.

Code:
Failed to find message. Custom "Message" selector need to be added.


i tried this multiple selector but i could not able to add the proper selector. kindly help please.


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: February 19th, 2022, 6:34 pm 
Site Admin
User avatar

Joined: March 10th, 2011, 11:14 pm
Posts: 12629
Location: Earth
It really depends on what you want to leech, their information is all over the place.

For plot only this would work:
Code:
span.eqbKRZ

_________________
themaPoster | themaCreator | themaManager | themaLeecher | themaRegister


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: December 11th, 2023, 5:37 am 

Joined: March 30th, 2015, 12:31 pm
Posts: 667
they have changed their layout. can you please help me again

https://www.imdb.com/search/title/?title_type=feature

also they changed their next page options. it's like loading the next page with javascript or something. possible to do multipage leech?

i only need the imdb URL in message and title. so that I can get movie list.


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: December 11th, 2023, 9:09 am 
Site Admin
User avatar

Joined: March 10th, 2011, 11:14 pm
Posts: 12629
Location: Earth
Go to "WEBSITES" -> select that website -> "Selectors" tab -> add these selectors:

Subject:
Code:
h1


Message:
Code:
span[data-testid=plot-xl]


Subject with URL:
Code:
a.ipc-title-link-wrapper



Seems like there are no pages anymore and it's not possible to view all results at once.

Multi-Pages won't really work.

_________________
themaPoster | themaCreator | themaManager | themaLeecher | themaRegister


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: March 23rd, 2024, 6:58 am 

Joined: March 30th, 2015, 12:31 pm
Posts: 667
Freddy wrote:
Go to "WEBSITES" -> select that website -> "Selectors" tab -> add these selectors:

Subject:
Code:
h1


Message:
Code:
span[data-testid=plot-xl]


Subject with URL:
Code:
a.ipc-title-link-wrapper



Seems like there are no pages anymore and it's not possible to view all results at once.

Multi-Pages won't really work.



sorry to bother you again. following URL msg leech

Code:
https://www.imdb.com/search/title/?title_type=feature


this is the output we can get. is there anyway we can include the imdb URL in message. so that we can get use imdb URL leech feature of TL?
https://i.postimg.cc/CMfC1sMs/image.png


Top
 Profile  
Reply with quote  
 Post subject: Re: [imdb.com]
PostPosted: March 23rd, 2024, 9:22 pm 
Site Admin
User avatar

Joined: March 10th, 2011, 11:14 pm
Posts: 12629
Location: Earth
Yes, you can with message replacement.

Search for:
Code:
^


Replace with:
Code:
<postUrl>\n\n


Enable regex search.

Assign only for that site after adding.

_________________
themaPoster | themaCreator | themaManager | themaLeecher | themaRegister


Top
 Profile  
Reply with quote  
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 12 posts ] 

Who is online

Users browsing this forum: No registered users and 14 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
cron
Theme designed by stylerbb.net © 2008
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group
All times are UTC