Crawl for imdb ID from subscene / folders?

Posted: 26 Sep 2015, 01:04
by denywinarto
Would it be possible to use a FileBot / Groovy script to crawl for IMDb IDs from Subscene or from folders?

I'm limiting my library to the movies and TV shows that have subs from my country.
My library already has a movie.nfo for almost every movie,
and I need some way to connect my library with Subscene.
Unfortunately, the only thing Subscene has to match on is the IMDb link on each sub page.

So it would probably work like this:
1. Subscene has a page that lists subs filtered by country, in alphabetical order
2. Crawl that page for IMDb links
3. Crawl my library for IMDb links
4. Compare the two and see which ones are missing
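
Step 4 above could be sketched as a plain set difference once both ID lists exist. This is a minimal sketch in plain Python rather than a FileBot Groovy script; the function name `missing_from_library` and the sample IDs are made up for illustration, and it assumes both inputs are already lists of IMDb IDs such as `tt0133093`.

```python
def missing_from_library(subscene_ids, library_ids):
    """Return IMDb IDs that appear in the Subscene list but not in the library."""
    # Normalize case and whitespace so "TT0068646" and "tt0068646 " compare equal.
    wanted = {i.strip().lower() for i in subscene_ids}
    owned = {i.strip().lower() for i in library_ids}
    # Set difference: everything wanted that is not already owned.
    return sorted(wanted - owned)


if __name__ == "__main__":
    # Illustrative sample input only, not real crawl results.
    subscene_ids = ["tt0133093", "tt0110912", "tt0068646"]
    library_ids = ["tt0133093", "TT0068646"]
    print(missing_from_library(subscene_ids, library_ids))
```

Swapping the two arguments would instead list library titles that have no matching entry in the Subscene list.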

Is it possible to do this?

Thanks in advance.

Re: Crawl for imdb ID from subscene / folders?

Posted: 26 Sep 2015, 08:50
by rednoah
1.
Searching *.nfo files for IMDb IDs is easy.
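
As a rough illustration of the *.nfo search, here is a minimal sketch in plain Python (not a FileBot Groovy script). It assumes each .nfo contains its IMDb ID somewhere as text, either as a bare ID or inside an IMDb URL, and that IDs look like `tt` followed by at least seven digits. The names `imdb_ids_in_text` and `scan_library` are hypothetical helpers, not part of FileBot.

```python
import re
from pathlib import Path

# Assumption: IMDb title IDs are "tt" followed by at least 7 digits.
IMDB_ID = re.compile(r"tt\d{7,}")


def imdb_ids_in_text(text):
    """Return the IMDb IDs found in a string, in order, without duplicates."""
    return list(dict.fromkeys(IMDB_ID.findall(text)))


def scan_library(root):
    """Map each *.nfo file under root to the first IMDb ID it contains."""
    found = {}
    for nfo in sorted(Path(root).rglob("*.nfo")):
        # Ignore undecodable bytes; .nfo files come in assorted encodings.
        ids = imdb_ids_in_text(nfo.read_text(encoding="utf-8", errors="ignore"))
        if ids:
            found[nfo] = ids[0]
    return found


if __name__ == "__main__":
    import sys

    # Usage: python scan_nfo.py /path/to/library
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    for path, imdb_id in scan_library(root).items():
        print(imdb_id, path)
```

Files without a recognizable ID are simply skipped, so comparing the result against the number of movie folders gives a quick idea of how many .nfo files are missing or incomplete.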

2.
Scraping Subscene and downloading subtitles is tricky. I don't do scrapers, and it's probably not allowed according to their ToS. But if they had an API for 3rd party programs, then I could have a look.

Re: Crawl for imdb ID from subscene / folders?

Posted: 29 Sep 2015, 14:31
by kim

Re: Crawl for imdb ID from subscene / folders?

Posted: 29 Sep 2015, 14:36
by rednoah
Does that API support download of subtitles, not just search with links to pages of other sites?

Re: Crawl for imdb ID from subscene / folders?

Posted: 30 Sep 2015, 08:07
by denywinarto
rednoah wrote:
1.
Searching *.nfo files for IMDb IDs is easy.

2.
Scraping Subscene and downloading subtitles is tricky. I don't do scrapers, and it's probably not allowed according to their ToS. But if they had an API for 3rd party programs, then I could have a look.
I found a third-party Subscene API, but it seems to be inactive. Is it possible to use it?

https://github.com/andy-pham/subscene-api

1. Is it really possible? Would you mind giving me a hint on how to do it, rednoah?

How can I crawl for IMDb links from this kind of page?

http://v2.subscene.com/subtitles/indone ... guage.aspx
Could it search for IMDb links within a Subscene page?

Re: Crawl for imdb ID from subscene / folders?

Posted: 30 Sep 2015, 16:19
by rednoah
After 5 seconds of link clicky-clicky... the "API" returns a 503 Over Quota server error, and the "Popcorn Time" has a DMCA takedown notice.

For future reference, by API I mean an officially endorsed API.

THREAD CLOSED.