AMC: Subs for multi-part movies
AMC: Subs for multi-part movies
This is more an observation for now: with a subtitles=xx parameter, FileBot is fetching subtitles for multi-part movies. These would need to be split and adjusted to actually work.
I've seen net/filebot/subtitle contains classes for transcoding subtitles. Do these work or is there any documentation?
I've seen net/filebot/subtitle contains classes for transcoding subtitles. Do these work or is there any documentation?
Re: AMC: Subs for multi-part movies
--def subtitles=en will only fetch subtitles by file hash, not by movie name, so the subtitles should match exactly that file.
@see viewtopic.php?f=3&t=2615
@see viewtopic.php?f=3&t=2615
Re: AMC: Subs for multi-part movies
Thanks, I will try to narrow down what's happening here. Maybe the uploaded subs have already been wrong.
Re: AMC: Subs for multi-part movies
Hi, below is an example of FileBot fetching the same single CD subtitle for both CD parts:
The downloaded subtitles are identical for both CD parts and seem to be either https://www.opensubtitles.org/en/subtit ... 83/blow-fr
or https://www.opensubtitles.org/en/subtit ... 00/blow-fr
From what I understand, the hash used to download is over the first and last 64K of the movie, so a hash collision seems rather unlikely. This would leave us with a wrong upload on OpenSubtitles, API problems or a bug in FileBot.
Is there any easy way to debug this / calculate the moviehashes / etc?
Code: Select all
Get [French] subtitles for 2 files
Looking up subtitles by hash via OpenSubtitles
Fetching [French] subtitles [Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt] from [OpenSubtitles]
Export [Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt] as [SubRip / UTF-8]
Writing [Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt] to [Blow (2001).CD1.fra.srt]
Fetching [French] subtitles [Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt] from [OpenSubtitles]
Export [Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt] as [SubRip / UTF-8]
Writing [Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt] to [Blow (2001).CD2.fra.srt]
or https://www.opensubtitles.org/en/subtit ... 00/blow-fr
From what I understand, the hash used to download is over the first and last 64K of the movie, so a hash collision seems rather unlikely. This would leave us with a wrong upload on OpenSubtitles, API problems or a bug in FileBot.
Is there any easy way to debug this / calculate the moviehashes / etc?
Re: AMC: Subs for multi-part movies
1.
AFAIK, BSPlayer uploads subtitles automatically without asking the user if they're good or bad, so bad upload is likely the case here.
2.
The osdb.explain script might shed some light on this issue:
AFAIK, BSPlayer uploads subtitles automatically without asking the user if they're good or bad, so bad upload is likely the case here.
2.
The osdb.explain script might shed some light on this issue:
Code: Select all
filebot -script fn:osdb.explain /path/to/movie --def fetch=y
Re: AMC: Subs for multi-part movies
The result is the same for both parts: and most interesting, it's a tag match (due to MovieHash=0 maybe??) of a single-part subtitle. I would expect these to be eliminated:
- In strict mode, due to not being a hash match
- In non-strict mode, due to not matching the part / total parts count and being significantly longer than the matched movie part (i.e. SubLastTS > (part playing time + 1minute)
Code: Select all
File: /volume1/_INPUT_/Blow (2001) [640x272 EN]/Blow (2001).CD1.avi
Hash/Tag Lookup (hash: 72f6d85ab3c052aa, size: 728033280, lang: fr_FR, tag: Blow (2001).CD1)
[...]
File: /volume1/_INPUT_/Blow (2001) [640x272 EN]/Blow (2001).CD2.avi
Hash/Tag Lookup (hash: f62de139e78c9c3d, size: 729657344, lang: fr_FR, tag: Blow (2001).CD2)
[...]
Best Hash Match: [IDSubtitle:3300000, IDSubtitleFile:1951888932, IDSubMovieFile:0, IDMovie:377,
MovieHash:0, MovieByteSize:0, MovieName:Blow, MovieYear:2001, MovieTimeMS:0, MovieFPS:25.000,
SubFileName:Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt, SubLastTS:01:58:08, SubFormat:srt,
SubLanguageID:fre, ISO639:fr, SubActualCD:1, SubSumCD:1, MatchedBy:tag,
]
Re: AMC: Subs for multi-part movies
Interesting. Something may have changed over time here. "tag" used to refer to an exact match of the video/subtitle file names (excluding subtitle extension).
e.g.
@see http://trac.opensubtitles.org/projects/ ... hSubtitles
If this is not a bug, and API behaviour has indeed changed, than "tag lookup" will have to be removed. There doesn't seem to be a replacement for "exact filename lookup" as far as I can see.
e.g.
Code: Select all
Blow (2001).CD2.avi
Blow (2001).CD2.eng.srt <-- TAG MATCH
Blow[2001]DVDrip[ENG]-MissRipZ.FR.srt <-- NOT SUPPOSED TO BE A TAG MATCH
If this is not a bug, and API behaviour has indeed changed, than "tag lookup" will have to be removed. There doesn't seem to be a replacement for "exact filename lookup" as far as I can see.
Re: AMC: Subs for multi-part movies
"For perfect matches use moviehash/moviebytesize searching, for movie matches use tag/imdbid searching, if you can not use any of them, use fulltext search (least accurate)"
The way I understand is that only the hash/size provides a perfect match; tag/imdbid is the preferred way to search (as you get at least the right movie) and fulltext search the least accurate option.
The second best match on this query was even worse - I can't see **any** tag match here. Not even the movie is matching!
The way I understand is that only the hash/size provides a perfect match; tag/imdbid is the preferred way to search (as you get at least the right movie) and fulltext search the least accurate option.
The second best match on this query was even worse - I can't see **any** tag match here. Not even the movie is matching!
Code: Select all
Result 2: [IDSubtitle:3254295, IDSubtitleFile:1951831232, IDSubMovieFile:0, IDMovie:93990, IDMovieImdb:1055795, SubFileName:terminator.the.sarah.connor.chronicles.s01e07..proper.hdtv.xvid-notv.FR.srt, SubLastTS:00:40:53, SubFormat:srt, SubEncoding:CP1252, SubHash:ca2f0d7d0e0e403922547501d42fa2cd, SubSize:44567, MovieHash:0, MovieByteSize:0, MovieName:"Terminator: The Sarah Connor Chronicles" The Demon Hand, MovieNameEng:, MovieYear:2008, MovieReleaseName:S01E07 Hdtv NoTV Proper, MovieTimeMS:0, MovieFPS:23.980, MovieImdbRating:8.0, MovieKind:episode, SeriesSeason:1, SeriesEpisode:7, SeriesIMDBParent:851851, SubLanguageID:fre, ISO639:fr, LanguageName:French, UserID:120910, UserRank:trusted, UserNickName:ninjaw, SubAddDate:2008-02-28 18:48:13, SubAuthorComment:, SubFeatured:0, SubComments:0, SubDownloadsCnt:695, SubHearingImpaired:0, SubRating:5.0, SubHD:1, SubBad:0, SubActualCD:1, SubSumCD:1, MatchedBy:tag, QueryNumber:0, ...]
Re: AMC: Subs for multi-part movies
Alright then, tag lookup seems completely broken then, so it's been removed with the latest revision. strict match is now moviehash/filesize only.
Re: AMC: Subs for multi-part movies
What's the non-strict policy?
Re: AMC: Subs for multi-part movies
Lookup by Name kinda like a human would do it. Identify movie, search by name/id, check list of subtitles and pick the one that matches your files best.
@see viewtopic.php?f=3&t=2615
Note that the amc script will always force strict mode for subtitles regardless of the -non-strict option set by you. If -non-strict subtitle lookup is desired, then the suball script is recommended for that.
@see viewtopic.php?f=3&t=2615

Re: AMC: Subs for multi-part movies
To confirm: FB no longer fetches the wrong subtitles with the latest jar.
Re: AMC: Subs for multi-part movies
Thanks for checking and confirming. 
