DVD Rip Prep (drp) is a bash script that attempts to prepare an MKV file produced from a DVD by MakeMKV for upscaling. It might also work with rips made with other software, but I have only tested MakeMKV rips. It will probably not be of any use with other video files. It only accepts NTSC or PAL SD video and that video ideally should not have been edited or reencoded since being ripped from DVD. It is possible that it will work with other SD sources like digitized VHS or Laserdisc video, but I don't have any to try.
Before you read further, this is a hobby project that I made for my own needs. I am not a video professional and this is as much a learning project for me as anything. I don't really know what I am doing. The script is just built around ffmpeg and I am probably not even making full use of its capabilities. So, keep your expectations minimal and perhaps you will be pleasantly surprised.
There are other well established tools for doing this kind of work, but they weren't a good solution for me because they run on Windows and generally work best with rips that were not made by MakeMKV. I run Linux and I have thousands of MakeMKV rips and I really didn't want to start over. I also wanted a tool that was automatic as much as possible. There is no question that I could get better results doing everything manually, but I only have so much time to spend. My goal was to get a good result, not necessarily the best possible result, with the lowest possible amount of effort per video.
To understand why this exists, you need to know a few things about video, DVDs, and upscaling. Until recently, most movies and TV shows were shot on film, at a frame rate of 24 FPS (actually 23.976025 or 24000/1001). The rate at which TV screens refresh is tied to the cycle rate of the power that they use. In North America, this was 30 FPS (29.97003 or 30000/1001) and, in Europe, it was 25 FPS. The North American standard is known as NTSC and the European standard is PAL. There are other standards as well. Screens are now somewhat more flexible than in the days of tube TVs, but we still live in a world where the standards were set in the past. The problem here is that the rate at which the video was recorded and the rate at which is is displayed are different. Several methods are used to make 24 FPS video play at 30 FPS or 25 FPS. In North America, we used telecine to make 24 FPS content play like 30 FPS. In Europe, since 24 and 25 FPS are so close, they usually just played 24 FPS content at 25 FPS. The video and audio play about 4% faster, but the difference isn't very noticeable.
Telecine is the process of adding 20% more frames to video in order to make it play at 30 FPS instead of 24 FPS. This is done by inserting an extra frame after every set of four, but the extra frame is special. Telecine takes the fourth frame and splits it into two interlaced frames. The original video consisted of four progressive frames: PPPP. The telecined version becomes PPP(II). This accomplishes the job of getting the video to play at 30 FPS, but it introduces a motion irregularity, because those two interlaced frames are copies of the same image. This wasn't as noticeable on CRTs, but it is easier to spot on modern screens. We're so used to it that you might never notice, but there is a way that you can make it worse and more noticeable and that is by running a deinterlace on the telecined video. That takes the PPP(II) and turns it into PPP(PP). There are now two full, duplicate frames and your display won't be aware of it. The solution is to remove the telecine instead of deinterlacing. But, before you can do that, you need to know if your video is telecined.
There are also two forms of telecine: soft and hard. In soft telecine, the video is actually stored as progressive content on the DVD, but the player is instructed to create and insert the interlaced frames. If you know this, you can just extract the progressive frames and move ahead. In hard telecine, the video is stored in telecined format, so the interlaced frames actually exist on the disc and the player just reads them off like anything else. To remove hard telecine, you need to recombine those interlaced pairs, hopefully without altering anything else. This process is known as inverse telecine or, sometimes, detelecine.
Video can also be fully interlaced. While telecine uses interlacing, it only interlaces 25% of the original frames. Full interlacing is a signature of content that was either shot or edited on video. Filmed content is almost always telecined. Content that was recorded on a video camera is generally interlaced. True interlacing means that every frame of video is spread across two interlaced fields. Deinterlacers can deal with this in two ways: by combining the two fields into a single frame, which preserves the 30 FPS frame rate, or by extrapolating the missing lines in each field, resulting in a doubled frame rate of 60 FPS.
You will find these techniques used in different places. Almost all North American DVDs of movies are either soft or hard telecined and that's it. European DVDs of movies are usually just progressive content played at the slightly faster 25 FPS. You can just slow it down to 24 FPS and the job is done. TV is where it gets complicated. TV made before the 1980s was usually shot and edited on film and is either soft or hard telecined on disc. TV made after about 2005 seems most often to be soft telecined, but fully interlaced is also not unusual. It is those years in between that are a real problem. From around 1980 to 2005, video was often shot on one format and edited on another. The original bits and the edited bits were then spliced back together. Often, this will result in combinations of soft telecine and hard telecine or even full interlacing. This is really hard to deal with automatically. If you inverse telecine soft telecined frames, then you drop 20% of the frames. If you deinterlace telecined content, then you make the interlaced frames progressive and turn the original 24 FPS video into 30 FPS with a stutter. When the techniques are mixed in a video, if you apply a single method to reverse them, you will fix some of the video but break the rest.
When telecine is removed from filmed content, the frame rate is restored to 24 FPS. When fully interlaced content is deinterlaced, then the frame rate is either 30 or 60 FPS. This is no problem when only one of these techniques is used, but when they are combined, there is no way to get to uniformly progressive frames without either increasing the frame rate of the 24 FPS material or dropping frames from the 30/60 FPS material. After all, by removing telecine, we have removed the original solution to this rate mismatch. There is no single solution to this problem. In many cases, videos consist of something like 95% telecined content and 5% interlaced. In such cases, the interlacing tends to be in the titles and credits and you might never notice if you dropped 20% of the frames from those sections.
In other cases, the 30 FPS sections might be special effects and dropping frames would be really noticeable. In that case, the only solution is to increase the overall frame rate of the video to something that can accommodate both rates. The closest frame rate that is wholly divisible by 24, 30, and 60 is 120. So, a video that mixes 24 FPS filmed content and 30 FPS video content can be increased to 120 FPS by quintupling the 24 FPS frames and quadrupling the 30 FPS frames. This maintains the original cadence of all content. The script can now perform this 120FPS conversion (set FIXRATE to 2) and this will solve almost any DVD rip, as long as you are willing to accept the downsides of conversion to 120FPS, which are increased file size, increased upscaling time, and incompatibility with screens that don't have 120Hz refresh rates.
Writing this script has been quite an education and, if I had known what I know now in the beginning, I might have dropped the idea. The ways that these techniques are used on TV shows, even within a given season of a given show, is surprisingly inconsistent and, in some cases, looks outright crazy. There are, of course, a lot of shows that use just one technique or use a mix but at least in a predictable pattern. But there are also a surprisingly large number of shows that go from one technique to another from episode to episode and that mix them in ways that seem quite strange. Some use mixes in a way that almost seems intentionally designed to defeat any future efforts to disentagle them. I have been able to figure out solutions for a lot of shows, but there are some that I will probably never crack and can only approach with a "least bad" method.
Most people who are interested in this script are probably intending to upscale video with Topaz Video AI or other software, which is the reason why I wrote it. TVAI can deal with the easy cases itself, as long as you know how to tell it what to do. The hard part is knowing what to tell it to do. TVAI does not recognize soft telecined content, which you can see when you load the video. It will see the original frame rate as 29.97 and will output the same. So, you are better off re-encoding soft telecined videos before importing to TVAI. TVAI can perform inverse telecine for hard telecined video using the setting found under Input/Edit, but this will wreck anything that isn't actually hard telecined. And TVAI can deinterlace using the very same function that I use in the script, but the problem is that there are very few instances of movies or TV on DVD that are actually interlaced all the way through. And Topaz has nothing that will deal with the problem of mixed methods in the video.
If you tell TVAI to do the wrong things, your results generally won't be as good as they could be. You might not notice the problems introduced by misapplications of deinterlacing because TVAI output often looks so much better that it is easy to overlook the small problems that it can introduce. I had upscaled a lot of videos, which took a lot of time, before I started to notice stutters and artifacts related to improper deinterlacing. I then spent even more time fiddling around with TVAI settings to no great success because the real problem was that I was feeding it defective video.
So, the first function of drp is to look at MKVs and decide if they are soft telecined, hard telecined, interlaced, or some horrifying combination of these techniques (which is, unfortunately, common). Originally, this was my only goal for the script. I wanted something that would analyze the videos and give me a report so that I could then apply the appropriate techniques to recover fully progressive video for upscaling. As these things tend to go, I then wondered if I could make the script actually do all of the work and fix the video. In a lot of cases, now it can.
The most basic feature of the script is to diagnose what kind of frame structure a DVD rip has. You can use the script in dry run mode, which will just output a report in the same directory that you run it from that lists the diagnosis for each file. To use this feature, just change to a directory containing your MKV files and run drp -d.
The script will output two files: diagreport.txt and maplist.txt. The diagreport.txt file lists the diagnosis for each file. The maplist.txt file contains a list of files that use mixed methods and need further analysis. If AUTOMAP is enabled, then the script will continue to analyze and produce segment maps for each of these files.
If you are lucky, most of the content that you want to upscale will either use a single mode of frame rate manipulation or it will only make brief use of a secondary method that can be overlooked.
But, if your video was made in the 80s, 90s, or early 2000s, there is a good chance that it uses mixed methods. To fix such a video, the script needs to break it up into segments by frame rate and then work on each segment. The script uses a segment map file to do this work.
There are two ways to produce a segment map file. The manual method is to look at the video yourself, identify the places where there appears to be a change between original filmed content and edited content, and make your own map file. The automatic method uses another script, smap, to try to find the frame rate changes.
If you have a mixed content video, the script will now look for a segment map file, which is just a series of times in a text file. If you have a file named my-video.mkv, you will create a file named my-video.smap in the same directory. Scan through the video and look for places where titles, credits, or computer graphics were added. Get the start and end times for each such segment and add them to the times file in this format:
start,00:02:00
00:02:00,00:04:00
00:04:00,00:48:00
00:48:00,end
This is an example of a very common pattern in TV shows that have overlayed titles and credits but no CGI effects. This would look like a two minute intro, followed by two minutes of overlayed titles and major credits, followed by the 44 minute body of the episode, followed by the end credits. You have to account for the full run of the video - no gaps. Start and end are special times, since the script can get the first and last frame number automatically. You should try to get as close as possible to the time where the segment changes. If you are lucky, your videos will only have a few segments, but I have tested the script successfully with as many as thirty-three.
If you have identified the segments correctly, the script will apply the appropriate technique to each segment and output a rejoined video that still syncs up with the audio. If you got it wrong, then the audio will be out of sync. That is a sign that you either need to tighten up the segment times or look for an additional segment (maybe a special effects sequence) that you missed.
I'm sure that you are wondering why you would do manual mapping when the script can now (sometimes) do it automatically. Automatic mapping is really slow. It is entirely possible that your video is all one mode except for something like a 20 second preview or recap at the beginning of the video. If you have a whole season of a show that does this, you might be looking at a choice between letting the script grind away for 50 hours or just spending the time to look at each episode and get the transition times. In some cases, the time might even be the same in each episode.
It is also possible for the script to automatically generate segment maps. This now works pretty reliably on most of the rips that I have tested, but I don't expect that it will work for everything. If you have videos with relatively simple structures, like the example above in the Manual Mapping section, then it will probably be faster to make manual maps than to do it automatically. However, if your video has many small segments, the automatic mapper is likely to do the job faster.
Drp uses an accessory script, smap, to do the mapping. If you have AUTOMAP enabled, which is now the default, then the whole process takes place automatically. If AUTOMAP is disabled, then you need to run the smap script on each file yourself (smap file).
Smap divides the file into hundreds or thousands of short segments and analyzes each of those to determine where frame rate changes occur in the file. It then outputs this data as a segment map file. Once the segment map file has been created, you can run drp normally and it will use the map file.
Automatic mapping is a slow process that generates a lot of small files and can use a lot of disk space. I'm not sure at what point all these writes become a wear and tear problem for drives, but it seems like a good idea to avoid them if possible, so the smap script has an option to create and use a memory filesystem. This is controlled by the USERAMDISK setting at the top of the smap script. It is disabled by default, but if you have at least 16GB of RAM, you might want to enable it. If you have AUTOMAP enabled in drp and USERAMDISK enabled in smap, then you should run drp with sudo. Smap will set up and tear down the memory filesystem with every run and it needs root permissions to do that. Without using sudo, it will prompt for authentication on each run and will wait for you to come back and type your password.
The script is best at converting single method videos. It can do the following pretty reliably:
- Convert 30FPS soft telecined video to 24FPS progressive video
- Convert 30FPS hard telecined video to 24FPS progressive video
- Convert 30FPS interlaced video to 30FPS progressive video
- Convert 25FPS PAL (interlaced or progressive) to 24FPS progressive video
These conversions should give you a file with a consistent frame rate, and no introduced stutters, that can be processed in TVAI without resorting to an interlaced method. It is worth noting that the aim of this script is not perfection. Except for soft telecined and progressive content, it will probably never produce a result that does not contain a single interlaced frame. They are going to continue to lurk in the transitions and in sequences of fast motion where ffmpeg cannot find a good match, but the occasional interlaced frame is usually going to go unnoticed in casual viewing and it is not worth the extra effort that would be required to eliminate them.
By default, the script outputs files in the FFV1 format, which is lossless. That means that the file sizes can be pretty large, usually at least five times the size of the MKV you started with. Keep that in mind when batch processing. The scripts do not currently check available disk space, although I should probably add that.
Mixed method conversion is still a work in progress. It works a lot of the time, but it doesn't work at all for some combinations. This is a very slow process - the more segments, the longer it takes. Between mapping and converting the segments, it is not unusual for the script to take three or four hours per hour of video. The more segments, the longer it will take, because ffmpeg has to read from the beginning to the segment start every time.
The script now creates a temp directory named 00DRP that contains log files and subdirectories for the output of all the intermediate steps. If you find that a video didn't complete, check the log files and you are bound to discover that one of the segments needs adjustment.
By default, the final output of the script will be both a "rejoined" MKV that contains only the video and a "merged" MKV that add the source MKV file back in. You should use the rejoined file for your TVAI upscaling. The merged file is primarily there so that you can check it and make sure that the audio still syncs and you can safely delete it. You can also configure the script to output unjoined segments, so that you can upscale them separately and rejoin them yourself, if you prefer. This might be a good idea with video that includes computer generated special effect sequences.
As far as I can tell, any content that you find on a PAL format DVD that was originally shot on film is just sped up from 24 to 25 FPS. Drp will convert this by slowing the video back down to 24 FPS. It will also output an audio file that is slowed to the same rate so that you can ensure that the audio still syncs up. Assuming that the DVD contains multiple audio and subtitle tracks, you will need to slow those down to match the video. I recommend doing this with MKVtoolnix using the stretch option. Just enter 1.042708289 for the stretch value for each track. If you don't want PAL conversion, set CONVPAL to 0 in the drp script.
I plan to improve PAL conversion in the future so that all audio and subtitles are converted automatically. I just don't have a lot of PAL DVDs, so it hasn't been a high priority.
The script requires a system that runs a bash shell (Linux/Unix/Mac or Windows with the Linux subsystem) and ffmpeg, ffprobe, and mkvtoolnix must be installed. I have only tested it on Ubuntu Linux, so there might well be minor compatibility problems with other distros or systems. I will do my best to fix problems that crop up on other platforms, but my experience with them is limited. This could probably be ported to run natively on Windows without too much trouble, since the tools that it uses are all available in Windows native versions, but I haven't run Windows in 20 years, so I am probably not the one to do it.
The two scripts, drp and smap, are just bash scripts and can be installed anywhere that is convenient. You might just keep them in your home directory. If you want them to be in path, then the /usr/local/bin directory wouldn't be a bad place to put them.
You can run them through bash, like 'bash drp' or 'bash smap', but if you want to run them directly as commands, then you will need to make them executable:
chmod +x drp
chmod +x smap
Command | Description |
---|---|
drp | Batch mode - it will try to convert every MKV file in the current directory. |
drp -d | Dry run mode - it will try to diagnose every MKV file and then exit. |
drp -f value | Override drp FIXRATE |
drp -i file | Single file mode - it will diagnose and/or convert the named file only. |
drp -m value | Override smap MINSEG. |
drp -o directory | Override drp OUTDIR. |
drp -s value | Override smap SEGSIZE. |
smap file | Single file mode - it will create a segment map for the named file. |
There are settings at the top of each script file that you can change. They are explained in the comments.
As of Dec 7, the script now creates a unique temp directory with each run, so it is possible to run more than one process of the script concurrently. The only limits to the number of processes you can run simultaneously are processor power and memory (each process that uses the mapper will consume 8GB).
You can do everything in one pass, but I don't really trust the scripts enough to do that. The whole process takes so long that, if something goes wrong in the diagnostic phase, your script will grind away for hours for no reason. So, I usually use a diagnostic pass followed by a conversion pass. I recommend spending some time working with single files before trying out batch mode, since you will learn more by working through the individual steps.
Change to the directory that contains your MKV files.
cd myvideos/someTVshow/season1
Make sure that AUTOMAP is set to 1 in the drp settings.
Do a dry run in batch mode.
drp -d
Or, if USERAMDISK is enabled in smap:
sudo drp -d
Take a look at diagreport.txt and make sure that the diagnoses seem realistic.
Do a full run in batch mode.
drp
Change to the directory that contains your MKV files.
cd myvideos/someTVshow/season1
Do a single file dry run.
drp -d episodeSxxExx.mkv
If the file is diagnosed as "Mixed Methods," run smap on it.
smap episodeSxxExx.mkv
Run drp in conversion mode.
drp episodeSxxExx.mkv
The settings are found at the top of each of the script files.
The more invasive the method used by the script, the more likely that the runtime of the video will change. Usually, it ends up slightly shorter due to dropped frames, but occasionally it will also run slightly longer, so this setting really refers to runtime change, not just loss. In most cases, the difference is small enough that it does not matter, but a big enough change will result in audio that is out of sync. Videos that change by more than ALLOWLOSS will be renamed to indicate the time difference. Also, segments that change by more than this amount may be retried with an alternate method to see if it produces a better result. This is a percentage, so the default value allows a change of 2.5 seconds per hour of video. That might seem like a lot, but that much difference is only ever likely to result on short segments, not on the full runtime of the video.
Default is 0.07.
If enabled (1), runs the segment mapper in batch mode. If disabled (0), you will need to run smap on mixed files manually. Default is enabled. Disable if you prefer to run smap and drp separately.
If you feed the script PAL video, it will slow it from 25 to 24 FPS, hopefully restoring the original runtime and audio pitch. The script will output an audio MKV with the default track stretched 4% to match up with the slower frame rate. You will need to add this track to the final MKV after you upscale. If you have alternate audio or sub tracks that need to be matched to the slower frame rate, you can use the strech function in MKVToolnix with a value of 1.042708289.
Set to 1 to enable PAL conversion or 0 to disable it. Default is enabled (1).
What to do when a file contains more than one true frame rate. That is, after applying detelecine, do we now have a mix of 24FPS video and deinterlaced 30 FPS video? If so, the script can:
0 - Do nothing. If you set this to zero, then OUTPUTSEGS should be 1.
1 - Reduce the frame rate to 24 FPS by dropping frames from 30FPS segments.
2 - Increase all segments to 120 FPS by quintupling 24 and quadrupling 30.
FIXRATE can also be overridden with the command line switch -f. So, "-f 2", for example, will trigger FIXRATE 2 without editing the settings.
The default of 1 works well for most of my library, in which 30FPS sections tend to be short, but it does drop frames to get the lower frame rate, so it is destructive. In videos that have substantial 30 FPS segments, 2 would be a better approach. 2 is always the less destructive approach, but 120 FPS videos might not play on everything. FIXRATE 2 is also still new/experimental and does not always work well for reasons that I don't understand. FIXRATE 0 will just output the segment files at their native frame rates and you can decide what to do with them.
FIXRATE 2 has the advantage that it cuts out any analysis of mixed video, so the analysis is much faster although the upscaling will be slower. It is usually the best solution for a lot of shows that were edited on video and contain CGI sequences.
More advice for FIXRATE 2: You only need it for mixed videos. Single method videos will be output at their true frame rates already. You don't always need it for mixed rate videos. A lot of mixed videos only mix soft and hard telecine and will be output most appropriately at 24 FPS. FIXRATE 2 becomes useful when there is true interlaced content mixed with telecined content. Whenever this combination occurs, it is not a bad idea to at least try FIXRATE 2 for comparison. Whenever this combination occurs and you see stutters in the output, then this should fix it. The downsides to FIXRATE 2 are that the higher frame rate produces a slightly larger file (although not linearly, since the extra frames are duplicates) and that it takes longer to upscale.
Sets the percentage threshhold above which a video is considered to be hard telecined. The default setting is 2.5. Settings between 1 and 25 are probably reasonable and might give good results with some sources.
Sets the percentage threshhold above which a video is condidered to be fully interlaced. The default setting is 50. Settings between 40 and 80 are probably reasonable and might give good results with some sources.
Specify the location of the segment mapper script. The default is the smap script that comes with drp, but you could potentially modify your own mapper to meet different requirements. All that it needs to do is accept a file name as input and output a segment map file. Default is "$(dirname "$0")/smap", which is a file named smap in the same directory as drp.
By default, the script outputs a file containing only the video track, as this seems to work better with TVAI sometimes. If you enable merging, then drp will also output a copy that is merged with the original MKV, which allows you to compare the two video tracks and make sure that the audio syncs up. OUTPUTJOIN must be enabled (1) for this to work. Default is enabled (1).
This is the directory that you want the script to use for its output and working directories. There is no longer a default setting, due to the complexities of sometimes needing to run the script with sudo. Instead, if you have not set the directory, the script will prompt you and exit.
OUTDIR can also be overridden with the command line switch -o. So, "-o /home/johndoe", for example, will use the specified directory for output.
If enabled, the script will output a rejoined output file when the mixed process was used (the input file is divided into segments for processing and then those segments are rejoined into a single MKV file for output). This can be used in combination with OUTPUTSEGS. Default is enabled (1)
If enabled, the script will output the processed segments when the mixed process was used. You might want to output the segments if you would like to upscale them separately. This can be used in combination with OUTPUTJOIN. Default is disabled (0).
RATELOCK determines how far the frame rate can diverge from the nominal rate and still be treated by the script as having that nominal frame rate. The default is 0.005, which translates, for example, into a film rate range of ~23.86 to 24.1. This allows the script to ignore a maximum of about 9.5 seconds per hour of material that is not at the predominant frame rate. The original RATELOCK setting was 0.01 and that is still a reasonable setting if you are not too particular about dropping frames from 30FPS sections that probably only occur in titles or credits. 0.002 is about the lowest/most strict that you might want to try, but the result will be running a lot of things through the much slower mixed process when they don't really need it. You could reasonably go as high as 0.05 if your goal is to ignore 30FPS titles and credits that amount to a couple minutes of runtime.
STRICT overrides RATELOCK settings with a value of 0.0005, which will cause the script to use the mixed method even when there is a very small amount of material at a different frame rate. For example, it is not unusual for a soft telecined program to have a bumper that is hard telecined. We can save a lot of time by just treating the whole video like it is soft telecined because these few seconds of video at the beginning or end of the program are unlikely to suffer if some frames are dropped, because there is probably little to no motion. Once in a while, you might find that the script breaks something that you want to preserve, in which case you can use strict mode. STRICT mode allows less than 1 second per hour of material at a different frame rate and this amount is more likely to just be variance in the frame rate measurement than the actual presence of a segment with a different rate. Default is disabled (0). STRICT can be set with the command line switch -x.
Set to 1 to enable logging or 0 to disable it. Logs are written to 00DRP/LOGS. Default is enabled (1).
If enabled (1), then the memory filesystem will not be unmounted at the end of the run. This is helpful if you are troubleshooting and want to look at the working files. Default is disabled (0).
Segments shorter than MINSEG will be ignored. Very short segments may be false positives or inconsequential parts like black screen transitions that will lose nothing as a result of dropped frames. There is no right answer when it comes to this setting. 0.5 is the shortest practical setting and should catch just about everything, but I've found that 3 is a better default. You might think that the smallest possible value would give the best result, but the transitions between frame rates are not always clean, which means that very short segments are the most likely to be misdiagnosed. It almost always does less harm for transitions to get processed with the leading or following segment. I do occasionally run across content that needs a lower MINSEG though. If you find that you just can't get a good audio sync, try adjusting this to 2 or 1.
MINSEG can also be overridden with the drp command line switch -m. So, "-m 1", for example, will trigger MINSEG with a value of 1 second without editing the settings.
This is the directory that you want the script to use for its output and working directories. Default is your home directory (~).
How finely should the script slice the video for analysis? The default time of 0.1 effectively instructs ffmpeg to make the smallest possible segments. Usually, the smallest that ffmpeg will actually produce is about 1/3 second. Small values will catch brief, but real, transitions at the cost of significantly increased processing time. Whether those short segments exist in the content you are analyzing is uncertain and whether or not you care about potentially missing a few interlaced frames is a matter of preference. If your video has no short segments, you can increase the value for faster mapping.
SEGSIZE can also be overridden with the command line switch -s. So, "-s 1", for example, will trigger SEGSIZE with a value of 1 second without editing the settings.
If enabled (1), which is now the default, the script will create an 8GB tmpfs memory filesystem for the mapper to use. This will consume 8GB of RAM for the duration that the script runs. Because this script writes out thousands of files per analysis, using a memory filesystem seems like a good idea. Writing lots of small files over and over might accelerate disk wear. If you have less than 16GB of RAM, the RAMDISK will be automatically disabled, regardless of the setting.
If the script crashes or you break out of it, you will need to manually unmount the filesystem, which you can do with the following command:
sudo umount ~/00DRP/*/segments
(This command assumes that you have not changed OUTDIR.)
I wrote the script based on what I discovered when analyzing my collection of TV shows on DVD. I analyzed a few hundred movies as well, but movies tend to be very predictable and are almost never mixed. I figured that I might as well report what I found with each of these series, since it might help others and might also give you clues about where to start with shows that are similar.
This is a pretty easy one. Most episodes are mixed, but it is mostly just that the titles and credits that are hard telecined while the main feature is soft telecined. All seven seasons ran successfully with default settings: SEGSIZE 0.1, MINSEG 3. Season 7 was only available as a PAL set when I bought it, and that also worked well.
The first episode is hard telecined, but the rest are all mixed. Episodes vary in complexity, with some of the more special effects heavy episodes having more than thirty segments. It ran successfully with default settings: SEGSIZE 0.1, MINSEG 3, with the exception of episode 11 which required a MINSEG of 1.
This one is mostly mixed episodes, with a few hard telecined. Because most of them have interlaced content joined to hard telecine, which the script cannot distinguish, it is best to run these at FIXRATE 2 (120FPS) to preserve normal motion.
PAL format, non-interlaced. These ran just fine with the default settings and PAL conversion.
PAL DVD sets. These are all progressive, 25FPS. HTC% ranges from 0 to 0.15, INTL% ranges from 0.18 to 2.03. They process successfully with default settings.
Seasons 1-8 are hard telecined, except for episodes that contain the original promotional segments and bumpers, which are mixed, but don't really suffer from just being run as hard telecine. HTC% ranges from 8.01 to 20.36, INTL% ranges from 81.49 to 87.88. All ran successfully with SEGSIZE 0.1, MINSEG 3.
Seasons 1 and 2 are soft telecined and run successfully with default settings. HTC% ranges from 0 to 0.09 and INTL% ranges from 0 to 1.01.
All episodes are hard telecined and process successfully with default settings. HTC% ranges from 3.43 to 11.47 and INTL% ranges from 81.98 to 85.65.
Episodes 1-17 are soft telecined. Unfortunately, 18-26 are improperly deinterlaced hard telecine with the associated motion defects. The soft telecined episodes run just fine and the badly deinterlaced episodes are properly detected and improved with the baddtc method (which uses mpdecimate to eliminate duplicates), but don't come out quite as nicely as the soft telecined episodes, since you generally don't get perfection with duplicate detection. HTC% ranges from 0 to 0.01 and INTL% ranges from 0.02 to 1.95.
Season 1 is soft telecined, with HTC% ranging from 0 to 0.1 and an INTL% ranging from 0.01 to 0.07. Season 2 is hard telecined, with HTC% ranging from 28.03 to 32.58 and INTL% from 87.33 to 88.8. They all run successfully with default settings.
All four seasons are soft telecined, with HTC% ranging from 0 to 0.03 and INTL% from 0 to 0.76. They all run successfully with default settings.
Season 2 is soft telecined, with HTC% ranging from 0 to 0.1 and INTL% from 0.21 to 0.35. They run successfully with default settings.
All but episode 12 is hard telecined, with HTC% ranging from 8.95 to 20.30 and INTL% from 83.36 to 88.33. Episode 12 is fully interlaced (0.52/98.44). They all run successfully with default settings.
PAL DVD set. Progressive, 25FPS. HTC% 0 to 0.11, INTL% 0.06 to 0.27. Ran successfully with default settings.
PAL DVD set. Interlaced, 25FPS. HTC% 0.15 to 0.22, INTL% 65.76 to 94.39. Ran successfully with default settings.
All hard telecined with HTC% ranging from 7.21 to 22.26 and INTL% 82.35 to 87.95. Ran successfully with default settings.
Season 1 is fully interlaced with the exception of episode 18, which is hard telecined (11.41/82.15). HTC% ranges from 0.87 to 1.23 and INTL% ranges from 91.20 to 94.7. Ran successfully with default settings.
All hard telecined with HTC% ranging from 14.02 to 25.63 and INTL% from 83.16 to 88.16. Ran successfully with default settings.
All seasons either soft or hard telecined. Soft telecined seasons: 1, 2 (except ep 7), 13 (except ep 21) 15, 16, 17, 18, 19, 20. Hard telecined seasons: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14 (except ep 12). HTC% ranges from 7.38 to 26.14 and INTL% from 82.13 to 89.75 for hard telecined episodes. HTC% ranges from 0 to 0.13 and INTL% from 0 to 0.69 for soft telecined episodes. All ran successfully with default settings. Some season 2 episodes have an interlaced CBS bumper at the end that slightly increases the calculated frame rate, but within the default settings for 24FPS.
Seasons 1-3 combine soft and hard telecined, mixed, and even a couple fully interlaced episodes. Seasons 4-6 are hard telecined.
Hard telecined all the way through. Ran successfully with default settings.
Episodes are mixed. While many can be processed at FIXRATE 1, most are at least a little better at FIXRATE 2, so that it what I recommend.
Episodes are mixed. Some can be processed at FIXRATE 1, but most require FIXRATE 2, so it is probably best to use that for all of them.
These episodes are mostly hard telecined with interlaced credits, but a few are mixed to a crazy degree, with hundreds of frame rate transitions. Episode 1&2 has 323 segments and took more than a day to process! The best and certainly easiest approch here is to just run them at FIXRATE 2 if you can accept 120FPS.
Hard telecined all the way through. Ran successfully with default settings.
Mostly hard telecined, with a few mixed. Ran successfully with default settings: SEGSIZE 0.1, MINSEG 3.