Convert VOB to DSS — Free Online Tool

Convert VOB files from DVD-Video discs to DSS (Digital Speech Standard) format, extracting and re-encoding the audio track using ADPCM IMA OKI — the proprietary codec used in Olympus and Philips digital dictation devices. This is a highly specialized conversion that strips MPEG-2 video and AC-3 surround audio down to a mono, speech-optimized compressed audio stream.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

VOB files contain multiplexed MPEG-2 video, AC-3 (Dolby Digital) or other audio streams, subtitle tracks, and DVD navigation data. During this conversion, the video stream, subtitle streams, and all additional audio tracks are discarded entirely. The primary audio track — typically AC-3 at 192–448 kbps — is decoded and then re-encoded using the ADPCM IMA OKI codec, a low-bitrate adaptive delta PCM algorithm designed specifically for speech intelligibility on dictation hardware. The resulting DSS file contains only this heavily compressed mono audio, optimized for voice recordings rather than music or cinematic audio.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg media processing tool. In the browser-based version of this tool, FFmpeg runs locally via WebAssembly (ffmpeg.wasm) — your VOB file is never uploaded to a server.
-i input.vob Specifies the input VOB file. FFmpeg will parse the VOB container and identify all internal streams — typically an MPEG-2 video stream, one or more AC-3 audio streams, and DVD subtitle (VobSub) streams — making them available for processing or discarding.
-c:a adpcm_ima_oki Sets the audio codec to ADPCM IMA OKI, the proprietary low-bitrate speech codec required by the DSS container format. This encoder decodes the source AC-3 audio from the VOB, downmixes it to mono, resamples as needed, and applies adaptive delta PCM compression optimized for voice intelligibility on dictation devices.
output.dss Defines the output filename and container format. The .dss extension tells FFmpeg to use the Digital Speech Standard container, which is audio-only — the MPEG-2 video stream, subtitle streams, and any secondary audio tracks from the VOB are automatically excluded from the output.

Common Use Cases

  • Archiving the spoken-word audio from a DVD documentary or lecture recording into a format compatible with Olympus DS-series or Philips DPM-series digital dictation recorders for field playback
  • Converting the audio commentary track from a DVD VOB file into DSS for transcription workflows that use dictation software expecting DSS input files
  • Extracting interview or dialogue audio from a DVD production master in VOB format and converting to DSS for import into transcription services that require the DSS container
  • Repurposing spoken narration from educational DVD content into DSS files for use on legacy dictation hardware used in medical or legal transcription offices
  • Stripping the audio from a VOB-based DVD training video and converting to DSS to feed into voice-recognition or dictation management systems that accept only DSS format

Frequently Asked Questions

Quality will be significantly reduced. AC-3 audio in VOB files typically runs at 192–448 kbps and supports full-range stereo or surround sound, while the ADPCM IMA OKI codec used in DSS is designed exclusively for speech at very low bitrates. Music, cinematic sound design, and surround audio will all degrade substantially. DSS is purpose-built for voice intelligibility — spoken dialogue from a DVD will remain understandable, but the format is a poor fit for anything other than speech content.
No. DSS is an audio-only format with no support for video, subtitles, chapters, or multiple audio tracks. All of these streams present in the VOB are silently dropped during conversion. Only the first (primary) audio stream from the VOB will be encoded into the DSS output. If your VOB contains multiple audio tracks — for example, different language dubs — only the default track will be captured unless you explicitly specify an alternate stream with '-map' in a custom FFmpeg command.
VOB files are large because they contain MPEG-2 video (which accounts for the majority of the file size), AC-3 audio, and DVD overhead data. The DSS output contains only the re-encoded audio using the extremely low-bitrate ADPCM IMA OKI codec. Even if your VOB is several gigabytes, the DSS output will typically be only a few megabytes, since the codec is optimized for compact speech storage on dictation devices with limited flash memory.
Yes, but you will need to modify the FFmpeg command manually. VOB files often contain multiple audio streams for different languages or commentary. Add a '-map 0:a:1' flag (changing the index number to select the desired track) before the output filename to select a specific audio stream. For example: 'ffmpeg -i input.vob -map 0:a:1 -c:a adpcm_ima_oki output.dss' would select the second audio track. You can identify available audio streams by running 'ffmpeg -i input.vob' and inspecting the stream list.
The single-file command shown here can be adapted for batch processing in a shell script. On Linux or macOS, use: 'for f in *.vob; do ffmpeg -i "$f" -c:a adpcm_ima_oki "${f%.vob}.dss"; done'. On Windows Command Prompt, use: 'for %f in (*.vob) do ffmpeg -i "%f" -c:a adpcm_ima_oki "%~nf.dss"'. This is especially useful when digitizing multiple DVD chapters, which are often split across several VOB files named VTS_01_1.vob, VTS_01_2.vob, etc.
Yes, if your DVD content spans multiple VOB files (which is common — a single movie may be split across VTS_01_1.vob through VTS_01_5.vob), you should concatenate them first or use FFmpeg's concat demuxer. A simple approach is: 'ffmpeg -i "concat:VTS_01_1.vob|VTS_01_2.vob|VTS_01_3.vob" -c:a adpcm_ima_oki output.dss'. This ensures the full audio is captured in sequence rather than converting only the first segment of the film or program.

Technical Notes

The ADPCM IMA OKI codec used in DSS is a proprietary variant of adaptive delta pulse-code modulation developed for the Olympus/Philips/Grundig dictation ecosystem. It operates at a fixed, very low bitrate optimized for mono speech, and FFmpeg's implementation does not expose bitrate or quality parameters for this codec — the command has no '-b:a' or '-q:a' option available, making quality adjustment impossible without switching codecs (which would break DSS compatibility). The source AC-3 audio from the VOB is first fully decoded to PCM, then the sample rate is resampled to match DSS requirements, and then ADPCM IMA OKI encoding is applied. Stereo or surround AC-3 tracks will be downmixed to mono in this process. There is no metadata preservation — VOB/DVD metadata (title, language tags, chapter names) has no mapping in the DSS container spec, so all such information is lost. DSS files produced by FFmpeg may have limited compatibility with some dictation hardware or proprietary DSS software suites that enforce strict conformance to the Olympus DSS Pro or DSS SP sub-specifications, as FFmpeg targets a general implementation of the format.

Related Tools