Extract Audio from VOB to DSS — Free Online Tool

Extract audio from DVD VOB files and convert it to DSS (Digital Speech Standard) format, transcoding the AC3/Dolby Digital or other DVD audio streams into the ADPCM IMA OKI codec used by Olympus and Philips dictation devices. This is a niche but precise conversion for workflows that require speech-optimized, low-bitrate DSS audio derived from DVD source material.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

VOB files from DVD discs typically carry AC3 (Dolby Digital) audio multiplexed alongside MPEG-2 video, subtitles, and menu data. This tool strips the video stream entirely using the -vn flag and re-encodes only the audio into DSS format using the adpcm_ima_oki codec — an Adaptive Differential Pulse Code Modulation variant developed for low-bitrate digital dictation. Unlike a simple remux, this is a full audio transcode: the AC3 audio is decoded from its compressed Dolby format and re-encoded into the proprietary OKI ADPCM codec at a fixed low bitrate, heavily optimized for intelligibility of speech rather than fidelity to music or surround sound. The resulting DSS file is extremely compact and compatible with Olympus, Philips, and Grundig dictation hardware and software.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg command-line tool, which handles all demuxing, decoding, re-encoding, and muxing. In the browser version, this runs via FFmpeg.wasm compiled to WebAssembly.
-i input.vob Specifies the input DVD VOB file. FFmpeg will parse the MPEG-2 program stream container, identifying the multiplexed video, AC3 audio, and subtitle streams embedded in the VOB structure.
-vn Disables video output entirely, instructing FFmpeg to ignore the MPEG-2 video stream from the VOB. Since the target DSS format is audio-only, this ensures no video processing overhead occurs and no video data is passed to the output muxer.
-c:a adpcm_ima_oki Sets the audio encoder to adpcm_ima_oki, the ADPCM variant developed by OKI Semiconductor that underpins the DSS dictation format. This fully re-encodes the decoded DVD audio (typically from AC3) into the fixed-bitrate, mono, 8kHz speech-optimized codec required by the DSS container.
output.dss Defines the output filename with the .dss extension, which signals FFmpeg to use the DSS muxer. The resulting file is a Digital Speech Standard file compatible with Olympus and Philips dictation hardware and transcription software.

Common Use Cases

  • Extracting spoken commentary or narration from a DVD documentary to archive it as a DSS dictation file for transcription in digital voice recorder software
  • Converting DVD lecture recordings or conference presentations on VOB-formatted discs into DSS files for import into dictation transcription workflows like Olympus Dictation Management System
  • Archiving the audio track of a training DVD into a compact DSS file for playback on a Philips or Olympus digital dictation device without needing a DVD player
  • Extracting a single-language audio track from a multi-track VOB file to produce a compact speech-focused DSS file for legal or medical transcription purposes
  • Stripping the audio from a DVD-recorded deposition or court proceeding stored in VOB format for ingestion into a transcription service that accepts DSS files

Frequently Asked Questions

Yes, significant quality reduction is expected and by design. VOB files typically carry AC3 audio at 192–448 kbps, often in stereo or 5.1 surround. DSS uses the adpcm_ima_oki codec at a very low fixed bitrate, mono, and is specifically engineered for speech intelligibility rather than music or full-bandwidth audio. The conversion will sound acceptable for spoken word content but will noticeably degrade music, sound effects, and any surround channels, which will be downmixed to mono.
The DSS format using the adpcm_ima_oki codec operates at a fixed bitrate and sample rate — there are no adjustable quality parameters exposed by FFmpeg for this codec. Unlike MP3 or AAC where you can target higher bitrates, DSS is a rigid dictation standard with a fixed encoding profile. The output bitrate and quality are determined entirely by the codec specification, not by user-configurable flags.
The adpcm_ima_oki codec used by DSS supports only mono audio. FFmpeg will automatically downmix the multichannel AC3 surround audio to mono during the transcode. This means all surround channels — including the dedicated center, surround, and LFE channels — are blended together into a single mono track. For speech-heavy content like narration or dialogue, the center channel's dominance means the result is still intelligible.
Yes. VOB files from DVDs commonly contain multiple audio streams for different languages. You can select a specific track by adding -map 0:a:1 (for the second audio track, zero-indexed) to the FFmpeg command before the output filename, for example: ffmpeg -i input.vob -vn -map 0:a:1 -c:a adpcm_ima_oki output.dss. Use ffprobe on your VOB file to list all available audio streams and identify the index of the language track you want.
FFmpeg processes one input at a time, but you can batch convert using a shell loop. On Linux or macOS: for f in *.vob; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.vob}.dss"; done. On Windows Command Prompt: for %f in (*.vob) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss". This is especially useful for processing a full set of VOB files ripped from a multi-chapter DVD disc.
No. DSS supports only a single mono audio stream and has no concept of chapters, subtitles, or multiple tracks. The -vn flag explicitly disables video, and the adpcm_ima_oki codec outputs a single audio stream. Subtitle streams embedded in the VOB are ignored entirely. DVD menu audio, if present on a separate stream, will also be excluded unless you explicitly map it with -map.

Technical Notes

VOB is a DVD-Video container that wraps MPEG-2 program stream data, and FFmpeg requires the -f vob flag in some contexts to correctly identify the container type, though auto-detection usually works for files with the .vob extension. The default audio codec in VOB is AC3 (Dolby Digital), though some DVDs use DTS, LPCM, or even MP2. Regardless of the source audio codec in the VOB, FFmpeg will fully decode it before re-encoding to adpcm_ima_oki — this is never a remux. The adpcm_ima_oki codec used in DSS was developed by OKI Semiconductor and is based on IMA ADPCM, modified for low-bitrate telephony and dictation applications. It operates at 8000 Hz sample rate, mono, at approximately 32 kbps — a dramatic reduction from a typical DVD AC3 stream. DSS files produced by FFmpeg should be compatible with Olympus DSS Player and similar transcription software, though some proprietary DSS variants (DSS Pro) may have additional header metadata requirements not handled by FFmpeg's implementation. There is no lossless path between VOB audio and DSS — both the source and output are lossy, so this represents a generation loss transcode.

Related Tools