Extract Audio from RMVB to DSS — Free Online Tool

Extract audio from RMVB video files and convert it to DSS format using the ADPCM IMA OKI codec — a speech-optimized, low-bitrate format originally designed for Olympus and Philips digital dictation devices. This tool runs entirely in your browser via FFmpeg.wasm, so no files leave your machine.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

RMVB (RealMedia Variable Bitrate) files typically carry AAC or MP3 audio streams embedded in RealNetworks' proprietary container. During this conversion, FFmpeg discards the video stream entirely and decodes the audio track, then re-encodes it using the ADPCM IMA OKI codec — a low-complexity adaptive PCM variant used natively in the DSS (Digital Speech Standard) format. Because ADPCM IMA OKI is highly specialized for narrow-band speech at very low bitrates and fixed sample rates, the audio is fully re-encoded rather than remuxed. The result is a compact DSS file suited for dictation playback devices and speech transcription workflows.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg tool. In this browser-based tool, this runs as FFmpeg.wasm compiled to WebAssembly — the same command works identically with a native FFmpeg installation on your desktop.
-i input.rmvb Specifies the input RMVB file. FFmpeg uses its RealMedia demuxer to read the variable-bitrate container and expose the embedded video and audio streams for processing.
-vn Disables video output entirely, discarding the libx264 video stream from the RMVB container. Since DSS is a pure audio format, no video can or should be written to the output.
-c:a adpcm_ima_oki Encodes the audio using the ADPCM IMA OKI codec, which is the native codec of the DSS Digital Speech Standard format. FFmpeg decodes the source AAC or MP3 audio from the RMVB and re-encodes it through this low-bitrate speech codec, automatically resampling to the required 8000 Hz mono format.
output.dss Sets the output filename with the .dss extension, which tells FFmpeg to use the DSS muxer to write the encoded ADPCM IMA OKI audio into a properly structured Digital Speech Standard file compatible with Olympus and Philips dictation devices and software.

Common Use Cases

  • Extracting spoken commentary or narration from an RMVB lecture recording to load onto an Olympus or Philips digital dictation device for playback
  • Converting RMVB-recorded interviews or verbal notes into DSS format for ingestion into professional transcription software that natively reads DSS files
  • Archiving the speech-only audio from an RMVB documentary or news clip in a compact, speech-optimized DSS file to save storage on older dictation hardware
  • Preparing RMVB-sourced voice content for legal or medical dictation workflows where DSS is the required format for compatibility with case management systems
  • Stripping the audio from an RMVB training video to produce a small DSS file for offline review on a portable digital voice recorder during commutes

Frequently Asked Questions

Yes, some quality loss should be expected. The ADPCM IMA OKI codec used in DSS files is engineered specifically for narrow-band speech intelligibility at very low bitrates, not for music or wide-frequency audio. If the RMVB source contains music, ambient sound, or high-frequency content, those elements will be heavily compressed or lost. For spoken voice, intelligibility is generally preserved, but the output will sound noticeably more compressed than the original AAC or MP3 stream.
DSS is a fixed-specification proprietary format tied to dictation hardware standards set by Olympus, Philips, and Grundig. The ADPCM IMA OKI codec operates at a fixed encoding structure with no user-configurable bitrate or quality parameters — unlike AAC or MP3, which offer flexible bitrate targeting. This means FFmpeg cannot expose quality options for the DSS output; the codec always encodes at its built-in specification regardless of the source audio's bitrate.
Yes. On Linux or macOS you can run a shell loop: for f in *.rmvb; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.rmvb}.dss"; done. On Windows Command Prompt you can use: for %f in (*.rmvb) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss". This is especially practical for converting large batches of RMVB lecture or meeting recordings into DSS files for a dictation archive.
Generally no. The DSS format has very limited metadata support tied to its dictation-device origins — it stores fields like author and work type defined by the dictation standard, not the generic title, artist, or comment tags that might exist in an RMVB container. FFmpeg will not automatically map RealMedia metadata into DSS dictation fields, so the output file will typically contain no meaningful metadata from the source.
The -vn flag in the FFmpeg command explicitly discards the video stream. The libx264-encoded video inside the RMVB container is never decoded or written to disk — only the audio track is processed. This keeps the conversion fast and produces a pure audio DSS file with no video component, which is the correct output for the DSS format since it does not support video.
The DSS files produced here use the ADPCM IMA OKI codec, which is compatible with the DSS standard supported by Olympus digital voice recorders, Philips SpeechMike devices, and Grundig Digta recorders. Transcription software such as Olympus DSS Player, Philips SpeechExec, and many third-party transcription platforms also read this format. However, DSS compatibility is not universal — general media players like VLC may have partial or no support, so these files are best used within dedicated dictation or transcription workflows.

Technical Notes

RMVB files store audio in either AAC or MP3 streams inside RealNetworks' proprietary variable-bitrate container — a format once widely used for compressed video distribution in Asia and on early peer-to-peer networks. FFmpeg's RMVB demuxer handles the container reliably, but the format's variable bitrate structure and RealMedia-specific packetization can occasionally cause timestamp irregularities on poorly muxed files. The DSS output uses the ADPCM IMA OKI codec, which is a 4-bit ADPCM variant running at a fixed 8000 Hz sample rate and mono channel — parameters mandated by the DSS specification. If your RMVB audio track is stereo or at a higher sample rate (e.g., 44100 Hz or 48000 Hz), FFmpeg will automatically down-mix to mono and resample to 8000 Hz as part of the encoding pipeline. This resampling is the primary source of quality reduction. There are no audio bitrate flags applicable to the DSS output since the codec's parameters are fixed by the format specification. Files larger than 1GB can be processed using the displayed FFmpeg command locally on your desktop with a native FFmpeg installation, which will also be significantly faster than the browser-based WebAssembly implementation for large files.

Related Tools