Extract Audio from MPEG to DSS — Free Online Tool

Extract audio from MPEG video files and convert it to DSS (Digital Speech Standard) format using the ADPCM IMA OKI codec — the native codec used by Olympus, Philips, and Grundig digital dictation devices. This tool is ideal for repurposing MPEG-encoded speech recordings into a format compatible with professional dictation workflows.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

During this conversion, FFmpeg strips the video stream entirely from the MPEG file and re-encodes the audio — which is typically MP2, MP3, or AAC in an MPEG container — into ADPCM IMA OKI, the proprietary codec used by the Digital Speech Standard. DSS is a fixed, device-oriented format with no configurable bitrate or quality parameters; the codec operates at a low bitrate optimized specifically for speech intelligibility rather than music or high-fidelity audio. Because the source audio codec (MP2/AAC) and the target codec (ADPCM IMA OKI) are entirely different, a full decode-and-re-encode pass is required — this is not a simple remux.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg tool. In the browser-based version of this tool, FFmpeg runs locally via WebAssembly (FFmpeg.wasm) — no data leaves your device. When running locally on your desktop, this calls your system-installed FFmpeg binary.
-i input.mpeg Specifies the input MPEG file. FFmpeg will detect whether it contains MPEG-1 or MPEG-2 video and identify the audio codec (typically MP2, but possibly MP3 or AAC depending on the MPEG variant) before processing.
-vn Disables video output entirely, stripping the MPEG-1 or MPEG-2 video stream from the output. Since DSS is a speech-only audio format with no video capability, this flag ensures only the audio track is processed and written to the DSS file.
-c:a adpcm_ima_oki Instructs FFmpeg to encode the audio using the ADPCM IMA OKI codec, which is the native audio codec of the Digital Speech Standard format. This codec re-encodes the MPEG source audio (decoded from MP2 or AAC) into the low-bitrate, speech-optimized ADPCM format required by DSS-compatible dictation devices and software.
output.dss Defines the output file with the .dss extension, which tells FFmpeg to use the DSS muxer and produce a file compatible with Olympus, Philips, and Grundig dictation hardware and software. The container wraps the ADPCM IMA OKI audio stream in the proprietary DSS file structure.

Common Use Cases

  • Converting MPEG recordings of meetings, lectures, or interviews into DSS files for transcription using Olympus or Philips dictation software
  • Archiving broadcast MPEG news segments as DSS audio for import into professional transcription workflows used in journalism
  • Repurposing MPEG-recorded depositions or legal proceedings into DSS format compatible with legal transcription services
  • Extracting speech from DVD-compatible MPEG content to create dictation files for voice recognition pipelines that accept DSS input
  • Converting legacy MPEG audio from broadcast archives into DSS for ingestion into digital dictation management systems used in healthcare
  • Preparing MPEG-sourced spoken-word content for playback on Grundig Digta or Olympus DS-series handheld dictation recorders

Frequently Asked Questions

Yes, expect a significant reduction in audio fidelity. DSS using ADPCM IMA OKI is engineered specifically for speech at very low bitrates, discarding much of the frequency range that music or ambient sound relies on. If your MPEG file contains speech, intelligibility is generally preserved well; however, background music, sound effects, or high-quality stereo audio will sound noticeably degraded or muffled. DSS is not intended for high-fidelity audio reproduction.
DSS with ADPCM IMA OKI is a mono format optimized for single-channel speech recording. If your MPEG file contains stereo audio (common with MP2 or AAC tracks in broadcast MPEG), FFmpeg will downmix it to mono during the conversion. This is expected behavior for dictation-grade formats and is not a bug — DSS devices record and play back in mono by design.
No. The DSS format with ADPCM IMA OKI codec does not expose configurable bitrate or quality parameters in FFmpeg. The encoder operates at a fixed internal rate suited for speech, which is why no -b:a or -q:a flags appear in the command. Unlike converting MPEG audio to MP3 where you can choose 128k or 320k, DSS output quality is entirely determined by the codec's fixed specification.
The MPEG video stream — whether MPEG-1 or MPEG-2 encoded — is completely discarded. The -vn flag explicitly tells FFmpeg to ignore all video streams and produce an audio-only output. The resulting DSS file contains no video data whatsoever, which is appropriate since DSS is a pure audio dictation format with no video container capability.
You can adapt the displayed command into a shell loop for batch processing. On Linux or macOS, use: for f in *.mpeg; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.mpeg}.dss"; done. On Windows Command Prompt, use: for %f in (*.mpeg) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss". This is particularly useful for processing large archives of MPEG recordings — the browser-based tool handles files up to 1GB individually, but the desktop FFmpeg command has no such limit.
Metadata preservation is very limited in this conversion. MPEG containers can carry basic metadata tags, but the DSS format's metadata support is minimal and device-specific, focused on dictation-related fields like author ID and work type rather than general media tags. FFmpeg will attempt to map any compatible metadata, but most standard tags from the MPEG file — such as title, artist, or comment — will likely be absent or silently dropped in the output DSS file.

Technical Notes

The DSS format is a proprietary standard jointly developed by Olympus, Philips, and Grundig, and its use of ADPCM IMA OKI (a variant of Adaptive Differential Pulse-Code Modulation) reflects its origins as a hardware-first dictation codec rather than a general-purpose audio format. FFmpeg's support for DSS writing is limited — not all builds include full DSS muxer support, so if the command fails, verify your FFmpeg version includes the DSS muxer and adpcm_ima_oki encoder by running ffmpeg -formats | grep dss and ffmpeg -codecs | grep adpcm_ima_oki. The source MPEG audio — typically MP2 at 192kbps in broadcast contexts — must go through a full decode-to-PCM and re-encode pipeline before being written as ADPCM IMA OKI, so this is computationally more involved than a simple stream copy. Sample rate compatibility is also worth noting: DSS typically operates at 8000 Hz (8kHz), so FFmpeg will resample the MPEG audio (often at 44.1kHz or 48kHz) down to 8kHz, which is the primary technical reason for the audible quality reduction. This resampling is automatic and necessary for the output file to be valid DSS.

Related Tools