Extract Audio from SWF to DSS — Free Online Tool

Extract audio from SWF Flash files and convert it to DSS format using the ADPCM IMA OKI codec — the same compressed speech format used by Olympus and Philips digital dictation devices. This tool strips the video stream entirely and re-encodes the audio track (typically MP3 or AAC from the SWF) into the low-bitrate, speech-optimized DSS container, all processed locally in your browser.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

SWF files typically carry audio encoded as MP3 (libmp3lame) or AAC, embedded alongside Flash vector animation or video content. During this conversion, FFmpeg discards the entire video stream and re-encodes the audio into DSS format using the ADPCM IMA OKI codec — a low-bitrate adaptive delta pulse-code modulation algorithm optimized for speech intelligibility rather than music fidelity. Because ADPCM IMA OKI operates at a fixed, narrow sample rate and bit depth suited to dictation devices, the output will be significantly smaller than the source audio and optimized for voice content. This is a full transcode, not a remux — the audio is decoded from its SWF-embedded codec and re-encoded from scratch into the DSS container structure.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg binary — in this browser-based tool, this runs as a WebAssembly (FFmpeg.wasm) instance entirely client-side, with no server involvement.
-i input.swf Specifies the input SWF file. FFmpeg parses the SWF container and identifies the embedded streams, which typically include a video track (FLV1 or MJPEG) and an audio track (MP3 via libmp3lame or AAC).
-vn Disables video output entirely, telling FFmpeg to ignore the Flash video stream (FLV1 or MJPEG) in the SWF. Since DSS is a pure audio format with no video support, this flag is required to prevent an error.
-c:a adpcm_ima_oki Selects the ADPCM IMA OKI encoder for the audio stream — the codec required by the DSS format. This re-encodes the SWF's embedded MP3 or AAC audio from scratch into the low-bitrate, 8 kHz, mono format that DSS dictation devices expect.
output.dss Defines the output filename and triggers FFmpeg to use the DSS muxer based on the .dss extension, wrapping the ADPCM IMA OKI encoded audio into the Digital Speech Standard container structure.

Common Use Cases

  • Transferring narration or voiceover audio from a legacy Flash e-learning module into a digital dictation workflow for transcription on Olympus or Philips hardware
  • Extracting spoken commentary from an archived SWF presentation and converting it to DSS so it can be imported into dictation transcription software that only accepts DSS input
  • Recovering voice memo content embedded in an old SWF-based web application and converting it to a format compatible with DSS-capable transcription foot pedals
  • Archiving the audio track from Flash-based training simulations into a compact, speech-focused DSS file for long-term storage of spoken instructions
  • Preparing Flash animation dialogue or voice acting recordings for review on a DSS-compatible portable dictation player

Frequently Asked Questions

Yes, and this is expected given the nature of DSS. The ADPCM IMA OKI codec used in DSS is specifically engineered for speech intelligibility at very low bitrates, not for music or high-fidelity audio. If your SWF contains music, sound effects, or high-quality narration encoded as 128k MP3 or AAC, the output DSS file will sound noticeably compressed and narrow. DSS is a practical choice only when the source audio is spoken voice content and the destination is a dictation or transcription workflow.
The DSS format with the ADPCM IMA OKI codec does not expose configurable bitrate or quality parameters through FFmpeg — the codec operates at a fixed encoding configuration defined by the DSS specification. Unlike the SWF input side (where you could control audio bitrate at 64k–256k), the DSS output is encoded at a predetermined quality level. This is by design: DSS was created as a closed, hardware-targeted format for Olympus, Philips, and Grundig dictation devices with no user-adjustable fidelity options.
Yes. On Linux or macOS you can use a shell loop: `for f in *.swf; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.swf}.dss"; done`. On Windows Command Prompt, use: `for %f in (*.swf) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss"`. Each SWF will have its video discarded and audio re-encoded to a separate DSS file. This is especially useful when archiving a collection of Flash e-learning modules or presentations.
FFmpeg will return an error and produce no output. SWF files are not required to contain audio — many Flash animations use only vector graphics and visual content. If you attempt the conversion on an audio-free SWF, FFmpeg will report that no audio stream was found after the `-vn` flag discards the video. You can check whether a SWF has audio beforehand by running `ffprobe input.swf` and looking for an audio stream in the output.
DSS is generally not recommended for archival purposes because it is a proprietary, lossy, speech-optimized format with limited software support outside of professional transcription ecosystems. For archival, a lossless format like FLAC or a broadly compatible lossy format like MP3 would be far more suitable. DSS makes sense specifically when the extracted audio needs to be imported into dictation hardware or transcription software (such as Philips SpeechExec or Olympus Sonority) that requires DSS as its native input format.
The ADPCM IMA OKI codec used in DSS operates at 8000 Hz (8 kHz), which is significantly lower than the 44100 Hz or 48000 Hz sample rates typical of MP3 or AAC audio embedded in SWF files. FFmpeg will automatically resample the SWF audio down to 8 kHz during the transcode. This downsampling is a major contributor to the audible quality reduction — frequencies above 4 kHz (including most consonant clarity in speech and all musical content) will be lost. The result is a telephone-quality audio file suited only for voice intelligibility.

Technical Notes

The DSS container is a proprietary binary format co-developed by Olympus, Philips, and Grundig and is not an open standard. FFmpeg's DSS muxer and ADPCM IMA OKI encoder provide write support, but the format's internal structure (including header metadata fields used by dictation devices, such as author ID, work type, and priority flags) will not be populated from the SWF source since SWF carries no equivalent metadata. The resulting DSS file will be technically valid but may lack device-specific metadata fields that dictation hardware expects. Additionally, the ADPCM IMA OKI codec is inherently mono — if the SWF audio contains stereo channels (common with music or mixed soundscapes), FFmpeg will downmix to mono during encoding, collapsing the stereo field entirely. There are no audio quality flags available for the DSS output side of this command, so the only lever for influencing output characteristics is the source audio quality going in. Files processed here stay entirely in your browser via FFmpeg.wasm; nothing is sent to a server.

Related Tools