Extract Audio from FLV to DSS — Free Online Tool

Extract audio from FLV video files and convert it to DSS format using the ADPCM IMA OKI codec — a proprietary compressed audio format designed for digital dictation devices by Olympus, Philips, and Grundig. This is especially useful when you need to feed Flash Video speech content into professional dictation workflows or transcription hardware that expects DSS input.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

During this conversion, FFmpeg strips the video stream from the FLV container entirely and re-encodes the audio — whether it was originally AAC or MP3 — into the ADPCM IMA OKI codec used by the DSS format. ADPCM IMA OKI is a highly specialized adaptive delta pulse-code modulation variant optimized for low-bitrate speech capture, so the encoder will downsample and compress the audio aggressively. This is a full audio transcode, not a copy or remux: the original FLV audio codec is decoded to raw PCM first, then encoded fresh into DSS. Because DSS has no configurable quality parameters in FFmpeg, the encoder applies its fixed speech-optimized settings automatically.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg tool — in this browser-based context it runs as FFmpeg.wasm compiled to WebAssembly, executing entirely within your browser without sending your FLV file to any server.
-i input.flv Specifies the input FLV file. FFmpeg reads the Flash Video container and demuxes its streams — typically an H.264 or FLV video stream and an AAC or MP3 audio stream — for processing.
-vn Disables video output entirely. Since DSS is a pure audio format with no video support, this flag tells FFmpeg to ignore the FLV's video stream and produce an audio-only output.
-c:a adpcm_ima_oki Selects the ADPCM IMA OKI encoder for the audio stream — the specific codec required by the DSS format. This triggers a full transcode of the FLV's original audio (AAC or MP3) into the low-bitrate, speech-optimized ADPCM IMA OKI format, including automatic resampling to 8 kHz mono.
output.dss Defines the output filename with the .dss extension, which tells FFmpeg to wrap the encoded ADPCM IMA OKI audio in a DSS container — the proprietary Digital Speech Standard format used by Olympus, Philips, and Grundig dictation devices and software.

Common Use Cases

  • Converting a recorded Flash-based webinar or online meeting saved as FLV into DSS so it can be loaded into Olympus or Philips dictation management software for professional transcription.
  • Feeding archived FLV news broadcast audio or spoken-word interview content into a DSS-compatible transcription pedal workflow used in legal or medical settings.
  • Migrating legacy Flash video dictations or voice memos — common in older enterprise systems that recorded meetings in FLV — into DSS for long-term storage in dictation archives.
  • Extracting speech audio from an FLV tutorial or e-learning recording and converting it to DSS to test compatibility with hardware dictation players before committing to a batch workflow.
  • Stripping and converting the spoken commentary track from a Flash-era screencast into DSS format for review using a foot-pedal transcription workstation.

Frequently Asked Questions

Yes, expect a noticeable reduction in audio quality. FLV files typically carry AAC or MP3 audio at bitrates of 128k or higher with a wide frequency range. DSS using the ADPCM IMA OKI codec operates at a very low bitrate and is engineered specifically for speech intelligibility rather than music or full-range audio. The result will sound narrow and compressed — entirely appropriate for voice transcription but unsuitable for music or high-fidelity content.
No — DSS with the ADPCM IMA OKI codec in FFmpeg does not expose quality or bitrate parameters. The codec operates at a fixed, speech-optimized configuration and ignores flags like -b:a. This is by design: DSS is a dictation format with standardized technical constraints defined by its hardware manufacturers (Olympus, Philips, Grundig), not a general-purpose audio format with adjustable fidelity.
The ADPCM IMA OKI codec used in DSS requires a specific sample rate (typically 8000 Hz), which is far lower than the 44100 Hz or 48000 Hz audio commonly found in FLV files. FFmpeg will automatically resample the FLV audio down to the required rate during encoding. You do not need to add a -ar flag manually — FFmpeg resolves this constraint from the codec's requirements.
DSS is a proprietary dictation format primarily supported by Olympus, Philips, and Grundig hardware and their associated desktop software (e.g., Olympus DSS Player, Philips SpeechExec). VLC has partial DSS support and may play the file, but mainstream players like Windows Media Player generally do not recognize DSS. This format is purpose-built for professional transcription environments, not general media playback.
You can adapt the displayed command into a shell loop. On Linux or macOS, use: for f in *.flv; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.flv}.dss"; done. On Windows Command Prompt, use: for %f in (*.flv) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss". This processes each FLV in the current directory and outputs a matching DSS file. The browser-based tool processes one file at a time, so the FFmpeg command is particularly valuable for bulk jobs.
DSS has extremely limited metadata support compared to modern container formats, and FFmpeg's DSS muxer does not map FLV metadata tags (such as title, artist, or creation date) into the DSS output. Any metadata embedded in the original FLV will be lost during this conversion. If metadata preservation is critical to your workflow, document the FLV file's metadata separately before converting.

Technical Notes

FLV containers most commonly carry AAC audio (when paired with H.264 video) or MP3 audio (when using the older FLV video codec), both of which are lossy codecs. Converting either to ADPCM IMA OKI for DSS output involves a full decode-and-re-encode pipeline with significant lossy generation loss — this is unavoidable given how different these codecs are in their design goals. ADPCM IMA OKI is an 8-bit adaptive DPCM variant originally developed for Oki Semiconductor voice chips; it is optimized for telephone-quality speech and imposes a fixed 8 kHz sample rate and mono channel output. FFmpeg will automatically convert stereo FLV audio to mono and resample to 8 kHz as part of the encoding process. The DSS container itself lacks support for chapters, multiple audio tracks, subtitles, or embedded cover art. Because DSS is a proprietary format, third-party tool support is limited — verify that your target transcription software or device supports DSS files produced by FFmpeg before processing large batches, as minor implementation differences in the DSS spec can affect compatibility.

Related Tools