Extract Audio from WTV to DSS — Free Online Tool

Extract audio from Windows Media Center TV recordings (.wtv) and convert it to DSS format using the ADPCM IMA OKI codec — ideal for archiving spoken broadcast content to a format compatible with digital dictation workflows. This tool strips the video stream entirely and encodes only the audio, running entirely in your browser with no file uploads required.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

WTV files recorded by Windows Vista Media Center typically contain H.264 video and AAC or MP3 audio captured from broadcast TV. This conversion discards the video stream entirely using the -vn flag, then re-encodes the audio track using the ADPCM IMA OKI codec into a DSS container. ADPCM IMA OKI is a low-bitrate adaptive delta PCM codec originally designed for Olympus digital dictation hardware, so the encoder applies fixed compression optimized for speech intelligibility rather than music or high-fidelity audio. The DSS format does not support configurable bitrate options or multiple audio tracks, so only the first audio stream from the WTV file is used, and no quality parameters are adjustable — the codec encodes at its fixed specification.

What Each Flag Does

Flag	What it does
`ffmpeg`	Invokes the FFmpeg application — this is the same open-source multimedia engine running under the hood in your browser via WebAssembly (FFmpeg.wasm). You can run this exact command locally on any desktop with FFmpeg installed to process files larger than 1GB.
`-i input.wtv`	Specifies the input Windows Television file recorded by Windows Media Center. FFmpeg reads the WTV container and demuxes its streams, which typically include an H.264 video track and an AAC or MP3 audio track from the broadcast recording.
`-vn`	Disables video output entirely, discarding the H.264 video stream from the WTV file. Since DSS is a pure audio format with no video support, this flag is essential — without it, FFmpeg would attempt to encode video and fail because the DSS container cannot hold a video stream.
`-c:a adpcm_ima_oki`	Specifies the ADPCM IMA OKI audio codec for encoding the output, which is the only codec supported by the DSS format. This codec re-encodes the broadcast audio (decoded from AAC or MP3) using the low-bitrate adaptive delta PCM algorithm designed for Olympus digital dictation hardware.
`output.dss`	Defines the output filename and triggers FFmpeg to use the Digital Speech Standard container format. The .dss extension tells FFmpeg to write a DSS file compatible with Olympus, Philips, and Grundig digital dictation devices and their associated transcription software.

Common Use Cases

Archiving the audio from a recorded news broadcast or talk radio show captured via Windows Media Center into DSS format for use with Olympus or Philips digital dictation transcription software.
Extracting spoken-word content from a recorded TV documentary or interview program into a compact DSS file for playback on legacy digital voice recorders that only read DSS.
Converting Windows Media Center DVR recordings of court proceedings, lectures, or hearings that were broadcast on public access TV into DSS files for legal transcription workflows.
Stripping audio from old WTV recordings stored on a Windows Vista or Windows 7 Media Center PC to create lightweight DSS files without requiring the original broadcast equipment.
Preparing extracted broadcast speech audio in DSS format for import into dictation management software such as Philips SpeechExec or Olympus Dictation Management System.

Frequently Asked Questions

Expect a significant reduction in audio fidelity. WTV files typically store audio as AAC or MP3 at broadcast quality (often 128–192 kbps), whereas the DSS format uses ADPCM IMA OKI encoding, which is a low-bitrate codec designed specifically for speech intelligibility on dictation devices. Music, sound effects, or high-frequency content from broadcast TV will sound noticeably degraded, but spoken dialogue and voice content remains reasonably intelligible — which is the intended use case for DSS.

DSS is a single-track audio format designed for dictation devices and does not support multiple audio streams. FFmpeg will default to mapping the first detected audio stream from the WTV file. If your recording contains separate audio tracks (such as alternate language broadcasts or secondary audio programming), only the primary track is included. If you need a different track, you can modify the command by adding -map 0:a:1 (or the appropriate track index) before the output filename.

WTV files can store rich broadcast metadata including program titles, channel information, and recording timestamps. However, DSS is a minimalist format originally designed for voice recorder hardware and has very limited metadata support. FFmpeg will attempt to map basic tags, but most WTV-specific broadcast metadata fields will not survive the conversion. If preserving metadata is important, consider extracting it from the WTV file separately before converting.

No — the DSS format with the ADPCM IMA OKI codec does not expose configurable bitrate or quality parameters in FFmpeg. The codec encodes at a fixed specification by design, reflecting its origins as a hardware dictation standard where interoperability between devices was prioritized over flexibility. Unlike converting to formats such as MP3 or AAC, you cannot pass a -b:a flag to change the output quality for DSS files.

The displayed command processes a single file, but you can batch convert on the command line using a shell loop. On Linux or macOS, run: for f in *.wtv; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.wtv}.dss"; done. On Windows Command Prompt, use: for %f in (*.wtv) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss". The browser-based tool processes one file at a time, so use the local FFmpeg command for bulk conversions.

DSS playback support in mainstream media players is inconsistent. VLC can open many DSS files, but Windows Media Player does not natively support the format. Dedicated software from Olympus or Philips — such as Olympus DSS Player or Philips SpeechExec — provides the most reliable playback. The DSS format is primarily a professional dictation standard rather than a general consumer audio format, so compatibility outside that ecosystem is limited.

Technical Notes

The ADPCM IMA OKI codec used in DSS files is a variant of Adaptive Differential Pulse Code Modulation developed specifically for the Olympus/Philips/Grundig dictation hardware ecosystem. It operates at a fixed low sample rate (typically 8 kHz mono), which makes it highly efficient for speech but entirely unsuitable for music or broadcast audio with stereo content. When converting from WTV, the AAC or MP3 audio stream in the source file must be fully decoded and then re-encoded — this is a transcoding operation, not a remux, meaning two generations of lossy compression are applied (the original broadcast encoding plus the ADPCM IMA OKI encoding). Any stereo content in the WTV file will be downmixed to mono during encoding since DSS is inherently a mono format. Subtitles embedded in the WTV file are dropped entirely, as DSS has no subtitle support. The output file size will typically be very small compared to the source WTV file, reflecting both the elimination of the video stream and the low-bitrate nature of the DSS codec.

Related Tools

Extract Audio WTV to WEBA Extract Audio WTV to FLAC Extract Audio WTV to VOC Extract Audio MP4 to DSS Extract Audio MOV to DSS Extract Audio MOD to DSS Convert WTV to MP4 Compress WTV

JavaScript Required

This tool processes files entirely in your browser using WebAssembly, which requires JavaScript. Please enable JavaScript to use this tool.

Alternatively, copy this FFmpeg command to run the conversion on your desktop:

ffmpeg -i input.wtv -vn -c:a adpcm_ima_oki output.dss