Extract Audio from MTS to DSS — Free Online Tool

Extract audio from AVCHD camcorder footage (.mts) and convert it to Digital Speech Standard (.dss) format using the ADPCM IMA OKI codec — a dictation-focused, speech-optimized format compatible with Olympus, Philips, and Grundig digital voice recorders. This is a highly specific conversion that strips H.264 video entirely and re-encodes the AC-3 or AAC audio stream into the low-bitrate DSS container.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

MTS files are AVCHD recordings that bundle H.264 video with AC-3 or AAC audio inside an MPEG-2 Transport Stream container. During this conversion, the video stream is completely discarded using the -vn flag — no video decoding or re-encoding occurs. The audio stream (typically AC-3 or AAC from the camcorder) is decoded and then re-encoded using the ADPCM IMA OKI codec, which is the proprietary lossy compression scheme at the heart of the DSS format. DSS was designed specifically for speech at low bitrates, so the codec aggressively compresses audio in ways optimized for voice intelligibility rather than music or ambient sound fidelity. The resulting file is a compact DSS audio file suitable for digital dictation devices and transcription software.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg tool — in this browser-based tool, this runs entirely via FFmpeg.wasm inside WebAssembly without any server involvement. If running locally on your desktop, this calls your system-installed FFmpeg binary.
-i input.mts Specifies the input AVCHD file in MTS format. FFmpeg reads the MPEG-2 Transport Stream container and identifies the H.264 video stream and the AC-3 or AAC audio stream inside it for processing.
-vn Disables video output entirely, discarding the H.264 video stream from the MTS file without decoding it. This is required because DSS is a pure audio container and cannot hold any video data.
-c:a adpcm_ima_oki Encodes the audio stream using the ADPCM IMA OKI codec, which is the proprietary compression algorithm at the core of the DSS format developed for Olympus, Philips, and Grundig digital dictation devices. The original AC-3 or AAC audio from the camcorder is fully decoded and re-encoded through this speech-optimized, low-bitrate codec.
output.dss Defines the output filename with the .dss extension, which tells FFmpeg to write the result into a Digital Speech Standard container. The DSS container wraps the ADPCM IMA OKI audio in the format expected by Olympus and Philips dictation hardware and transcription software.

Common Use Cases

  • Transcribing recorded interviews or press conferences filmed with a Sony or Panasonic AVCHD camcorder by converting the audio into DSS format for import into Olympus or Philips dictation and transcription software
  • Archiving spoken-word recordings from a camcorder into a legacy DSS-based workflow used by legal, medical, or corporate transcription departments
  • Extracting dictated notes or verbal memos that were captured incidentally on a camcorder and need to be sent to a DSS-compatible voice recorder or playback device
  • Converting field interview audio from AVCHD footage into DSS for use with foot-pedal-controlled transcription stations that only accept DSS or DS2 files
  • Reducing storage footprint of speech-only camcorder recordings — such as deposition video audio — by stripping the video and re-encoding only the voice content in the highly compressed DSS format
  • Generating DSS audio files from AVCHD source recordings for compatibility with older Olympus DSS Player or Grundig DigtaSoft transcription software suites

Frequently Asked Questions

Yes, and this is expected given DSS's design purpose. The ADPCM IMA OKI codec used in DSS files is optimized for speech intelligibility at very low bitrates — it discards a great deal of audio detail that matters for music or ambient sound. If your MTS recording contains spoken dialogue, interviews, or dictation, the result should remain intelligible. However, if your camcorder footage includes music, complex ambient sound, or high-fidelity audio, the DSS output will sound noticeably compressed and degraded.
DSS uses a fixed codec — ADPCM IMA OKI — with no configurable bitrate or quality parameter exposed through FFmpeg. The format was designed as a closed, proprietary standard for digital dictation devices by Olympus, Philips, and Grundig, and its compression parameters are baked into the specification. Unlike converting MTS audio to MP3 or AAC where you can choose a target bitrate, DSS output is always encoded at the same fixed low-bitrate speech-grade quality.
DSS is a proprietary format primarily designed for Olympus, Philips, and Grundig digital voice recorders and their associated PC software (such as Olympus DSS Player or Philips SpeechExec). Most general-purpose media players like VLC do not natively support DSS playback without plugins. If your goal is simply to extract and listen to audio from an MTS file, a more universally compatible format like MP3 or AAC would be a better choice — DSS makes sense only when you specifically need DSS compatibility for transcription workflows.
No. The DSS container format has very limited metadata support compared to the MPEG-2 Transport Stream used by AVCHD camcorders. Metadata such as recording timestamps, GPS coordinates, or camera model information embedded in the MTS file will not be carried over into the DSS output. The DSS format's metadata fields are designed around dictation-specific tags (such as author and priority level) rather than camcorder or broadcast recording metadata.
The -vn flag instructs FFmpeg to exclude all video streams from the output — it stands for 'no video.' This is essential here because DSS is a purely audio container and cannot hold a video stream; FFmpeg would error out or produce a corrupted file if you attempted to include video. Since your MTS file contains H.264 video that has no place in a DSS file, -vn ensures only the audio is processed and written to the output.
The command as shown processes a single file, but you can adapt it for batch processing using a shell loop. On Linux or macOS, you can run: for f in *.mts; do ffmpeg -i "$f" -vn -c:a adpcm_ima_oki "${f%.mts}.dss"; done. On Windows Command Prompt, use: for %f in (*.mts) do ffmpeg -i "%f" -vn -c:a adpcm_ima_oki "%~nf.dss". This will convert every MTS file in the current directory to a corresponding DSS file while preserving the original filenames.

Technical Notes

MTS files produced by Sony and Panasonic AVCHD camcorders typically carry AC-3 (Dolby Digital) or AAC audio, often recorded at 48kHz stereo. The DSS format's ADPCM IMA OKI codec is a variant of Adaptive Differential Pulse-Code Modulation tuned for narrow-band speech — it operates at a much lower sample rate and collapses stereo to mono, meaning any stereo field from the original camcorder recording is lost. This conversion is inherently very lossy in terms of audio fidelity: it decodes the original AC-3 or AAC stream and re-encodes it through a codec that prioritizes speech compression over accuracy. FFmpeg's support for DSS output via the adpcm_ima_oki codec is functional but reflects the closed nature of the format — there are no tunable parameters. Notably, DSS does not support multiple audio tracks, subtitles, or chapters, all of which MTS can theoretically contain. If your MTS file has multiple audio tracks (e.g., a second language track), only the first audio stream will be selected by FFmpeg by default unless you explicitly specify a different stream with the -map flag.

Related Tools