Extract Audio from DV to M4B — Free Online Tool

Extract audio from DV camcorder footage and save it as an M4B audiobook or podcast file, transcoding the native PCM 16-bit audio from the DV stream into AAC at 128kbps inside an MPEG-4 container with chapter and bookmarking support. Ideal for archiving recorded spoken-word DV content — lectures, interviews, or narrations — into a portable, playback-friendly format.

FFmpeg Command

Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg

Free — no uploads, no signups. Your files never leave your browser.

Estimated output:

Conversion Complete!

Download

How It Works

DV files store audio as uncompressed PCM 16-bit stereo (pcm_s16le) at either 48kHz or 32kHz, locked tightly to the video stream inside the DV container. This tool strips the video stream entirely using the -vn flag and transcodes only the raw PCM audio into AAC, which is the native codec for M4B. Because DV's PCM audio is uncompressed, the transcoder is working from a lossless source — there is no generation loss from decompressing a lossy codec before re-encoding. The resulting M4B file is wrapped in an MPEG-4 container with the +faststart flag applied, which moves the file's metadata index to the beginning of the file so it can begin playing before fully downloading — a useful property for podcast feeds or streaming audiobook servers.

What Each Flag Does

Flag What it does
ffmpeg Invokes the FFmpeg binary. When run in the browser, this is executed via FFmpeg.wasm, a WebAssembly port that processes your DV file entirely locally without any server upload.
-i input.dv Specifies the input DV file. FFmpeg reads the DV container and demuxes it into its component streams — the dvvideo video stream and the pcm_s16le audio stream — making each available for further processing.
-vn Disables video output entirely, telling FFmpeg to ignore the dvvideo stream and produce an audio-only output. This is essential for creating an M4B audiobook file and also dramatically reduces processing time since the large DV video data is never decoded or re-encoded.
-c:a aac Specifies AAC (Advanced Audio Coding) as the audio codec for the output. AAC is the required codec for M4B files and is natively supported by Apple devices, iPhone audiobook apps, and all major podcast players — making it the correct choice for this audiobook-oriented container.
-b:a 128k Sets the AAC audio bitrate to 128 kilobits per second. At this bitrate, AAC produces audio quality that is transparent for speech and voice recordings typical of DV camcorder content, while keeping the output file size very compact compared to the original DV source.
-movflags +faststart Moves the MPEG-4 container's metadata index (the moov atom) from the end of the file to the beginning after encoding completes. This is important for M4B files distributed via podcast feeds or streaming servers, as it allows playback to begin before the entire file has downloaded.
output.m4b Defines the output filename with the .m4b extension, which signals to both FFmpeg and media players that this MPEG-4 audio file is intended for audiobook or podcast use, enabling chapter navigation and per-file bookmarking on compatible players like Apple Books and Overcast.

Common Use Cases

  • Archiving a lecture or keynote recorded on a MiniDV camcorder into an M4B podcast episode that listeners can resume mid-playback using bookmarking
  • Converting DV tape captures of radio broadcasts or voice interviews into a chapter-friendly audiobook format for distribution via podcast feeds
  • Extracting the spoken narration track from a DV documentary or home video to create a standalone audio memoir or family history audiobook
  • Repurposing DV footage of a sermon, workshop, or conference session into an M4B file compatible with Apple Podcasts and iPhone audiobook players
  • Stripping the audio from DV training videos recorded in a professional broadcast environment to produce compact AAC-encoded modules playable on any MPEG-4 device
  • Reducing the file size of a long DV recording — which can exceed several gigabytes per hour — to a small M4B audio file for easier distribution and storage

Frequently Asked Questions

DV stores audio as uncompressed PCM 16-bit, so the source going into the AAC encoder is already at full fidelity — there is no prior lossy decoding step to introduce generation loss. The AAC encoding itself does introduce lossy compression, but at the default 128kbps bitrate AAC is transparent for most speech content, which is the typical use case for M4B. If you need higher fidelity for music recorded on DV, you can raise the bitrate to 192k or 256k using the -b:a flag.
M4B natively supports chapters as part of the MPEG-4 container specification, which is one of the main reasons the format is favored for audiobooks and podcasts. However, DV files do not carry chapter metadata, so the extracted M4B will not automatically have chapters defined. You can add chapters to the output file separately using FFmpeg's chapter metadata injection or a tool like mp4chaps after the initial extraction.
A standard DV file runs at approximately 25 Mbps for video plus uncompressed PCM audio, which results in very large files — roughly 13GB per hour of footage. By stripping the video stream entirely and encoding only the audio as AAC at 128kbps, the output M4B uses only about 57MB per hour of audio. This roughly 200:1 reduction in file size is expected and is a direct result of removing the video data and applying lossy audio compression.
DV camcorders record audio at either 48kHz (the standard broadcast rate, used in most professional DV recordings) or 32kHz (used in some consumer camcorders recording four-channel audio). FFmpeg will detect the source sample rate automatically and the AAC encoder will default to matching it, though AAC commonly outputs at 44.1kHz or 48kHz. If your source was 32kHz, FFmpeg may resample it during encoding — this is normal and will not produce artifacts perceptible in speech recordings.
The -b:a 128k flag controls the AAC audio bitrate. Replace 128k with any of the supported values — 64k, 96k, 192k, 256k, or 320k — depending on your quality and file size requirements. For voice-only content like lectures or podcasts, 96k AAC is often indistinguishable from 128k. For music or high-fidelity archival, 192k or 256k is recommended. The full modified command would look like: ffmpeg -i input.dv -vn -c:a aac -b:a 192k -movflags +faststart output.m4b
Yes. On Linux or macOS you can loop over all DV files in a directory with a shell one-liner: for f in *.dv; do ffmpeg -i "$f" -vn -c:a aac -b:a 128k -movflags +faststart "${f%.dv}.m4b"; done. On Windows Command Prompt, use: for %f in (*.dv) do ffmpeg -i "%f" -vn -c:a aac -b:a 128k -movflags +faststart "%~nf.m4b". The browser-based tool processes one file at a time, so the FFmpeg command is especially useful for bulk archival of large DV tape capture libraries.

Technical Notes

DV's audio is stored as pcm_s16le (signed 16-bit little-endian PCM), which is a lossless, byte-aligned format with no perceptual compression — making it an ideal clean source for AAC transcoding. The AAC codec used here is FFmpeg's native AAC encoder (not libfdk_aac), which produces compliant MPEG-4 AAC-LC output compatible with Apple devices, Android, and all M4B-capable audiobook players. The -movflags +faststart flag is particularly important for M4B files that may be served over HTTP in a podcast feed, as it relocates the moov atom to the file header, enabling progressive playback before full download. M4B files support ID3-style metadata tags (title, author, album, cover art) which can be embedded post-conversion but are not automatically carried over from the DV source, since DV containers carry minimal embedded metadata. Note that DV files recorded in four-channel 32kHz mode (some consumer camcorders) may require explicit channel mapping if only two channels are desired in the output — by default FFmpeg will mix down or pass through as available.

Related Tools