Extract Audio from DV to AMR — Free Online Tool
Extract audio from DV camcorder footage and convert it to AMR format, transcoding the PCM 16-bit stereo audio captured on tape or file into a low-bitrate, speech-optimized AMR stream using the libopencore_amrnb codec. Ideal for archiving voice content from DV recordings in a compact, mobile-friendly format.
to
FFmpeg Command
Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg
Drop your DV file here
or click to browse
Free — no uploads, no signups. Your files never leave your browser.
Settings
Note: Browser-based encoding uses approximate quality targets. For precise CRF compression, copy the FFmpeg command above and run it on your desktop.
Estimated output:
Conversion Complete!
DownloadHow It Works
DV files store audio as uncompressed PCM (pcm_s16le) — 16-bit little-endian stereo at either 48kHz or 32kHz, recorded alongside the DV-compressed video. During this conversion, the video stream is completely discarded (-vn), and the raw PCM audio is decoded and then re-encoded from scratch into AMR-NB (Adaptive Multi-Rate Narrowband) using the libopencore_amrnb codec at 12,200 bps. AMR-NB operates at 8kHz mono, so the original stereo PCM audio is downmixed to mono and resampled from the DV source sample rate down to 8kHz. This is a lossy transcode — the high-fidelity PCM data is heavily compressed and bandwidth-limited to the narrowband telephone range (300–3400 Hz), which is optimized for speech intelligibility rather than full-spectrum audio fidelity.
What Each Flag Does
| Flag | What it does |
|---|---|
ffmpeg
|
Invokes the FFmpeg multimedia processing tool. In the browser-based version of this tool, FFmpeg runs locally via WebAssembly (FFmpeg.wasm) — no data leaves your machine. |
-i input.dv
|
Specifies the input DV file. FFmpeg detects the DV container and demuxes both the dvvideo stream and the pcm_s16le audio stream; only the audio will be used in this conversion. |
-vn
|
Disables video output entirely, discarding the dvvideo stream from the DV file. Since AMR is a pure audio format, the video data is simply ignored rather than transcoded. |
-c:a libopencore_amrnb
|
Selects the libopencore_amrnb encoder to produce AMR Narrowband audio, transcoding the raw PCM audio extracted from the DV file into the compressed, speech-optimized AMR-NB codec used in mobile telephony. |
-b:a 12200
|
Sets the AMR-NB bitrate to 12,200 bits per second, which is the highest quality mode available in the AMR-NB standard and produces the most intelligible speech output from the DV source audio. |
output.amr
|
Defines the output filename with the .amr extension, which triggers FFmpeg to write the AMR storage format (RFC 4867 single-channel AMR file) containing the encoded narrowband audio extracted from the DV footage. |
Common Use Cases
- Archiving spoken-word interviews or oral histories recorded on DV camcorders into compact AMR files for storage on mobile devices or legacy voice systems
- Extracting narration or voiceover audio from DV footage to share with mobile apps or telephony systems that natively support AMR playback
- Repurposing voice commentary from DV news or documentary footage into AMR clips compatible with older Nokia or feature-phone media players
- Converting DV-recorded field interviews into AMR format for integration with voice messaging platforms or IVR (Interactive Voice Response) systems that require AMR input
- Reducing the storage footprint of large DV audio archives where only the spoken dialogue needs to be preserved, not full broadcast-quality PCM audio
- Preparing speech content from DV camcorder recordings for upload to mobile voice annotation or transcription tools that accept AMR-NB files
Frequently Asked Questions
Yes, significantly — this is one of the most dramatic quality reductions possible in audio conversion. DV stores audio as uncompressed PCM at 48kHz stereo, capturing the full audible frequency range. AMR-NB at 12,200 bps is a narrowband codec designed for telephone speech, limited to 8kHz mono with a frequency range of roughly 300–3400 Hz. Music, ambient sound, and anything beyond the telephone voice band will be heavily degraded or lost entirely. For speech content, intelligibility is preserved well; for music or environmental audio from DV footage, the result will sound muffled and lo-fi.
AMR-NB (the narrowband variant used here via libopencore_amrnb) is inherently a mono codec — it was designed for single-channel mobile telephony, not stereo playback. FFmpeg automatically downmixes the stereo PCM channels from the DV source to mono during the transcode. If your DV footage had distinct left/right audio channels (e.g., dual-mic interview setups), both channels will be blended into a single mono stream in the output AMR file.
Yes. AMR-NB supports eight standardized bitrate modes: 4750, 5150, 5900, 6700, 7400, 7950, 10200, and 12200 bps. The default used here is 12200 bps (the highest quality AMR-NB mode). To use a lower bitrate, change the -b:a flag in the FFmpeg command — for example, -b:a 4750 for the most compressed mode. Lower bitrates will reduce file size but introduce more speech artifacts. For reference, a 12200 bps AMR file is already extremely compact compared to the PCM audio in a DV file.
No. DV files can contain timecode, tape metadata, and scene information embedded in the DV stream, but AMR is a bare audio container with no meaningful metadata support. None of the DV-specific metadata — including recording date, timecode, camera model, or tape name — is preserved in the output AMR file. If metadata preservation is important, consider extracting the audio to a container like WAV or MP4/AAC that supports embedded tags before archiving.
On Linux or macOS, you can use a shell loop: for f in *.dv; do ffmpeg -i "$f" -vn -c:a libopencore_amrnb -b:a 12200 "${f%.dv}.amr"; done. On Windows Command Prompt, use: for %f in (*.dv) do ffmpeg -i "%f" -vn -c:a libopencore_amrnb -b:a 12200 "%~nf.amr". This is especially useful for digitized DV tape archives where you may have dozens of individual DV files to process. Note that the browser-based tool on this page processes one file at a time.
They are different codecs. libopencore_amrnb encodes AMR-NB (Narrowband), which samples at 8kHz and is limited to telephone-quality speech. AMR-WB (Wideband), available via libopencore_amrwb, samples at 16kHz and covers a wider frequency range (50–7000 Hz), producing noticeably better voice quality. Since DV audio is recorded at 48kHz, AMR-WB will preserve more of the original vocal clarity than AMR-NB. If your target device or platform supports AMR-WB (.awb files), it's a better choice for voice content extracted from DV footage. The command would use -c:a libopencore_amrwb and a compatible bitrate like -b:a 23850.
Technical Notes
DV audio is recorded at either 48kHz/16-bit stereo (the standard broadcast mode) or 32kHz/12-bit four-channel mode, depending on the camera. FFmpeg reads both variants correctly from .dv files or DV-in-MKV/AVI containers. The conversion pipeline involves three transformations: stereo-to-mono downmix, sample rate conversion from 48kHz (or 32kHz) down to 8kHz, and lossy encoding via libopencore_amrnb. The 8kHz output is a fundamental constraint of AMR-NB, not a configurable option. Because AMR-NB was standardized for GSM telephony, the output files are compatible with most mobile phones and voice platforms but are inappropriate for any audio where fidelity beyond the speech band matters. The libopencore_amrnb encoder in FFmpeg is derived from the OpenCORE framework and produces standards-compliant AMR files. Note that FFmpeg must be compiled with libopencore-amrnb support — this is included in most binary distributions but not always in minimal builds. The output .amr file uses the single-channel AMR storage format (IETF RFC 4867), which is a simple container with a magic number header followed by AMR frames.