Extract Audio from WTV to M4B — Free Online Tool
Extract audio from WTV recorded TV files and save it as M4B, an MPEG-4 audiobook format with chapter and bookmarking support. The audio is encoded as AAC inside an M4B container, making it ideal for turning recorded broadcasts, lectures, or talk radio captures into portable, bookmark-able audio files.
to
FFmpeg Command
Copy this command to run the same conversion locally with FFmpeg on your desktop. Download FFmpeg
Drop your WTV file here
or click to browse
Free — no uploads, no signups. Your files never leave your browser.
Settings
Note: Browser-based encoding uses approximate quality targets. For precise CRF compression, copy the FFmpeg command above and run it on your desktop.
Estimated output:
Conversion Complete!
DownloadHow It Works
WTV files recorded by Windows Media Center typically contain an H.264 video stream alongside an AAC or MP3 audio track. This tool discards the video entirely and extracts only the audio, re-encoding it as AAC at 128k bitrate and wrapping it in an M4B (MPEG-4) container. Because WTV commonly stores audio as AAC already, the re-encoding step is lightweight, but it is still a full transcode rather than a stream copy to ensure strict M4B container compliance. The -movflags +faststart flag reorganizes the MP4 metadata to the front of the file, which allows the M4B to begin playback before it is fully loaded — particularly useful for large recorded broadcasts.
What Each Flag Does
| Flag | What it does |
|---|---|
ffmpeg
|
Invokes the FFmpeg multimedia processing tool. In the browser-based version of this tool, this runs as FFmpeg.wasm compiled to WebAssembly, executing entirely within your browser with no server upload. |
-i input.wtv
|
Specifies the input file — a WTV recording created by Windows Media Center. FFmpeg uses its WTV demuxer to parse the proprietary RIFF-based container and expose the embedded video, audio, and broadcast metadata streams. |
-vn
|
Disables video output entirely, discarding the H.264 or MPEG-2 video stream from the WTV file. This is the core 'extract audio' operation — M4B is a pure audio container and cannot hold video, so this flag is required for the output to be valid. |
-c:a aac
|
Encodes the output audio as AAC using FFmpeg's built-in AAC encoder. AAC is the native and only practical codec for M4B files, ensuring compatibility with Apple devices, iTunes, and audiobook players that rely on the format. |
-b:a 128k
|
Sets the AAC audio bitrate to 128 kilobits per second, the default used by this tool. For speech-heavy WTV recordings like documentaries, news, or talk shows, 128k AAC provides transparent quality at a reasonable file size. |
-movflags +faststart
|
Moves the MPEG-4 'moov' metadata atom to the beginning of the M4B file after encoding. This is a required special flag for M4B output that enables streaming playback and ensures audiobook players can begin playing or seeking through the file without reading the entire thing first — particularly important for long broadcast recordings. |
output.m4b
|
The output filename with the .m4b extension, which tells Apple devices, iTunes, and podcast/audiobook apps to treat the file as long-form audio with bookmarking support, remembering your playback position between listening sessions. |
Common Use Cases
- Turn a recorded TV documentary or educational broadcast saved by Windows Media Center into an M4B audiobook you can listen to on an iPhone or iPod with chapter navigation intact
- Convert a recorded radio talk show or lecture captured via Windows Vista/7 Media Center into a podcast-style M4B file that supports bookmarking so you can resume where you left off
- Strip the audio from a multi-hour recorded broadcast — such as a news marathon or live event — into an M4B so you can listen while commuting without needing the video
- Archive the audio commentary or narration from a recorded TV program in a compact AAC-encoded M4B file, discarding the large video stream to save storage space
- Prepare a recorded TV interview or panel discussion for distribution as an audiobook or podcast episode by extracting the audio into the M4B format supported by Apple Books and iTunes
- Extract audio from legacy WTV recordings made years ago on Windows Media Center machines before the format becomes harder to play on modern systems
Frequently Asked Questions
Because WTV files can store audio as AAC and M4B also uses AAC, you might expect a lossless stream copy — but this tool performs a full re-encode to ensure the output is a properly formed M4B container. Re-encoding from one lossy AAC stream to another does introduce a small amount of generation loss. At the default 128k bitrate the result is transparent for speech-heavy content like documentaries or talk shows. If you recorded a music broadcast and want the highest fidelity, increase the bitrate to 192k or 256k in the FFmpeg command.
WTV files embed broadcast metadata such as program title, episode description, and channel information, but this metadata is stored in a proprietary Windows Media Center format that FFmpeg does not automatically map to M4B chapter or ID3 tags during this conversion. The M4B container itself fully supports chapters and bookmarking, but you would need to add chapter markers manually after conversion using a tool like mp4chaps or Chapter and Verse. Generic title and artist tags from the WTV file may be partially preserved depending on how they were written at recording time.
Both .m4a and .m4b are AAC audio inside an MPEG-4 container — the difference is purely in the file extension and how software interprets the file's purpose. The .m4b extension signals to Apple devices, iTunes, and podcast players that the file is an audiobook or long-form audio, enabling bookmarking (the player remembers your playback position) and chapter navigation. For recorded TV content that runs 30 minutes to several hours, the bookmarking behavior of .m4b is a significant practical advantage over .m4a.
Replace the -b:a 128k portion of the command with your desired bitrate. For example, use -b:a 64k for smaller files when audio quality is less critical (such as speech-only recordings), -b:a 192k for better fidelity on music-heavy broadcasts, or -b:a 320k for the highest quality AAC output. The full adjusted command would look like: ffmpeg -i input.wtv -vn -c:a aac -b:a 192k -movflags +faststart output.m4b. Keep in mind that AAC at 128k is already considered transparent for most spoken-word content.
By default FFmpeg selects the first audio stream it finds in the WTV file, which is typically the primary program audio. WTV supports multiple audio tracks because Windows Media Center was designed for digital broadcast recordings that often include secondary audio programs (SAP) or audio description services. To extract a specific track, add -map 0:a:1 to the command to select the second audio stream (zero-indexed), for example: ffmpeg -i input.wtv -vn -map 0:a:1 -c:a aac -b:a 128k -movflags +faststart output.m4b.
Yes. On Windows, you can run this in a batch script: for %f in (*.wtv) do ffmpeg -i "%f" -vn -c:a aac -b:a 128k -movflags +faststart "%~nf.m4b". On macOS or Linux, use: for f in *.wtv; do ffmpeg -i "$f" -vn -c:a aac -b:a 128k -movflags +faststart "${f%.wtv}.m4b"; done. This is particularly useful if you have a large archive of Windows Media Center recordings you want to convert to a more portable audio format, especially since the browser-based tool has a 1GB file size limit.
Technical Notes
WTV (Windows Television) is a container format introduced with Windows Vista Media Center and uses a modified RIFF-based structure that wraps MPEG-2, H.264, AAC, and MP3 streams alongside extensive DVR metadata. FFmpeg's WTV demuxer handles the format reasonably well but may occasionally struggle with recordings that contain mid-stream codec parameter changes, which sometimes occur in digital broadcast captures when channels change resolution or bitrate. The -vn flag is essential here because WTV video streams (typically H.264) are not valid in an M4B container, and omitting it would cause the mux to fail or produce a non-compliant file. The -movflags +faststart flag is a required special flag for M4B output that moves the 'moov' atom to the beginning of the file; without it, players may need to download the entire file before playback can start, which is particularly problematic for large broadcast recordings. M4B does not support multiple audio tracks, so the multi-audio-track capability of the source WTV format is necessarily reduced to a single stream in the output. If your WTV recording is very long (a multi-hour live event, for example), be aware that the browser-based conversion runs entirely in WebAssembly and may be slow or run into memory constraints — the desktop FFmpeg command is strongly recommended for files approaching or exceeding 1GB.