Skip to main content

About Audio Merger

Audio Merger is shaped around how people actually use audio editing and conversion utilities online: open the page, drop in a file, get the result. Merge up to 12 audio files into one continuous track. Supports MP3, WAV, OGG, M4A, AAC, FLAC, Opus, AIFF and more. Optional loudness normalization to even out clip levels. Files are processed entirely in your browser with FFmpeg WebAssembly. The interface stays out of the way once the work begins so the engine can use the available CPU and memory for the actual transformation.

Audio Merger is implemented on top of standard browser APIs. The accepted input formats are MP3, WAV, M4A, AAC, OGG, Opus, FLAC, AIFF, and WMA, decoded in the browser, processed, and re-encoded into the output format. Files up to 200 MB are well within the comfort zone of any modern browser.

The execution path is auditable from the page itself: open developer tools, switch to the Network tab, run a job. The requests you see are static-asset GETs for the engine and the page resources. The actual work is JavaScript code running against the bytes already in your tab's memory.

Typical users of Audio Merger include musicians sharing demos, sound designers prototyping cues and voice-over artists exporting takes. The thread connecting all of them is the same: a focused audio editing and conversion task that fits cleanly into a browser tab and benefits from a tool with sensible defaults and minimal setup.

The right moment to reach for Audio Merger is when you have a focused audio editing and conversion job that fits inside a browser tab. Open the page, drop in the file or paste your input, choose the options that matter, and the tool returns the result.

The only practical limit is the 200 MB per-file ceiling, which keeps the tool responsive across a wide range of devices. Run the tool ten times in a row, run it ten thousand times — it behaves the same way and produces the same quality of result.

If your task needs more than one step, chain Audio Merger with Audio Crossfade, Audio Splitter, and Audio Fade. Each tool produces output that is a clean input to the next, so multi-step workflows are just a matter of opening the next tool in a new tab and continuing.

Audio Merger is honest about scope: it handles a single, well-defined audio editing and conversion step. Specialist edge-case work — uncommon formats, very large inputs, or pipelines that need scripting — is what dedicated desktop apps are for. This page handles the common case quickly.

Audio Merger returns the result as `{name}-edited.{ext}`. If you are running multiple jobs, the output names will not collide as long as the input names differ. You can re-run with different settings as many times as you like; each run produces a fresh file with no caching trickery in between.

Some context on why Audio Merger exists in this form: modern File APIs, high-performance JavaScript engines, and well-maintained open-source libraries now make it possible to perform audio editing and conversion work entirely in the browser. Audio Merger is built on top of that capability, which is why a single page can host the full pipeline.

Audio Merger produces deterministic output: the same input plus the same options always produces the same result. That predictability matters when the result has to match an upstream specification or be reproducible later.

Pro tip: Audio Merger works just as well in a private/incognito window as in a normal one, which is occasionally useful when you want zero browser-history footprint of the job. Another tip: if the tool ever feels slow, it is almost always because the browser tab is competing for CPU with another tab — pausing or closing the heavy ones gives the engine room to work.

Common gotchas worth flagging: Audio Merger only accepts MP3, WAV, M4A, AAC, OGG, Opus, FLAC, AIFF, and WMA, so if your file is in another format you will need to convert it first. The 200 MB ceiling is per-file, not per-session; you can run as many separate jobs as you like, but a single oversized input will be rejected on load.

Open the workspace above to start using Audio Merger. The engine loads on the first interaction so the page itself stays light, and once the tool is warm it processes subsequent jobs quickly. The moment the page is interactive, the tool is ready to do real work on your file.

How it works

  1. 1Reach the Audio Merger page in your browser to begin.
  2. 2Select the MP3, WAV, M4A, AAC, OGG, Opus, FLAC, AIFF, and WMA file you want to process — drag-and-drop and the file picker both work.
  3. 3Pick any non-default settings you need. Most users leave the defaults alone for the first run and only revisit if the result needs tuning.
  4. 4Click to start the job. The engine (standard browser APIs) processes the input in the page; you can watch the progress indicator until it completes.
  5. 5Download the result as `{name}-edited.{ext}`. The file is generated in your browser and saved through your normal download flow.
  6. 6Run additional jobs as needed. The same controls and defaults apply on every run.

Common use cases

FAQ

How does merging work?

A WebAssembly build of FFmpeg loads in the page. Each file is decoded to a common sample rate, then concatenated using the `concat` filter. The combined buffer is encoded to your chosen format (MP3, WAV, OGG, or M4A) and offered as a download. Everything happens locally in your browser.

How many files can I merge at once?

Up to 12 files at a time, with a combined total of 200MB. For longer mixes, merge in stages: combine 12 files, then take the result and merge with more. The output of one merge can be the input of the next.

Can the files be in different formats?

Yes — mix MP3, WAV, FLAC, M4A, OGG, AAC, Opus, AIFF, and WMA freely. The merger decodes each file independently and resamples them to a common sample rate before joining, so format mismatches are handled automatically.

What does "Normalize loudness" do?

It applies the EBU R128 loudness standard (target -14 LUFS, the same level Spotify and YouTube use) to the merged output. This evens out volume differences between clips that were recorded at different levels — essential when joining audio from different sources or sessions.

How is this different from "Audio Crossfade"?

This tool butt-joins the clips with no overlap (or you can use the crossfade mode for smooth transitions). Audio Crossfade is the dedicated crossfade tool with finer control over transition curves. For simple sequential joining, this merger is faster and simpler.

How do I reorder the files?

After uploading, drag-and-drop the file cards to change their order. The order in the list is exactly the order they appear in the merged output. You can also remove individual files with the trash icon and add more later.

Why is in-browser audio processing slower than online tools?

Server-side tools use multi-threaded native FFmpeg running on dedicated CPUs with fast disks and parallel pipelines. Our engine is FFmpeg compiled to WebAssembly, which runs single-threaded inside your browser tab and has no access to native hardware acceleration. That makes browser-based jobs typically 3–8× slower than a server. The trade-off is total privacy: your audio file is never uploaded, never logged, and never stored — closing the tab erases everything from memory immediately. For most clips up to a few minutes the wait is small, and for sensitive recordings (voice memos, drafts, confidential meetings) the privacy gain is well worth it.

Is my audio uploaded?

No. Everything runs entirely inside your browser tab using FFmpeg compiled to WebAssembly. The file is read into local memory only, processed in the same tab, and the result is offered as a direct download. Nothing is transmitted to any server, no account is required, no analytics are tied to your file, and closing the tab discards every byte from memory.

How big a file can I process?

The file picker accepts audio inputs up to about 1 GB, which is well above what mainstream "free tier" online converters allow. The real ceiling is your device — everything runs inside your browser tab, which shares memory with the rest of the page. Most podcasts, songs, and voice memos sit comfortably under that limit even on a phone. If a very large lossless WAV or FLAC ever fails, trim it first or transcode to MP3 / Opus to bring the size down before re-running the tool.

Which audio formats are supported?

MP3, WAV, OGG (Vorbis and Opus), FLAC, M4A (AAC), AAC, Opus, AIFF, and WMA all decode reliably via FFmpeg WASM. Output formats depend on the specific tool — most editing tools default to MP3 (universal) or WAV (lossless) but expose a format picker so you can pick the one that fits your downstream player or DAW.

Which browsers are supported?

Recent Chrome, Edge, Firefox, Safari, and other Chromium-based browsers all work. The tool relies on WebAssembly and SharedArrayBuffer, which require the page to be served over HTTPS with the right cross-origin headers — this site is configured correctly by default. On phones the same code runs but is slower than on a desktop because mobile CPUs are weaker.

Is there a watermark, sign-up wall, or usage cap?

No. The tool is completely free, requires no account, attaches no watermark, applies no usage caps, and shows no popup ads on your output. Because the work happens on your own device, there is no per-user quota for us to enforce — your hardware and browser memory are the only limits. The download is the file you would get from running FFmpeg locally, nothing more, nothing less.

What does Audio Merger do that command-line tools do not?

Desktop apps usually have more advanced features but require installation, maintenance and (often) a licence. Paid online tools are convenient but route your file through their servers and gate downloads behind accounts. Audio Merger sits in between: free, instant, and private, but intentionally narrow in scope. For one-off jobs and the common audio editing and conversion operations, it is usually the lowest-friction choice; for highly specialised work, a dedicated app is still the right answer.

Why is my browser prompting me when I open Audio Merger?

Audio Merger only needs the standard web platform — file picker access for the inputs you choose to load, and optionally clipboard access if you copy the result rather than downloading it. There is no microphone, camera, geolocation or background-permission request, because none of those are needed for the work the tool does.

Is the source for Audio Merger available?

Audio Merger is a static page running an open-source engine in your browser, so a typical corporate firewall does not get in the way as long as it allows JavaScript to load from Favtoo. For teams that need to host it themselves on an internal network, the underlying engine (standard browser APIs) is open-source and can be packaged into a private build with the same behaviour. Reach out via the Contact page if that is something you are exploring.

Will Audio Merger keep working if my Wi-Fi drops mid-task?

Once the page is loaded, Audio Merger can complete jobs without an active internet connection — the engine is bundled with the page, so there is no per-job network call. The initial page load does require a connection (to fetch the static assets), but after that you can disconnect entirely and the tool will still work. This is a side-effect of the local-first architecture, not a deliberate "offline mode" feature.

Does Audio Merger support batch processing?

Audio Merger processes one input at a time by design — it keeps memory usage predictable on lower-end devices and makes results easier to verify. To handle a folder, run the tool once per file; the page stays loaded between runs and remembers your last-used settings, so the second run is essentially instant.

Does Audio Merger reduce quality of the result?

Audio Merger is built to preserve quality wherever the underlying audio format allows it. Operations that are mathematically lossless (e.g. structural transformations, lossless re-encoding) round-trip with no perceptible change. Operations that involve a lossy codec inevitably introduce small artefacts at the byte level, but the defaults aim at the sweet spot where output looks or sounds the same to a normal viewer or listener while still being meaningfully smaller or faster than the input.

How do I know I am using the latest version of Audio Merger?

Audio Merger is updated whenever the underlying engine releases an improvement or a bug fix. Because the tool is delivered as a static page, every visit fetches the latest version automatically — there is no "version" to manage on your end. If a particular release ever changes default behaviour, the change is documented on Favtoo's changelog so you can confirm what shifted.

What should I do if Audio Merger fails on my file?

Failures usually fall into one of three buckets: the input is in an unsupported format, the input is over the size cap, or the input is structurally malformed (a truncated download, a partial export, or a stream the engine does not recognise). The first two are easy to confirm — check that your file is one of MP3, WAV, M4A, AAC, OGG, Opus, FLAC, AIFF, and WMA and that it is below 200 MB. For the third, opening the file in its native viewer first is the fastest way to confirm the source is intact.

Which browsers are supported by Audio Merger?

Audio Merger works in any modern browser released in the last few years — Chrome, Edge, Firefox, Safari, Brave, Arc and the major Chromium derivatives are all supported. The underlying engine relies on widely-supported web APIs, so there is nothing exotic to install. If you are on a very old browser version and the tool fails to load, updating to the latest release of your preferred browser is the only fix needed.

Audio Recorder

Record from your microphone directly in the browser. Pick quality (high, medium, low), toggle echo cancellation, noise suppression and auto-gain, then save to WebM/Opus or M4A/AAC. Audio is captured locally — nothing is uploaded.

Text to Speech

Type or paste text, pick a system voice, and listen instantly. Adjust speaking rate (0.5×–2×), pitch, and volume in real time. Uses your browser's built-in Web Speech API — no cloud TTS, no API keys, no costs.

Tone Generator

Generate a pure tone at any frequency from 20 Hz to 20 kHz. Pick a sine, square, triangle, or sawtooth waveform, choose duration, amplitude, and mono/stereo. Exports a 16-bit PCM WAV file at 44.1 kHz with built-in click-preventing fades.

Silence Generator

Generate a perfectly silent WAV file of any length from 1 second up to 1 hour. Pick mono or stereo, get a 16-bit PCM WAV at 44.1 kHz. Useful as padding between clips, intro silence, leader audio for video timing, or test material.

White Noise Generator

Generate white, pink, or brown noise as a 16-bit PCM WAV file. Pick noise type, duration up to 1 hour, amplitude, and mono/stereo. Useful for sleep, focus, masking distractions, audio testing, and as a backing layer for ambient music.

Metronome

A precise browser-based metronome powered by the Web Audio API. Set BPM from 30 to 300, choose a time signature, accent the first beat, and use tap-tempo to sync. Click timing is sample-accurate using lookahead scheduling — much steadier than typical JavaScript setInterval beats.

Audio Trimmer

Trim any audio file to a precise start and end time. Outputs a lossless stream-copy by default (no quality loss, very fast) or re-encodes to MP3, WAV, OGG, or M4A. Files are processed entirely in your browser with FFmpeg WebAssembly.

Audio Splitter

Split a long audio file into N equal-length parts and download them as a ZIP. Each part is named sequentially. Great for chapterizing audiobooks, podcasts, or long DJ mixes. Runs entirely in your browser with FFmpeg WebAssembly.

View all Audio Tools