Skip to main content

About Video from Images + Audio

Video from Images + Audio performs video from images + audio as a focused single-page utility. Combine a stack of photos with a music track or narration into a single MP4 video. Pick the resolution, per-slide duration, transitions, and let the slideshow length match the audio. All processed in your browser with FFmpeg WebAssembly. Defaults are tuned for the common case so the first run is one click, with every option that matters exposed for the moments you need to fine-tune the result.

Reach for Video from Images + Audio when you need a predictable result on a single file. The page works on the first visit, the controls are visible without a menu, and the output is delivered the moment the engine finishes.

Video from Images + Audio performs the transformation entirely inside the JavaScript runtime. Your file lives in the tab's memory while the engine works on it; the result lives in the tab's memory until the browser triggers the download. Both are released when the tab closes, the way every browser tab releases its memory.

Behind the controls you see, FFmpeg compiled to WebAssembly is doing the actual video editing and conversion. MP4, WebM, MOV, MKV, AVI, FLV, and OGV are first-class formats and the engine produces a deterministic output for any given input + options combination — useful when you need to re-run a job and expect identical results.

The 500 MB ceiling on input size is the only fixed limit. Output files are produced in standard formats that every common viewer recognises, and the tool runs the same way regardless of how many times you have used it during the session.

Typical users of Video from Images + Audio include educators editing lecture clips, support agents preparing screen recordings and creators trimming short clips. The thread connecting all of them is the same: a focused video editing and conversion task that fits cleanly into a browser tab and benefits from a tool with sensible defaults and minimal setup.

The download is delivered as `{name}-edited.{ext}` the moment processing completes — no email link, no "your result will be ready in 5 minutes" queue, no expiry timer. The file is generated in your browser and saved by your browser's normal download flow.

As a workflow component, Video from Images + Audio is the part you reach for when a single, well-defined video editing and conversion step needs to happen. It performs that step and returns a standard file you can carry into the next part of your pipeline.

Some notes on the design of Video from Images + Audio. The page is intentionally narrow: one input, the controls relevant to the task, and one output. Adding unrelated features would make the common case slower for the majority of users, so the surface is held to what people actually use.

Some context on why Video from Images + Audio exists in this form: modern File APIs, high-performance JavaScript engines, and well-maintained open-source libraries now make it possible to perform video editing and conversion work entirely in the browser. Video from Images + Audio is built on top of that capability, which is why a single page can host the full pipeline.

As a single-page tool, Video from Images + Audio stays focused on one video editing and conversion step. Multi-step workflows are composed by chaining adjacent tools — each tool produces a standard file the next one can read directly, so a longer pipeline is just a sequence of short tab-and-tab visits.

A few practical tips that experienced users of Video from Images + Audio pick up over time. First, keep your default browser updated — the engine relies on standard web APIs and newer browser versions are noticeably faster than ones from a few years ago. Second, close other heavy tabs before processing a large input; the engine shares CPU and memory with whatever else is open. Third, if you re-run the same kind of job often, your last-used settings are remembered for the rest of the tab session, so subsequent runs are essentially one click.

If the result is not what you expected, the most common causes are easy to check. Confirm the input is under the 500 MB ceiling — files just above the cap fail silently because the engine refuses to allocate the buffer. Confirm the input is one of the supported formats. And if the page itself feels slow, try closing other heavy tabs to free up memory; the engine runs in your browser, so it competes for the same resources as everything else open.

Video from Images + Audio is one of many single-purpose tools in the catalog. Each is built around the same single-page model. Use this one, close the tab, and come back the next time you need the same job done. None of the tools require prior knowledge of the others — each page is self-contained.

How it works

  1. 1Reach the Video from Images + Audio page in your browser to begin.
  2. 2Add your MP4, WebM, MOV, MKV, AVI, FLV, and OGV input by dropping it onto the page or browsing for it.
  3. 3Adjust the options to match what you need. Sensible defaults cover the most common case, so you can usually skip this step.
  4. 4Trigger processing. FFmpeg compiled to WebAssembly reads your input, applies the transformation, and writes the result back into the page.
  5. 5Grab the output named `{name}-edited.{ext}` as soon as the run completes. You can also copy the result instead of downloading if the next tool in your workflow accepts pasted input.
  6. 6Re-run with different settings as often as you want. Each run produces a fresh output and the original file on disk is never modified.

Common use cases

FAQ

How does in-browser rendering work?

A WebAssembly build of FFmpeg loads in this page. Each image becomes a looped still input scaled to the chosen resolution, optional crossfades are applied with the xfade filter, and your audio is mixed in. The result is encoded to MP4 with H.264 video and AAC audio — entirely on your device.

Why is in-browser rendering slower than online tools?

Server tools have native multi-threaded FFmpeg with hardware-accelerated H.264 encoding. The WebAssembly build is single-threaded and uses software encoding, typically 4–8× slower per frame. The trade-off is total privacy — your photos and music never leave your device.

How is this different from the "Video Slideshow Maker"?

Both render an MP4 from photos. This tool requires an audio file (the soundtrack is the whole point of a music montage) and emphasizes audio-length matching. The Slideshow Maker treats audio as optional and is great for silent photo decks.

Can I match the slideshow length exactly to the music?

Yes — turn on "Trim slideshow to match audio length" and the encode stops at the audio's end (using FFmpeg's -shortest flag). Otherwise the music is held under the full slideshow runtime; if it's shorter than the slides, the audio simply ends.

How many photos and what audio formats?

Up to 60 images, 200MB total. Audio: MP3, WAV, AAC, M4A, OGG, or FLAC up to 100MB. Each image must be under 30MB.

How do I time the slides to the beat?

For exact beat-mapping, set per-slide durations matching your beat interval (e.g. 0.5s for 120 BPM). Or pick a global duration and let "Match audio length" finish the slideshow when the song ends.

What aspect ratios are supported?

1080p and 720p landscape, square 1080×1080 (Instagram), 9:16 vertical 1080×1920 (TikTok / Reels / Shorts), and 480p landscape. Photos are letterboxed to fit while preserving aspect ratio.

Is anything uploaded?

No. The rendering runs entirely in your browser tab. Nothing is transmitted, stored, or logged. Closing the tab erases the photos and audio from memory immediately.

Can I trust the output of Video from Images + Audio for important work?

Video from Images + Audio is built on FFmpeg compiled to WebAssembly, which is the same class of engine used by professional video editing and conversion pipelines. For deterministic operations, the output is byte-identical to what an equivalent CLI run would produce; for operations involving a codec or a model, the result is well within the range of what comparable tools generate. If you have a specific reference output you need to match, run a small test job first to confirm the configuration produces what you expect.

How is Video from Images + Audio different from desktop apps that do the same thing?

Desktop apps usually have more advanced features but require installation, maintenance and (often) a licence. Paid online tools are convenient but route your file through their servers and gate downloads behind accounts. Video from Images + Audio sits in between: free, instant, and private, but intentionally narrow in scope. For one-off jobs and the common video editing and conversion operations, it is usually the lowest-friction choice; for highly specialised work, a dedicated app is still the right answer.

Is Video from Images + Audio keyboard accessible?

Video from Images + Audio uses native HTML controls wherever possible, which means keyboard navigation, focus rings, and screen-reader labels work the way the platform expects. The drop zone accepts files via the keyboard-accessible file picker as well as drag-and-drop, and result downloads use standard browser download flows. If you spot an accessibility gap, Favtoo treats it as a bug worth fixing.

Can I use Video from Images + Audio with formats other than the defaults?

Video from Images + Audio accepts MP4, WebM, MOV, MKV, AVI, FLV, and OGV. If your input is in a format that is not directly supported, convert it first using one of Favtoo's converter tools — every Favtoo converter outputs a file that is a clean input to the next tool in the chain.

Can I use Video from Images + Audio for commercial work?

Video from Images + Audio can be used for personal and commercial work alike — there is no separate "business" licence to purchase. The output you generate is yours to use however you want, including in client deliverables, internal documents, or commercial products. Favtoo's only ask is fair, individual use; the tool is not designed to be embedded as a backend service or wrapped behind an API for resale.

Will Video from Images + Audio keep working if my Wi-Fi drops mid-task?

Once the page is loaded, Video from Images + Audio can complete jobs without an active internet connection — the engine is bundled with the page, so there is no per-job network call. The initial page load does require a connection (to fetch the static assets), but after that you can disconnect entirely and the tool will still work. This is a side-effect of the local-first architecture, not a deliberate "offline mode" feature.

Do I need a specific browser to use Video from Images + Audio?

Video from Images + Audio works in any modern browser released in the last few years — Chrome, Edge, Firefox, Safari, Brave, Arc and the major Chromium derivatives are all supported. The underlying engine relies on widely-supported web APIs, so there is nothing exotic to install. If you are on a very old browser version and the tool fails to load, updating to the latest release of your preferred browser is the only fix needed.

Does Video from Images + Audio require a browser extension or plug-in?

No installation is needed. Video from Images + Audio runs as a normal web page, with no browser extension, no native helper, and no separate desktop client to download. That is partly a privacy choice — extensions can request broad permissions, while a regular page is sandboxed by default — and partly a convenience one: you can use Video from Images + Audio on any computer you have temporary access to without leaving anything installed on it.

Video to GIF

Convert any video clip to an animated GIF entirely in your browser. Pick the start, length, frame rate, and width — your file is processed locally with FFmpeg WebAssembly and never uploaded.

Screen Recorder

Record your screen, a window, or a browser tab directly in your browser. Optionally include system audio and your microphone. Capture, preview, and download the video without installing any app — and without uploading anything.

Webcam Recorder

Record your webcam directly in your browser with optional microphone audio. Pick the resolution (480p, 720p, or 1080p), frame rate, and mirror mode, then capture and download the result without installing any app.

Screen + Webcam Recorder

Record your screen with your webcam composited into a picture-in-picture corner — perfect for tutorials, course videos, demos, and reaction recordings. Pick the camera position, size, and audio sources, then capture and download in your browser.

Video Slideshow Maker

Turn a stack of photos into an MP4 slideshow with per-slide durations, crossfades, and an optional soundtrack. Pick the resolution (up to 1080p), frame rate, and transitions, then download a single MP4 — all processed in your browser with FFmpeg WebAssembly.

Video Trimmer

Set precise in and out timestamps, snap to keyframes when needed, and document handles for social-safe cutdowns.

Video Splitter

Split any video into 2–10 equal-length pieces, packaged as a downloadable ZIP. Files are processed entirely in your browser with FFmpeg WebAssembly using lossless stream-copy.

Video Merger

Combine multiple video clips into a single MP4 in your browser. Drop in MP4, WebM, MOV, MKV, AVI or FLV files, drag to reorder them, pick a target resolution and frame rate, and merge — all locally with no uploads.

View all Video Tools