Audio & Video
8 toolsBrowser-based audio and video utilities. Record audio and screen, snap webcam photos, extract thumbnails from video, mute video tracks, and use built-in speech synthesis and recognition.
Audio Recorder
Online audio recorder — record from your microphone right in the browser and download the take as a WebM file. Free, with no upload and no signup needed.
Webcam Snapshot
Take a webcam photo or screenshot right in your browser and save it as a PNG. Front or back camera, mirror preview, no upload — nothing leaves your device.
Video Thumbnail
Video thumbnail extractor — scrub to any frame of a video and save it as a JPG or PNG still image. Free to use, and the video never leaves your device.
Audio Trimmer
Audio trimmer online — cut a clip from an MP3 or WAV file by setting start and end points, then download it. Free, and audio never leaves your browser.
Text to Speech
Text to speech online — have any text read aloud using built-in browser voices in multiple languages, with rate and pitch controls. Free and instant.
Speech to Text
Speech to text online — dictate into your microphone and watch a live transcript appear using built-in browser speech recognition. Free, with no signup.
Screen Recorder
Screen recorder online — capture a browser tab, window, or your entire screen and download the recording as WebM. Free, no install and no upload needed.
Video Mute
Mute video online — strip the audio track from MP4, WebM, or MOV files and download a silent copy in seconds. Free, and files never leave your device.
A privacy-first media studio that lives in a browser tab
Every tool in this category shares one design rule: your microphone audio, camera frames, screen captures, and media files are processed where they already are — on your own machine — instead of being shipped to a server for handling. That is not a marketing line; it is a consequence of which browser APIs these tools are built on. The pages below explain, tool by tool, exactly where the bytes travel so you can decide what is safe to run for any given clip.
Why “runs in the browser” usually means “stays on your device”
When a page grabs your camera with getUserMedia() or your screen with getDisplayMedia(), the resulting stream is a JavaScript object inside the tab. Encoding it into a saved file with MediaRecorder, painting a frame to a <canvas>, or trimming samples through the Web Audio API all happen in that same sandbox. None of those steps require a network call, so the finished file can be handed to you as a download without ever leaving the device. The one deliberate exception is live speech recognition — see below.
The honest exception: speech-to-text
Live transcription is the only tool here that is notfully on-device. Browsers implement the Web Speech Recognition API by streaming your microphone audio to the browser maker's cloud (Google's servers in Chrome, Apple's in Safari) and returning text. We call this out plainly on that tool's page so you can avoid dictating passwords or confidential material through it. Everything else — recording, snapshots, trimming, muting, thumbnail grabs, and text-to-speech with local voices — keeps your data on the machine.
When the browser beats a desktop app — and when it does not
Reach for these tools when you want zero install, instant access on any computer, and a guarantee that a one-off private clip never touches someone else's server. They win for a quick voice memo, a screenshot from a video, an ID photo, or stripping music off a clip before posting. Where a desktop app like FFmpeg still wins is bulk work: batch-converting hundreds of files, lossless container edits with no re-encode, or exporting directly to MP3/MP4 without a second conversion step. The browser trades a little fidelity and throughput for privacy and convenience — a good deal for the single-file jobs these tools are built for.
Start with one of these
- Audio Recorder — capture a voice memo or interview with nothing uploaded.
- Screen Recorder — record a tab, window, or full screen locally.
- Webcam Snapshot — take an ID or profile photo on-device.
- Video Thumbnail — pull a cover frame from any video file.
- Text to Speech — read text aloud with your system voices.
- Speech to Text — dictate to editable text (cloud-processed).