Drop in your clips and Rushes classifies them using Apple's on-device ML models — B-roll, interview, outdoor, food, screen recording, and more. No manual tagging required.
How It Works
Manual tagging doesn't scale. When you're dealing with hundreds or thousands of clips across multiple shoots, adding metadata by hand is tedious and inconsistent. You end up with an untagged pile of footage that's hard to search and harder to reuse.
When you import footage, Rushes samples frames from each clip and runs them through Apple's Vision framework and a Core ML model. The system analyzes visual content — people, scenes, objects, activities — and assigns relevant tags automatically.
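The pipeline described above can be sketched with public Apple APIs. This is an illustrative sketch, not Rushes' actual implementation: it samples a handful of frames from a clip with AVAssetImageGenerator and classifies each with Vision's built-in VNClassifyImageRequest. The function name, sample count, and confidence threshold are all assumptions.

```swift
import AVFoundation
import Vision

/// Sample a few frames from a clip and collect Vision's classification labels.
/// Sketch only — frame count, thresholds, and error handling are illustrative.
func autoTags(for clipURL: URL, sampleCount: Int = 5) throws -> Set<String> {
    let asset = AVURLAsset(url: clipURL)
    let generator = AVAssetImageGenerator(asset: asset)
    generator.appliesPreferredTrackTransform = true

    let duration = CMTimeGetSeconds(asset.duration)
    var tags = Set<String>()

    for i in 0..<sampleCount {
        // Spread sample points evenly across the clip's duration.
        let seconds = duration * (Double(i) + 0.5) / Double(sampleCount)
        let time = CMTime(seconds: seconds, preferredTimescale: 600)
        let frame = try generator.copyCGImage(at: time, actualTime: nil)

        // VNClassifyImageRequest uses Apple's built-in taxonomy and
        // runs entirely on-device.
        let request = VNClassifyImageRequest()
        try VNImageRequestHandler(cgImage: frame, options: [:]).perform([request])

        // Keep only labels the model is reasonably confident about.
        let confident = (request.results ?? [])
            .filter { $0.hasMinimumRecall(0.01, forPrecision: 0.9) }
            .map(\.identifier)
        tags.formUnion(confident)
    }
    return tags
}
```

In practice, raw classifier labels would be mapped onto user-facing categories like "food" or "outdoor", and a custom Core ML model could cover categories Vision's taxonomy doesn't, such as "screen recording".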
All ML processing runs on your Mac using Apple's Neural Engine (on Apple Silicon) or CPU (on Intel). No frames, thumbnails, or metadata ever leave your machine. There are no cloud APIs, no accounts, and no data collection.
The models require roughly 30 MB of memory. On an M1 MacBook Air, classification takes about 1-2 seconds per clip.
Tags are used throughout Rushes — for filtering, smart collections, and browsing. Once your footage is tagged, drag clips directly into Final Cut Pro, DaVinci Resolve, Premiere Pro, or any editor. Rushes references your original files, so there's no re-importing or transcoding step.