Upload any surveillance footage. YAI detects threats, tracks individuals, and delivers a full safety report — powered by YOLOv10x, ByteTrack, and AnomalyTransformer.
Upload your footage and get a complete AI-powered safety analysis in under a minute.
Drop a video file and configure your detection settings. The pipeline runs detection, tracking, and anomaly analysis automatically.
| Track ID | Class | Frames Active | Avg Confidence | Flagged |
|---|---|---|---|---|
YAI (Video Safety Analyzer) is a web product built on top of OmniTrack — a real-time video analytics pipeline combining state-of-the-art computer vision models. Users upload a video, choose a confidence threshold, and receive a structured safety report identifying behavioral anomalies, tracked individuals, and flagged events.
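As a rough illustration of how a per-track report could be assembled, the sketch below groups detections by track ID, drops anything under the chosen confidence threshold, and emits one row per track with the same columns as the results table. The `Detection` record, the `summarize_tracks` helper, and the field names are hypothetical and are not taken from the OmniTrack codebase.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    # Hypothetical record for one tracked detection in one frame.
    track_id: int
    label: str
    frame: int
    confidence: float


def summarize_tracks(detections, threshold=0.5, flagged_ids=frozenset()):
    """Build one report row per track from detections at or above threshold."""
    rows = {}
    for det in detections:
        if det.confidence < threshold:
            continue  # drop detections below the user-chosen threshold
        row = rows.setdefault(
            det.track_id,
            {"label": det.label, "frames": set(), "conf_sum": 0.0, "count": 0},
        )
        row["frames"].add(det.frame)
        row["conf_sum"] += det.confidence
        row["count"] += 1

    report = []
    for track_id in sorted(rows):
        row = rows[track_id]
        report.append({
            "track_id": track_id,
            "class": row["label"],
            "frames_active": len(row["frames"]),
            "avg_confidence": round(row["conf_sum"] / row["count"], 3),
            "flagged": track_id in flagged_ids,
        })
    return report
```

Raising `threshold` shrinks both the number of tracks and the frames counted for each, which is the trade-off the confidence setting exposes to the user.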
YOLOv10x runs frame-by-frame inference to detect people and objects. CLIP provides open-vocabulary classification, enabling zero-shot category recognition beyond the fixed YOLO labels.
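Zero-shot classification of this kind usually comes down to comparing an image embedding against text embeddings of candidate labels. The sketch below shows only that comparison step, using plain Python lists in place of real CLIP embeddings. The `zero_shot_label` helper is hypothetical and the temperature value is an assumption, not a setting taken from the pipeline.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def zero_shot_label(image_embedding, text_embeddings, temperature=0.01):
    """Return (best_label, probabilities) via cosine similarity and softmax.

    text_embeddings maps each candidate label to its embedding vector.
    """
    labels = list(text_embeddings)
    logits = [
        cosine(image_embedding, text_embeddings[label]) / temperature
        for label in labels
    ]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    probs = {label: e / total for label, e in zip(labels, exps)}
    return max(probs, key=probs.get), probs
```

Because the label set is just a dictionary of text embeddings, new categories can be added by embedding a new prompt, with no retraining of the detector.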
ByteTrack paired with Kalman filtering associates detections across frames, handling occlusions and re-entries gracefully. Each person receives a persistent track ID with full trajectory history.
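The core idea behind ByteTrack is to match high-confidence detections to existing tracks first, then give the leftover tracks a second chance against low-confidence detections, which helps keep IDs alive through partial occlusion. The sketch below is a simplified stand-in: it uses greedy IoU matching rather than Hungarian assignment, compares against last known boxes rather than Kalman-predicted ones, and all function names and thresholds are illustrative assumptions.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def greedy_match(tracks, detections, min_iou):
    """Pair track IDs with detection indices, best IoU first.

    tracks: {track_id: box}; detections: list of (index, detection) pairs.
    """
    pairs = sorted(
        ((iou(box, det["box"]), tid, idx)
         for tid, box in tracks.items()
         for idx, det in detections),
        reverse=True,
    )
    matches, used_tracks, used_dets = {}, set(), set()
    for score, tid, idx in pairs:
        if score < min_iou:
            break  # remaining pairs overlap too little to match
        if tid in used_tracks or idx in used_dets:
            continue
        matches[tid] = idx
        used_tracks.add(tid)
        used_dets.add(idx)
    return matches


def associate(tracks, detections, high=0.5, min_iou=0.3):
    """Two-stage association: strong detections first, then weak ones."""
    indexed = list(enumerate(detections))
    strong = [(i, d) for i, d in indexed if d["score"] >= high]
    weak = [(i, d) for i, d in indexed if d["score"] < high]
    matches = greedy_match(tracks, strong, min_iou)
    remaining = {t: b for t, b in tracks.items() if t not in matches}
    matches.update(greedy_match(remaining, weak, min_iou))
    return matches
```

Unmatched strong detections would normally start new tracks, and unmatched tracks would be kept for a few frames before being dropped. Both steps are omitted here for brevity.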
AnomalyTransformer scores trajectories for unusual behavioral patterns, including loitering, speed anomalies, crowd surges, forbidden-zone violations, trajectory anomalies, and disappearances.
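To make two of these categories concrete, the sketch below flags loitering and speed anomalies with simple hand-written rules. It is not the AnomalyTransformer model, which learns its scores from data. It is a rule-based stand-in, and the `flag_track` helper, the pixel units, and every threshold are assumptions chosen for illustration.

```python
import math


def flag_track(points, fps, loiter_radius=50.0, loiter_seconds=10.0,
               max_speed=300.0):
    """Return the rule names triggered by one trajectory.

    points: (x, y) centroids in pixels, one per consecutive frame.
    fps: frames per second of the source video.
    """
    flags = []
    if len(points) < 2:
        return flags

    # Loitering: the track lasts long enough and never strays far
    # from its own mean position.
    duration = (len(points) - 1) / fps
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    spread = max(math.dist((cx, cy), p) for p in points)
    if duration >= loiter_seconds and spread <= loiter_radius:
        flags.append("loitering")

    # Speed anomaly: the largest frame-to-frame displacement, converted
    # to pixels per second, exceeds the limit.
    peak_speed = max(math.dist(p, q) for p, q in zip(points, points[1:])) * fps
    if peak_speed > max_speed:
        flags.append("speed_anomaly")

    return flags
```

Pixel-based thresholds like these depend heavily on camera placement, which is one reason a learned scorer is attractive for this step.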
Depth Anything v2 provides monocular depth estimates per frame, giving the pipeline 3D spatial context for more accurate threat assessment without requiring stereo cameras.
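One simple way to attach depth to a tracked person is to take the median of the depth map inside their bounding box, which is less sensitive to background pixels than the mean. The sketch below assumes the depth map arrives as a 2D list of numbers indexed `[row][column]`, and the `box_depth` helper is hypothetical. Whether larger values mean nearer or farther depends on the model's output convention, so check that before comparing tracks.

```python
from statistics import median


def box_depth(depth_map, box):
    """Median depth value inside an (x1, y1, x2, y2) pixel box.

    Returns None when the box lies entirely outside the map.
    """
    x1, y1, x2, y2 = box
    height, width = len(depth_map), len(depth_map[0])
    # Clamp the box to the image bounds before sampling.
    x1, x2 = max(0, x1), min(width, x2)
    y1, y2 = max(0, y1), min(height, y2)
    values = [depth_map[y][x] for y in range(y1, y2) for x in range(x1, x2)]
    return median(values) if values else None
```

Calling `box_depth` once per track per frame yields a depth series that can sit alongside the 2D trajectory.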
The FastAPI backend runs in a Kaggle notebook on a free T4 GPU, tunneled via an ngrok permanent domain. The frontend is a static single-file HTML app hosted on Netlify, with no build step and no framework dependencies.
OmniTrack is publicly available on GitHub. Contributions, issues, and forks are welcome.