"Millions of hours of video data. Cleaned, segmented, and semantically searchable for AI labs."
TL;DR: Shofo builds complete pipelines that collect, segment, sanitize, and label videos from across social media to curate custom datasets for AI labs.
The team met while building a previous startup called Correkt. Correkt was an AI search engine focused on multimodal content and reached over 40k users before pivoting to become Shofo.
AI labs need massive video datasets, but high-quality, segmented video data is hard to access.
✅ The Solution
The team started by building the largest index of short-form videos. They then run an end-to-end pipeline that sanitizes the videos and applies object and activity detection, reasoning, and segmentation to produce custom training datasets for AI labs.
For example, if a lab needs 50k cooking videos featuring hand-object interactions, the team queries its index, runs the results through its labeling pipeline, and delivers a clean, annotated dataset.
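The query, sanitize, and label flow in that example can be pictured with a small sketch. This is not Shofo's actual code: the `Video` record, the tag-based index lookup, the `has_personal_info` flag, and the detector functions are all hypothetical stand-ins chosen only to show the shape of such a pipeline.

```python
from dataclasses import dataclass, field


@dataclass
class Video:
    # Hypothetical record; real index entries would carry far more metadata.
    video_id: str
    tags: set[str]
    has_personal_info: bool = False
    labels: list[str] = field(default_factory=list)


def query_index(index, required_tags, limit):
    """Return up to `limit` videos whose tags include every required tag."""
    matches = [v for v in index if required_tags <= v.tags]
    return matches[:limit]


def sanitize(videos):
    """Drop videos flagged as containing personal information (assumed rule)."""
    return [v for v in videos if not v.has_personal_info]


def label(videos, detectors):
    """Attach the name of each detector that fires on a video."""
    for v in videos:
        v.labels = [name for name, fires in detectors.items() if fires(v)]
    return videos


def build_dataset(index, required_tags, detectors, limit):
    """Chain the three stages: query, then sanitize, then label."""
    return label(sanitize(query_index(index, required_tags, limit)), detectors)


# Toy index standing in for the short-form video index.
index = [
    Video("a", {"cooking", "hands"}),
    Video("b", {"cooking"}),
    Video("c", {"cooking", "hands"}, has_personal_info=True),
]

# Toy detector; a real one would run a vision model on the frames.
detectors = {"hand_object_interaction": lambda v: "hands" in v.tags}

dataset = build_dataset(index, {"cooking", "hands"}, detectors, limit=50_000)
print([(v.video_id, v.labels) for v in dataset])
```

In this toy run only video `a` survives: `b` lacks the required tag and `c` is removed during sanitization. Keeping each stage as a separate function mirrors the description above, where the index query and the labeling pipeline are distinct steps.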