Search across continuous sensor streams.
Across 19 egocentric recordings and eight vision-language models, a streaming frame filter improves retrieval while cutting storage by 20x.
Read on arXiv