Talk on group meeting about ChatVideo
Date:
The report, published by Yifei Cao on July 13, 2023, discusses why Visual ChatGPT is unsuitable for handling video tasks and presents key insights. It mentions the relevant components of OmiTracker and OmiVL, focusing on the generation of the Tracklets database, which specifically includes the following fields: ID (primary key), Category (trajectory category), Appearance (trajectory segment instance), Motion (motion of the trajectory segment), Trajectory (trajectory of the trajectory segment), and Audio (applicable only to trajectory segments containing complete videos). You can visit PPT here.