TAPE3D: TRACKING ALL PIXELS EFFICIENTLY IN 3D






TAPE3D captures dense, long-range, 3D trajectories from casual videos in a feed-forward manner.

Comparison with 3D tracking approaches: SceneTracker, SpatialTracker, and the 3D version of DOT







More results of 3D dense tracking can be found here.

Comparison with 2D tracking approaches (lifted to 3D with depth): CoTracker and LocoTrack






Occlusion problem when using 2D tracks + depth


Frontal view


Side view

(Note that all of the methods use the same video depth input)

The baseline (2D tracker + depth) struggles with inconsistent video depth estimates, which produce noticeable jitter. It also fails to track objects accurately through occlusion, as seen in the side view when the ball rolls behind the tree.
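To make the baseline's failure mode concrete, here is a minimal sketch of lifting a 2D track to 3D with per-frame depth. It is an illustration only, not the code used by any of the compared methods: the function name `lift_track_to_3d`, the pinhole intrinsics `fx`, `fy`, `cx`, `cy`, and the nearest-pixel depth lookup are all assumptions introduced here.

```python
def lift_track_to_3d(track_2d, depth_maps, fx, fy, cx, cy):
    """Unproject one 2D track to 3D camera coordinates, frame by frame.

    track_2d:   list of (u, v) pixel positions, one per frame
    depth_maps: list of per-frame depth maps, each indexed as depth[row][col]
    fx, fy, cx, cy: assumed pinhole intrinsics (hypothetical, not from the source)
    """
    points_3d = []
    for (u, v), depth in zip(track_2d, depth_maps):
        height, width = len(depth), len(depth[0])
        # Nearest-pixel depth lookup, clamped to the image bounds.
        col = min(max(int(round(u)), 0), width - 1)
        row = min(max(int(round(v)), 0), height - 1)
        z = depth[row][col]
        # Pinhole back-projection: pixel (u, v) at depth z -> (X, Y, Z).
        points_3d.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points_3d


# Toy example: the tracked point stays at pixel (2, 1) and at true depth 5.0,
# but in the second frame an occluder at depth 1.0 covers that pixel.
frame0 = [[5.0, 5.0, 5.0], [5.0, 5.0, 5.0], [5.0, 5.0, 5.0]]
frame1 = [[5.0, 5.0, 5.0], [5.0, 5.0, 1.0], [5.0, 5.0, 5.0]]
track = [(2.0, 1.0), (2.0, 1.0)]
points = lift_track_to_3d(track, [frame0, frame1], fx=1.0, fy=1.0, cx=1.0, cy=1.0)
```

In this toy case the lifted point jumps from depth 5.0 to depth 1.0 in the second frame, because the depth map only records the nearest visible surface. Even a perfect 2D track therefore lands on the occluder, and any frame-to-frame noise in the depth values passes straight into the 3D trajectory as jitter.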


More qualitative results can be found here.

Application: Consistent video editing in 3D space




Application: Non-rigid structure from motion