1. Detect to Track and Track to Detect_ICCV2017_Christoph Feichtenhofer et al. (code available) project page: http://www.robots.ox.ac.uk/~vgg/research/detect-track/ 1> Summary: (1) T he input to the network consists of multiple frames which are first passed through a ConvNet trunk to produce convolutional features which are shared for the task of detection and tracking. Then compute convolutional cross-correlation between the feature responses of adjacent frames to estimate the local displacement at different feature scales. On top of the features, we employ an RoI-pooling layer to classify and regress box proposals as well as an RoI-tracking layer that regresses box transformations (translation, scale, aspect ratio) across frames. 2> Network structure: Add caption 3>. Some details: (1) Reweighting scores across the detection tubes before putting them to construct the tracking regression. (2) Perform non-maximum suppression with bounding-box voting bef...