mxahan.github.io

Optical Flow

For consecutive frames in a video, we assume that an object moves but the intensity of its pixels remains the same (the brightness constancy assumption).

Now, using Taylor's formula to first order,

Combining the earlier two gives,

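This shows the relation between the image gradients along the x, y, and time axes. The unknowns are u and v, the flow components along x and y. A single equation cannot determine two unknowns per pixel, so recovering the motion requires methods such as mean-shift color histogram tracking or the Lucas-Kanade method. It is an optimization problem.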
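The equations for the three steps above did not survive in these notes. A sketch of the usual derivation is given here, assuming the standard notation (not confirmed by the original): I(x, y, t) is the image intensity, (dx, dy) is the displacement over a time step dt, and I_x, I_y, I_t are partial derivatives.

```latex
\begin{align}
% Brightness constancy: a point keeps its intensity as it moves
I(x, y, t) &= I(x + dx,\; y + dy,\; t + dt) \\
% First-order Taylor expansion of the right-hand side
I(x + dx,\; y + dy,\; t + dt) &\approx I(x, y, t) + I_x\,dx + I_y\,dy + I_t\,dt \\
% Combining the two and dividing by dt, with u = dx/dt and v = dy/dt
I_x\,u + I_y\,v + I_t &= 0
\end{align}
```

The last line is one linear equation in the two unknowns u and v, which is why extra assumptions such as small motion and spatial coherence are needed.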

Two approaches to recovering motion are worth distinguishing:

  1. Feature tracking: extract visual features and track them across frames.
  2. Optical flow: recover the image motion at each pixel from spatio-temporal variations in image brightness (the brightness constancy, small motion, and spatial coherence assumptions should hold).

Solving the equation: the earlier constraint is rewritten in matrix form.
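The matrix form itself is missing from these notes. As a minimal sketch, assuming the Lucas-Kanade idea of stacking the constraint I_x u + I_y v = -I_t for every pixel in a window and solving by least squares, the following NumPy code estimates a single flow vector for a small synthetic image. The names `lucas_kanade_window` and `blob` are hypothetical, and using one flow vector for the whole window is a simplification.

```python
import numpy as np

def lucas_kanade_window(frame0, frame1):
    """Estimate one flow vector (u, v) for the whole window by least squares."""
    frame0 = frame0.astype(np.float64)
    frame1 = frame1.astype(np.float64)
    # np.gradient returns the derivative along axis 0 (rows, y) first,
    # then along axis 1 (columns, x). Averaging the two frames centers
    # the spatial gradient in time.
    Iy, Ix = np.gradient(0.5 * (frame0 + frame1))
    It = frame1 - frame0  # temporal derivative for a unit time step
    # Stack one constraint per pixel: A @ [u, v] = b
    A = np.stack([Ix.ravel(), Iy.ravel()], axis=1)
    b = -It.ravel()
    flow, *_ = np.linalg.lstsq(A, b, rcond=None)
    return flow

# Synthetic test pattern (an assumption for illustration): a Gaussian blob
# that moves by (+0.3, -0.2) pixels in (x, y) between the two frames.
yy, xx = np.mgrid[0:64, 0:64]

def blob(cx, cy, sigma=6.0):
    return np.exp(-((xx - cx) ** 2 + (yy - cy) ** 2) / (2.0 * sigma ** 2))

frame0 = blob(32.0, 32.0)
frame1 = blob(32.3, 31.8)
u, v = lucas_kanade_window(frame0, frame1)
print(f"estimated flow: u={u:.3f}, v={v:.3f}")
```

The estimate is only approximate because the gradients are finite differences and the model is linearized, so it holds for small motions. Practical implementations solve this per pixel neighborhood and usually add an image pyramid for larger displacements.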

Deep learning offers several implementations of optical flow estimation: FlowNet and its variants.

  1. FlowNetS:
    • Simple implementation
    • Encoder-decoder layers
  2. EPE/APE
    • Euclidean distance between the predicted and ground-truth flow vectors
  3. FlowNetC
    • Uses a correlation layer
    • Two similar processing streams, one per input frame
  4. FlowNet 2.0
    • First stage: FlowNetC
    • FlowNetCS (Combination of C and S)
    • Warping
      • Training dataset: synthetic data
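The EPE metric from the list above can be sketched in a few lines. This assumes dense flow fields stored as arrays of shape (H, W, 2) and averages the per-pixel Euclidean distance; the function name `endpoint_error` and the tiny example arrays are hypothetical.

```python
import numpy as np

def endpoint_error(flow_pred, flow_true):
    """Mean Euclidean distance between predicted and ground-truth flow vectors."""
    per_pixel = np.linalg.norm(flow_pred - flow_true, axis=-1)
    return float(np.mean(per_pixel))

# Toy 2x2 flow fields: only one pixel is wrong, by the vector (3, 4).
flow_true = np.zeros((2, 2, 2))
flow_pred = np.zeros((2, 2, 2))
flow_pred[0, 0] = [3.0, 4.0]

epe = endpoint_error(flow_pred, flow_true)
print(epe)  # 1.25, i.e. one error of length 5 averaged over 4 pixels
```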

Eulerian Video Magnification

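A computational technique for visualizing small changes in video. The idea is to approximate the signal with a function and then magnify its variation. The name comes from fluid mechanics, where the Eulerian perspective (observing fixed locations) is contrasted with the Lagrangian perspective (following moving particles). In the phase-based variant, each image is transformed into a complex steerable pyramid and the phase variations are exaggerated, which amplifies the small motions.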

  1. Linear video magnification: first-order Taylor argument.

1.1 1D translation: the goal is to magnify the motion of the following signal.

The part of interest is the change over time.

Using a Taylor expansion:

Now the magnification:

Amplification factor: (1 + α)
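The equations for section 1.1 are missing from these notes. The following is a sketch of the usual first-order argument, assuming a 1D signal f translated by a small displacement δ(t), an amplification gain α, and an idealized band B that captures the whole change between frames. These symbols are assumptions, not taken from the original.

```latex
\begin{align}
% Signal under translation
I(x, t) &= f\big(x + \delta(t)\big), \qquad I(x, 0) = f(x) \\
% First-order Taylor expansion about x
I(x, t) &\approx f(x) + \delta(t)\,\frac{\partial f(x)}{\partial x} \\
% The change of interest
B(x, t) &= I(x, t) - I(x, 0) \approx \delta(t)\,\frac{\partial f(x)}{\partial x} \\
% Magnification: add the amplified change back to the signal
\tilde{I}(x, t) &= I(x, t) + \alpha\,B(x, t)
  \approx f(x) + (1 + \alpha)\,\delta(t)\,\frac{\partial f(x)}{\partial x}
  \approx f\big(x + (1 + \alpha)\,\delta(t)\big)
\end{align}
```

Under these assumptions the displacement δ(t) appears multiplied by (1 + α) in the output.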

1.2 General case: similar to the general Taylor expansion, with an amplification factor.

1.3. Limitation:
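The notes do not say which limitation is meant. One well-known candidate, offered here as an assumption, is that the first-order Taylor approximation only holds while the amplified displacement (1 + α)δ stays small relative to the spatial scale of the signal. The sketch below checks this numerically on a 1D Gaussian; `f`, `magnify`, and `max_error` are hypothetical names.

```python
import numpy as np

def f(x):
    # Test signal: a Gaussian bump with unit width
    return np.exp(-x ** 2 / 2.0)

x = np.linspace(-5.0, 5.0, 1001)
delta = 0.01  # small true displacement

def magnify(alpha):
    frame = f(x + delta)   # observed frame I(x, t) = f(x + delta)
    band = frame - f(x)    # change relative to the reference frame
    return frame + alpha * band

def max_error(alpha):
    # Compare against the ideally magnified signal f(x + (1 + alpha) * delta)
    target = f(x + (1.0 + alpha) * delta)
    return float(np.max(np.abs(magnify(alpha) - target)))

err_small = max_error(5.0)    # amplified shift 0.06, well below the bump width
err_large = max_error(200.0)  # amplified shift 2.01, larger than the bump width
print(err_small, err_large)
```

With the small gain the linear result stays close to the truly shifted signal, while with the large gain the error becomes comparable to the signal itself.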

A better alternative exists:

  2. Phase-based magnification: uses wavelets.

2.1 Simplified global case: an assumption is made about the functional form of the previous function.

The phase difference is obtained from the change over time and is then multiplied by the amplification factor. The image is broken into local sinusoids using a complex steerable pyramid.
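The global case can be illustrated with an ordinary Fourier transform standing in for the complex steerable pyramid. This is a simplification: it only handles one global translation of a periodic 1D signal, whereas the pyramid localizes the same idea in space and scale. The function `phase_magnify_1d` and the test signal are hypothetical.

```python
import numpy as np

def phase_magnify_1d(frame0, frame1, alpha):
    """Amplify the per-frequency phase change between two frames by alpha."""
    F0 = np.fft.fft(frame0)
    F1 = np.fft.fft(frame1)
    # Phase difference of each global sinusoid between the two frames
    dphi = np.angle(F1 * np.conj(F0))
    # Add alpha times the phase change on top of the second frame
    return np.real(np.fft.ifft(F1 * np.exp(1j * alpha * dphi)))

def signal(x):
    # Periodic, band-limited test signal
    return np.sin(x) + 0.5 * np.cos(3.0 * x)

n = 256
x = 2.0 * np.pi * np.arange(n) / n
delta = 0.02   # small true translation
alpha = 10.0   # amplification gain

frame0 = signal(x)
frame1 = signal(x + delta)
magnified = phase_magnify_1d(frame0, frame1, alpha)

# The ideal result is the signal translated by (1 + alpha) * delta
target = signal(x + (1.0 + alpha) * delta)
phase_error = float(np.max(np.abs(magnified - target)))
print(phase_error)
```

Because a translation is exactly a phase shift for each sinusoid, this global version reproduces the magnified translation up to floating-point error, provided the phase difference between the two frames does not wrap around.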

2.2 Complex steerable pyramid: uses the concept of wavelets and basis functions to localize in both frequency and space.

2.3 Phase shift and translation: related to phase-based optical flow.

Model Compression

survey 1

Focused on 4 key Contributions

Also

blog 1

Importance Sampling

medium link