Table of Contents



Image Localization

  1. Image Localization:
    Localization is the task of finding a single object in an image.
    img
  2. Image Classification+Localization:
    Localization can be combined with classification to, not only find the location of an object but, also, to classify it into one of different classes.
    img
  3. Structure:
    • Input: Image
    • Output: A vector of 4 coordinates of the bounding box.
  4. Applications:
    • Smart Cropping
    • Regular Object Extraction (as a pre-processing step)
    • Human Pose Estimation: Represent pose as a set of 14 joint positions

Approaches

  1. Localization as a Regression Problem:
    Since we are concerned with returning real-valued numbers (the bounding box coordinates), we use a method that is suitable for the task, Regression.
    img
    • Algorithm:
      • Use any classification architecture
      • Attach two Fully Connected Layers, one for Classification and one for Localization
      • Backpropagate through the whole network using cross-entropy loss and L2 loss respectively.
    • Evaluation Metric: Intersection over Union.

Training Methods, Approaches and Algorithms

Updated Soon!