Predicting Visual Saliency of Building using Top down Approach
Sugam Anand ,CSE Sampath Kumar,CSE Mentor : Dr. Amitabha Mukerjee Indian Institute of Technology, Kanpur
Predicting Visual Saliency of Building using Top down Approach - - PowerPoint PPT Presentation
Predicting Visual Saliency of Building using Top down Approach Sugam Anand ,CSE Sampath Kumar,CSE Mentor : Dr. Amitabha Mukerjee Indian Institute of Technology, Kanpur Outline Motivation Previous Work Our Approach Saliency
Sugam Anand ,CSE Sampath Kumar,CSE Mentor : Dr. Amitabha Mukerjee Indian Institute of Technology, Kanpur
shifts of visual attention, 2000
choose for describing a route.
visual saliency
Visual Attention for Rapid Scene Analysis
28.4 % [3]
torralba(2009)-Learning to Predict where humans look
intensity, contrast , illumination and color.
high level features .
Figure taken from [1]
Taken from [6]
features
and “negative” samples, i.e., images that do not contain
distinctive features are “compressed” into the statistical model parameters.
detection of that class.
From Opencv documentation
the difference between the sum of the pixels within white and black rectangular regions for that feature.
2 2 1 1 n nh
i i i i i
f if f if x h 1 1 ) ( , where
Weak classfiers ( hi (x) ) with less error rate ,gets larger weight . Hence ,contributes in strong classifier.
1. Generating the database of positive and negative samples. 2. Make the bounding box for the object by
3. Generate the vec file out of positive samples using createsamples.exe 4. For generating classifier run the haartraining.exe 5. Run haarconv.exe to convert classifier to .xml file
[2]
Unconventional buildings attract attention against low level features used by us
attention.
After applying itti koch algo Input image thresholding
saliency based on itti and koch model
http://www.klab.caltech.edu/~harel/share/gbvs. php
covert shifts of visual attention, 2000
Learning to Predict where humans look
Human Fixations by Tilke Judd, Fredo Durand and Antonio Torralba.[2012] .
Boosted Cascade of Simple Features. Conference on Computer Vision and Pattern Recognition
Gaussian Pyramids R,G,B,Y Gabor pyramids for = {0º, 45º, 90º, 135º}
subtraction
where s [0..8] is the scale
I(2, 6) = | I(2) Q I(6)| I(3, 6) = | I(3) Q I(6)| …
Red-Green and Yellow-Blue
Center-surround Difference
Orientation Feature Maps
+R-G +G-R +G-R +B-Y +Y-B +Y-B +B-Y +B-Y
Same c and s as with intensity
) , ( ) , ( ) , , ( s O c O s c O RG(c, s) = | (R(c) - G(c)) Q (G(s) - R(s)) | BY(c, s) = | (B(c) - Y(c)) Q (Y(s) - B(s)) |
1. Normalization of map to range [0…M] 2. Compute average m of all local maxima 3. Find the global maximum M 4. Multiply the map by (M – m)2
Inhibition of return
Example of Operation:
http://disp.ee.ntu.edu.tw/class/saliencymap.