It’s conferences season again, and this time I had the pleasure of attending the International Conference on Computer Vision (ICCV) in Santiago, Chile. I couldn’t have picked a better destination myself, it was very surreal to experience 30 degrees heat in the middle of winter, whilst still being able to see snowy-topped mountains in the distance.
Santiago, Chile
Conferences are a great way to re-connect with old friends and possibly make some new ones whilst spending a week together discussing ideas from all aspects of computer vision and machine learning. Below are some of my highlights from the trip, as well as (in my own opinion) some of the stand-out papers to read.
The plenary talk was given by Stephen Boyd, author of the book on convex optimization. Stephen was showcasing cvxpy, an open-source, python-embedded modelling language for convex optimization problems of small to medium size (1-100k variables). The main idea behing cvxpy is that no tweaking of the algorithm is necessary, you just provide your cost function and constraints, and if it is convex you will get the solution. Stephen was a very entertaining speaker, and clearly very passionate about convex optimization, his main take aways are that, if you are doing convex optimization, we are almost at a state where we can treat the solution to these problems as a technology, i.e. just using software without understanding how the solution works. Cvxpy is a big step towards this becomming a technology, and I reccomend you check it out if this is an area that interests you, as it seemed extremely easy to write a solver for small convex problems. However, as you can imagine, deep learning is still the major trend within the community and these are highly non-convex problems, so we still see many papers with different tricks for convergence of these nets.
There were also some very good demos this year, a few of my favourite were:
- Scalable Color Sketch based Search of Millions of Images
[YouTube] - Showcasing SegNet: A Deep Encoder-Decoder Architecture for Real-Time Road Scene Segmentation
[PDF] [Demo] - (A shameless self plug) In the workshop on Recovering 6D Object Pose there was a demo of my previous work on 3D object pose estimation which was estimating object pose in the bin-picking scenario, running at a reasonable frame-rate on a laptop.
[PDF] [YouTube] [Workshop Challenges]
There were many interesting papers presented at the conference and I encourage you to check out the ICCV pocket guide to get a full listing. However, below is a small list of some of my favourite papers from the conference. I hope you enjoy reading them too :)
-
P. Kontschieder, M. Fiterau, A. Criminisi, S. R. Bulò
Deep Neural Decision Forests (best paper award)
[PDF] -
M. Paulin, M. Douze, Z. Harchaoui, J, Mairal, F. Perronnin, C. Schmid
Local Convolutional Features with Unsupervised Training for Image Retrieval
[PDF] -
K. He, X. Zhang, S. Ren, J. Sun
Delving deep into rectifiers: Surpassing human level performance on ImageNet classification
[PDF] -
R. Girshick
Fast R-CNN
[PDF] -
C. Doersch, A. Gupta, A. Efros
Unsupervised Visual Representation Learning by Context Prediction
[PDF] -
G. Hu, Y. Yang, J. Kittler, W. Christmas, S. Li, T. Hospedales
When Face Recognition meets with Deep Learning: an Evaluation of Convolutional Nueral Networks for Face Recognition [PDF]
Additonally, although not presented at ICCV, these are some other papers that were presented recently (at NIPS) or have recently popped up on arxiv that I think are very interesting.
-
K. He, X. Zhang, S. Ren, J. Sun
Deep Residual Learning for Image Recognition (ImageNet and COCO winners 2015)
[PDF] -
P. Krähenbühl, C. Doersch, J. Donahue, T. Darrell
Data-Dependent Initializations of Convolutional Neural Networks
[PDF] -
S. Ren K. He, R. Girshick, J. Sun
Faster R-CNN
[PDF]
</li>
-
C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens
Rethinking the Inception Architecture for Computer Vision
[PDF]
</ol>