Efficient Visual Learning for Scene Understanding