Zoom better to see clearer: Human and object parsing with hierarchical auto-zoom net