Abstract: Enabling robots to perform tool manipulation like humans remains a great challenge. A semantic understanding of tool affordances and precise spatial localization is essential for this task.