Multimodal Language Processing

Language is a user-friendly way for non-expert users to interact with robots, but the complexity of understanding natural language limits how well robots can do so. We developed Target-dependent UNITER, which uses multimodal information to understand object-fetching instructions, and it achieved the highest accuracy on a standard dataset.