learning to perform a task using a limited number of examples from a single task distribution.
Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
OAG-BERT is a heterogeneous entity-augmented academic language model which not only understands academic texts but also heterogeneous entity knowledge in OAG.
CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition