2021
Learning Transferable Visual Models From Natural Language Supervision
CLIP explained: contrastive pre-training on 400M image-text pairs enables zero-shot image classification by matching images against natural-language descriptions of the candidate classes.
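To make the contrastive objective and the zero-shot step concrete, here is a minimal NumPy sketch. It assumes the image and text embeddings have already been produced by some pair of encoders (not shown), and it uses a fixed temperature of 0.07 for simplicity, whereas CLIP learns this value during training. The function names `clip_style_loss` and `zero_shot_predict` are illustrative and are not taken from any released code.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    # Scale each embedding to unit length so dot products are cosine similarities.
    return x / np.maximum(np.linalg.norm(x, axis=axis, keepdims=True), eps)

def log_softmax(logits, axis):
    # Numerically stable log-softmax along the given axis.
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))

def clip_style_loss(image_emb, text_emb, temperature=0.07):
    # image_emb, text_emb: (n, d) arrays where row i of each forms a matched pair.
    # temperature is fixed here as a simplifying assumption.
    img = l2_normalize(image_emb)
    txt = l2_normalize(text_emb)
    logits = img @ txt.T / temperature  # (n, n) pairwise similarities
    idx = np.arange(logits.shape[0])    # matched pairs sit on the diagonal
    loss_i2t = -log_softmax(logits, axis=1)[idx, idx].mean()  # image -> text
    loss_t2i = -log_softmax(logits, axis=0)[idx, idx].mean()  # text -> image
    return (loss_i2t + loss_t2i) / 2

def zero_shot_predict(image_emb, class_text_emb):
    # class_text_emb: (k, d) embeddings of one text prompt per class.
    # Each image is assigned the class whose prompt is most similar.
    sims = l2_normalize(image_emb) @ l2_normalize(class_text_emb).T
    return sims.argmax(axis=1)
```

In this sketch the loss is low when each image embedding is closest to its own caption and high when the pairs are mismatched. Classification needs no task-specific training: the class names are written as text prompts, embedded, and compared with the image embedding.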