2023
I-JEPA: Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
How I-JEPA learns visual representations by predicting the abstract feature representations of masked image regions, with no pixel reconstruction and no hand-crafted data augmentations, reaching 81.7% linear-probe accuracy with ViT-H.
