Exploring Plain Vision Transformer Backbones for Object Detection
Investigating how well plain, non-hierarchical Vision Transformers work as backbones for object detection, and proposing minimal adaptations, such as a simple feature pyramid built from the backbone's single-scale output and window attention with a few cross-window blocks, that make them competitive with hierarchical backbones.
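As a rough illustration of the core idea, here is a minimal PyTorch sketch (not the paper's reference code) of building a multi-scale feature pyramid from the single stride-16 feature map a plain ViT produces; the layer choices and dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleFeaturePyramid(nn.Module):
    """Turn a plain ViT's single stride-16 feature map into multi-scale maps
    (strides 4, 8, 16, 32) using deconvolutions and pooling. Sketch only."""

    def __init__(self, dim=768, out_dim=256):
        super().__init__()
        # stride 16 -> 4: two successive 2x upsampling deconvolutions
        self.up4 = nn.Sequential(
            nn.ConvTranspose2d(dim, dim // 2, kernel_size=2, stride=2),
            nn.GELU(),
            nn.ConvTranspose2d(dim // 2, dim // 4, kernel_size=2, stride=2),
        )
        # stride 16 -> 8: one 2x upsampling deconvolution
        self.up8 = nn.ConvTranspose2d(dim, dim // 2, kernel_size=2, stride=2)
        # 1x1 convolutions project every level to a common channel width
        self.lateral = nn.ModuleList(
            [nn.Conv2d(c, out_dim, kernel_size=1) for c in (dim // 4, dim // 2, dim, dim)]
        )

    def forward(self, x):
        # x: (B, dim, H/16, W/16), the last feature map of the ViT backbone
        feats = [self.up4(x), self.up8(x), x, F.max_pool2d(x, kernel_size=2)]
        return [lat(f) for lat, f in zip(self.lateral, feats)]


# Example: a 1024x1024 image gives a 64x64 stride-16 map from a ViT-B backbone
pyramid = SimpleFeaturePyramid(dim=768, out_dim=256)
outs = pyramid(torch.randn(1, 768, 64, 64))
print([tuple(o.shape) for o in outs])  # strides 4, 8, 16, 32
```

A standard detection head (e.g., Mask R-CNN) would then consume these four maps in place of an FPN built from a hierarchical backbone.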
Introducing DETR, a novel end-to-end object detection framework that leverages Transformers to directly predict a set of object bounding boxes.
Vision Transformer (ViT) explained: how splitting images into 16x16 patches enables pure transformer architecture for state-of-the-art image recognition.
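For reference, a minimal sketch of that patchify step: a 16x16 convolution with stride 16 is the common equivalent of cutting the image into non-overlapping 16x16 patches and linearly projecting each one to a token (positional embeddings and the class token are omitted here).

```python
import torch
import torch.nn as nn

class PatchEmbed(nn.Module):
    """Split an image into 16x16 patches and project each patch to a token.
    A 16x16 conv with stride 16 equals flattening each patch and applying a
    shared linear layer."""

    def __init__(self, patch=16, in_chans=3, dim=768):
        super().__init__()
        self.proj = nn.Conv2d(in_chans, dim, kernel_size=patch, stride=patch)

    def forward(self, img):
        # img: (B, 3, H, W) -> tokens: (B, N, dim) with N = (H/16) * (W/16)
        x = self.proj(img)                   # (B, dim, H/16, W/16)
        return x.flatten(2).transpose(1, 2)  # (B, N, dim)


tokens = PatchEmbed()(torch.randn(1, 3, 224, 224))
print(tokens.shape)  # torch.Size([1, 196, 768]) -- 14*14 patches for a 224x224 image
```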
Survey of transformer inference optimization: pruning, quantization, knowledge distillation, neural architecture search, and hardware acceleration.
Swin Transformer: hierarchical Vision Transformer using shifted windows for efficient image classification, object detection, and segmentation.
Deep dive into the Transformer architecture that revolutionized NLP. Understand self-attention, multi-head attention, and positional encoding.
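A compact sketch of the scaled dot-product self-attention at the heart of that architecture, with tensor shapes noted in the comments; a full multi-head version would split the model dimension into heads and add a learned output projection. The weight matrices here are stand-ins for illustration, not a particular library's API.

```python
import math
import torch

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a token sequence x: (B, N, d)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v                         # each (B, N, d)
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.shape[-1])   # (B, N, N)
    weights = scores.softmax(dim=-1)                            # each row sums to 1
    return weights @ v                                          # (B, N, d)


d = 64
x = torch.randn(2, 10, d)
w_q, w_k, w_v = [torch.randn(d, d) / d**0.5 for _ in range(3)]
print(self_attention(x, w_q, w_k, w_v).shape)  # torch.Size([2, 10, 64])
```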
Analysis of transformer performance bottlenecks caused by data movement. Learn optimization strategies for memory-bound operations on GPUs.
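A back-of-the-envelope way to see why such operations are memory-bound is to compare an operation's arithmetic intensity (FLOPs per byte moved) against the GPU's compute-to-bandwidth ratio. The hardware figures below are approximate, A100-class values used only for illustration.

```python
# Rough roofline-style check: an op is memory-bound when its arithmetic
# intensity (FLOPs per byte moved) falls below the hardware's ridge point.
peak_flops = 312e12          # ~peak FP16 tensor throughput (approximate)
mem_bw = 1.5e12              # ~HBM bandwidth in bytes/s (approximate)
ridge = peak_flops / mem_bw  # ~208 FLOPs per byte

# Elementwise add of two fp16 tensors with n elements: a typical memory-bound op.
n = 1 << 26
flops = n                        # one add per element
bytes_moved = 3 * n * 2          # read two inputs, write one output, 2 bytes each
intensity = flops / bytes_moved  # ~0.17 FLOPs/byte, far below the ridge point
print(f"intensity={intensity:.2f} FLOPs/B, ridge~{ridge:.0f} -> memory-bound")
```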