Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
Implementation of TabTransformer, attention network for tabular data, in Pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Transformer based on a variant of attention that is linear complexity in respect to sequence length
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
A concise but complete implementation of CLIP with various experimental improvements from recent papers
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Implementation of Bottleneck Transformer in Pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Implementation of the Point Transformer layer, in Pytorch
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks, out of Tsinghua / Ant group
Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch