Multi-Modal-Transformer

This repository collects multi-modal transformer architectures, including image transformers, video transformers, image-language transformers, video-language transformers, and self-supervised learning models. It also collects useful tutorials and tools in these domains.

235 Stars

32 Forks

235 Watchers

100 SrcLog Score

Cost to Build

$17.8K

Market Value

$35.0K

Growth over time

Stars, forks, and watchers: 7 data points, 2021-11-01 → 2026-04-01

How to clone Multi-Modal-Transformer

Clone via HTTPS

git clone https://github.com/junchen14/Multi-Modal-Transformer.git

Clone via SSH

git clone git@github.com:junchen14/Multi-Modal-Transformer.git

Download ZIP

Download master.zip
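For readers who prefer the command line to the download button, the ZIP can also be fetched with a short script. This is a minimal sketch, not an official procedure: it assumes GitHub's standard `archive/refs/heads/<branch>.zip` URL pattern and that `curl` and `unzip` are installed. The helper name `zip_url` and the `FETCH` switch are hypothetical names introduced here for illustration; the owner, repository, and `master` branch come from the clone instructions above.

```shell
#!/bin/sh
# Values taken from this page; adjust BRANCH if the default branch changes.
REPO_OWNER="junchen14"
REPO_NAME="Multi-Modal-Transformer"
BRANCH="master"

# zip_url (hypothetical helper): print the archive URL for owner, repo, branch,
# assuming GitHub's standard /archive/refs/heads/<branch>.zip pattern.
zip_url() {
    printf 'https://github.com/%s/%s/archive/refs/heads/%s.zip\n' "$1" "$2" "$3"
}

# Download and unpack only when FETCH=1 is set, so that merely sourcing
# this file has no side effects.
if [ "${FETCH:-0}" = "1" ]; then
    curl -L -o "$BRANCH.zip" "$(zip_url "$REPO_OWNER" "$REPO_NAME" "$BRANCH")"
    unzip -q "$BRANCH.zip"
fi
```

Saved under a name of your choosing (for example `fetch_zip.sh`), it could be run as `FETCH=1 sh fetch_zip.sh`. Unlike `git clone`, the ZIP contains only a snapshot of the branch, without commit history.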

Found an issue?

Report bugs or request features on the Multi-Modal-Transformer issue tracker on GitHub.

Similar to Multi-Modal-Transformer

go, TypeScript, ruby, crystal, awesome-cheatsheets, Eve, ChatterBot, awesome-nlp, solidity, proposals, Nim, red, scala-js, wren, lang, gravity, proselint, corpora, franc, haxe, enso, Carp, sdk, nlp_tasks, mimesis, moonscript, dmd, rant, coconut, slugify