Open Source Computer Vision Library
We write your reusable computer vision tools. 💜
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Open source comprehensive 2D content creation tool suite for graphic design, digital art, and interactive real-time motion graphics — featuring node-b...
Rembg is a tool to remove images background
CVPR 2026 论文和开源项目合集
🌊 A flexible and fun JavaScript file upload library
pix2tex: Using a ViT to convert images of equations into LaTeX code.
ImageMagick is a free, open-source software suite for creating, editing, converting, and displaying images. It supports 200+ formats and offers powerf...
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
JavaScript image cropper.
Python Imaging Library (fork)
Content aware image cropping
🏞 A lightweight, versatile image viewer
Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A fast image processing library with low memory needs.
🐍 Geometric Computer Vision Library for Spatial AI
Fast and secure standalone server for resizing, processing, and converting images on the fly
Content aware image resize library
An Android transformation library providing a variety of image transformations for Glide.
Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one image to anoth...
Best Practices, code samples, and documentation for Computer Vision.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Self-hosted collection of powerful web-based tools for everyday tasks. No ads, no tracking, just fast, accessible utilities right from your browser!
Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and training.
Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
:camera: A modern, cross-platform, 2D Graphics library for .NET
⚠️ [Deprecated] No longer maintained, please use https://github.com/fengyuanchen/jquery-cropper
Go package for computer vision using OpenCV 4 and beyond. Includes support for DNN, CUDA, OpenCV Contrib, and OpenVINO.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model...
a cross-platform image super-resolution tool
a language for fast, portable data-parallel computation
Image processing in Python
A beautiful, non-destructive, and GPU-accelerated RAW image editor built with performance in mind.
Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices
AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)
Fast, simple, scalable, Docker-ready HTTP microservice for high-level image processing
👷 Build images with images
OpenCV wrapper for .NET
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training...
Blurry is an easy blur library for Android
Thumbnailator - a thumbnail generation library for Java
Keras model to generate HTML code from hand-drawn website mockups. Implements an image captioning architecture to drawn source images.
A view controller for iOS that allows users to crop portions of UIImage objects
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.