List of Computer Science courses with video lectures.
Data Apps & Dashboards for Python. No JavaScript Required.
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
Official git repository for Biopython (originally converted from CVS)
A curated list of awesome Bioinformatics libraries and software.
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
A DSL for data-driven computational pipelines
Making Protein folding accessible to all!
Single-cell analysis in Python. Scales to >100M cells.
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
A versatile pairwise aligner for genomic and spliced nucleotide sequences
MMseqs2: ultra fast and sensitive search and clustering suite
A full spaCy pipeline and models for scientific/biomedical documents.
Official code repository for GATK versions 4 and up
Conda recipes for the bioconda channel.
Data intensive science for everyone.
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
A cross-platform and ultrafast toolkit for FASTA/Q file manipulation
Toolkit for processing sequences in FASTA/Q formats
A curated list of awesome AI tools, libraries, papers, datasets, and frameworks that accelerate scientific discovery — from physics and chemistry to b...
Aggregate results from bioinformatics analyses across many samples into a single report.
Unix, R and python tools for genomics and data science
Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science
Foldseek enables fast and sensitive comparisons of large structure sets.
Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2021): https://www.deepgcns.org
scikit-bio: a community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.
Protein Graph Library
A cross-platform, efficient and practical CSV/TSV toolkit in Golang
A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)
Robust, flexible and resource-efficient pipelines using Go and the commandline
A cross-platform command-line tool for executing jobs in parallel
A plotly.js React component from Plotly 📈
Scripts to download genomes from the NCBI FTP servers
Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environme...
Cloud-native genomic dataframes and batch computing
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
A comprehensive tutorial about GWAS and PRS
Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks
A python library for multi omics included bulk, single cell and spatial RNA-seq analysis.
A comprehensive library for computational molecular biology
C library for high-throughput sequencing data formats
Python library to facilitate genome assembly, annotation, and comparative genomics
Pysam is a Python package for reading, manipulating, and writing genomics data such as SAM/BAM/CRAM and VCF/BCF files. It's a lightweight wrapper of t...
🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment
A curated list of Cheminformatics libraries and software.
Specification for the Workflow Description Language (WDL).
The next version of bwa-mem
Python package for graph neural networks in chemistry and biology