Topic

bioinformatics

Repositories (1435)

cs-video-courses
cs-video-courses Developer-Y

List of Computer Science courses with video lectures.

80k
dash
dash plotly Python

Data Apps & Dashboards for Python. No JavaScript Required.

24.3k
claude-scientific-skills
claude-scientific-skills K-Dense-AI Python

A set of ready-to-use Agent Skills for research, science, engineering, analysis, finance and writing.

17.7k
biopython
biopython biopython Python

Official git repository for Biopython (originally converted from CVS)

5k
Awesome-Bioinformatics
Awesome-Bioinformatics danielecook

A curated list of awesome Bioinformatics libraries and software.

3.9k
awesome-single-cell
awesome-single-cell seandavi

Community-curated list of software packages and data resources for single-cell analysis, including RNA-seq, ATAC-seq, etc.

3.7k
deepvariant
deepvariant google Python

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

3.7k
nextflow
nextflow nextflow-io Groovy

A DSL for data-driven computational pipelines

3.3k
ColabFold
ColabFold sokrypton Jupyter Notebook

Making protein folding accessible to all!

2.7k
scanpy
scanpy scverse Python

Single-cell analysis in Python. Scales to >100M cells.

2.4k
fastp
fastp OpenGene C++

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

2.3k
minimap2
minimap2 lh3 C

A versatile pairwise aligner for genomic and spliced nucleotide sequences

2.2k
MMseqs2
MMseqs2 soedinglab C

MMseqs2: ultra fast and sensitive search and clustering suite

2k
scispacy
scispacy allenai Python

A full spaCy pipeline and models for scientific/biomedical documents.

1.9k
gatk
gatk broadinstitute Java

Official code repository for GATK versions 4 and up

1.9k
bioconda-recipes
bioconda-recipes bioconda Shell

Conda recipes for the bioconda channel.

1.8k
galaxy
galaxy galaxyproject Python

Data intensive science for everyone.

1.8k
bwa
bwa lh3 C

Burrows-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

1.7k
seqkit
seqkit shenwei356 Go

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

1.5k
seqtk
seqtk lh3 C

Toolkit for processing sequences in FASTA/Q formats

1.5k
awesome-ai-for-science
awesome-ai-for-science ai-boost

A curated list of awesome AI tools, libraries, papers, datasets, and frameworks that accelerate scientific discovery — from physics and chemistry to b...

1.4k
MultiQC
MultiQC MultiQC JavaScript

Aggregate results from bioinformatics analyses across many samples into a single report.

1.4k
getting-started-with-genomics-tools-and-resources
getting-started-with-genomics-tools-and-resources crazyhottommy Shell

Unix, R and Python tools for genomics and data science

1.4k
TDC
TDC mims-harvard Jupyter Notebook

Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science

1.2k
foldseek
foldseek steineggerlab C

Foldseek enables fast and sensitive comparisons of large structure sets.

1.2k
deep_gcns_torch
deep_gcns_torch lightaime Python

PyTorch repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000 (ICML'2021): https://www.deepgcns.org

1.2k
scikit-bio
scikit-bio scikit-bio Python

scikit-bio: a community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.

1.2k
graphein
graphein a-r-j Jupyter Notebook

Protein Graph Library

1.2k
csvtk
csvtk shenwei356 Go

A cross-platform, efficient and practical CSV/TSV toolkit in Golang

1.2k
DeepPurpose
DeepPurpose kexinhuang12345 Jupyter Notebook

A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)

1.1k
scipipe
scipipe scipipe Go

Robust, flexible and resource-efficient pipelines using Go and the commandline

1.1k
rush
rush shenwei356 Go

A cross-platform command-line tool for executing jobs in parallel

1.1k
react-plotly.js
react-plotly.js plotly JavaScript

A plotly.js React component from Plotly 📈

1.1k
ncbi-genome-download
ncbi-genome-download kblin Python

Scripts to download genomes from the NCBI FTP servers

1.1k
pyCirclize
pyCirclize moshi4 Python

Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)

1.1k
cromwell
cromwell broadinstitute Scala

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environme...

1.1k
hail
hail hail-is Python

Cloud-native genomic dataframes and batch computing

1.1k
adam
adam bigdatagenomics Scala

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

1k
GWA_tutorial
GWA_tutorial MareesAT

A comprehensive tutorial about GWAS and PRS

975
KG_RAG
KG_RAG BaranziniLab Jupyter Notebook

Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

941
omicverse
omicverse Starlitnightly Python

A Python library for multi-omics, including bulk, single-cell and spatial RNA-seq analysis.

941
biotite
biotite biotite-dev Python

A comprehensive library for computational molecular biology

937
htslib
htslib samtools C

C library for high-throughput sequencing data formats

913
jcvi
jcvi tanghaibao Python

Python library to facilitate genome assembly, annotation, and comparative genomics

898
pysam
pysam pysam-developers Cython

Pysam is a Python package for reading, manipulating, and writing genomics data such as SAM/BAM/CRAM and VCF/BCF files. It's a lightweight wrapper of t...

886
salmon
salmon COMBINE-lab C++

🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment

876
awesome-cheminformatics
awesome-cheminformatics hsiaoyi0504

A curated list of Cheminformatics libraries and software.

853
wdl
wdl openwdl

Specification for the Workflow Description Language (WDL).

851
bwa-mem2
bwa-mem2 bwa-mem2 C++

The next version of bwa-mem

826
dgl-lifesci
dgl-lifesci awslabs Python

Python package for graph neural networks in chemistry and biology

798