Topic

bioinformatics

Repositories (1229)

cs-video-courses Developer-Y

List of Computer Science courses with video lectures.

69.7k
dash plotly Python

Data Apps & Dashboards for Python. No JavaScript Required.

22k
biopython biopython Python

Official git repository for Biopython (originally converted from CVS)

3.7k
deepvariant google Python

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

3.5k
Awesome-Bioinformatics danielecook

A curated list of awesome Bioinformatics libraries and software.

3.3k
nextflow nextflow-io Groovy

A DSL for data-driven computational pipelines

3.1k
awesome-single-cell seandavi

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

2.6k
scanpy scverse Python

Single-cell analysis in Python. Scales to >100M cells.

2.1k
fastp OpenGene C++

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

1.5k
bioconda-recipes bioconda Shell

Conda recipes for the bioconda channel.

1.5k
minimap2 lh3 C

A versatile pairwise aligner for genomic and spliced nucleotide sequences

1.5k
gatk broadinstitute Java

Official code repository for GATK versions 4 and up

1.5k
seqkit shenwei356 Go

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

1.5k
scispacy allenai Python

A full spaCy pipeline and models for scientific/biomedical documents.

1.4k
bwa lh3 C

Burrows-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

1.3k
seqtk lh3 C

Toolkit for processing sequences in FASTA/Q formats

1.2k
galaxy galaxyproject Python

Data intensive science for everyone.

1.1k
csvtk shenwei356 Go

A cross-platform, efficient and practical CSV/TSV toolkit in Golang

1.1k
deep_gcns_torch lightaime Python

PyTorch repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000 (ICML'2021): https://www.deepgcns.org

1.1k
TDC mims-harvard Jupyter Notebook

Therapeutics Commons (TDC-2): Multimodal Foundation for Therapeutic Science

1.1k
react-plotly.js plotly JavaScript

A plotly.js React component from Plotly 📈

1k
cromwell broadinstitute Scala

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environme...

1k
rush shenwei356 Go

A cross-platform command-line tool for executing jobs in parallel

1k
adam bigdatagenomics Scala

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

1k
MultiQC ewels Python

Aggregate results from bioinformatics analyses across many samples into a single report.

1k
MMseqs2 soedinglab C

MMseqs2: ultra fast and sensitive search and clustering suite

1k
scipipe scipipe Go

Robust, flexible and resource-efficient pipelines using Go and the commandline

1k
getting-started-with-genomics-tools-and-resources crazyhottommy Shell

Unix, R and Python tools for genomics and data science

931
pyCirclize moshi4 Python

Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)

920
GWA_tutorial MareesAT

A comprehensive tutorial about GWAS and PRS

906
hail hail-is Python

Cloud-native genomic dataframes and batch computing

885
graphein a-r-j Jupyter Notebook

Protein Graph Library

856
nucleus google C++

Python and C++ code for reading and writing genomics data.

791
DeepPurpose kexinhuang12345 Jupyter Notebook

A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)

776
ncbi-genome-download kblin Python

Scripts to download genomes from the NCBI FTP servers

773
awesome-cheminformatics hsiaoyi0504

A curated list of Cheminformatics libraries and software.

730
containers BioContainers Dockerfile

Bioinformatics containers

727
khmer dib-lab Python

In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more

717
seq seq-lang C++

A high-performance, Pythonic language for bioinformatics

704
htslib samtools C

C library for high-throughput sequencing data formats

695
wdl openwdl Java

Workflow Description Language - Specification and Implementations

666
salmon COMBINE-lab C++

🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment

649
DnaFeaturesViewer Edinburgh-Genome-Foundry Python

👁 Python library to plot DNA sequence features (e.g. from GenBank files)

639
biopandas BioPandas Python

Working with molecular structures in pandas DataFrames

620
bwa-mem2 bwa-mem2 C++

The next version of bwa-mem

614
jcvi tanghaibao Python

Python library to facilitate genome assembly, annotation, and comparative genomics

596
clinker gamcil Python

Gene cluster comparison figure generator

594
dgl-lifesci awslabs Python

Python package for graph neural networks in chemistry and biology

593
deepTools deeptools Python

Tools to process and analyze deep sequencing data.

582
vsearch torognes C++

Versatile open-source tool for microbiome analysis

578