Most popular bioinformatics repositories and open source projects

globalbioticinteractions globalbioticinteractions Java

Global Biotic Interactions provides access to existing species interaction datasets

145 18 19

ilus ShujiaHuang Python

A lightweight and handy variant calling pipeline generator for whole-genome sequencing (WGS) and whole exom sequencing data (WES) analysis by using GA...

145 36 3

FABind QizhiPei Python

FABind: Fast and Accurate Protein-Ligand Binding (NeurIPS 2023)

144 19 3

awesome-expression-browser federicomarini

😎 A curated list of software and resources for exploring and visualizing (browsing) expression data 😎

144 40 13

ksw2 lh3 C

Global alignment and alignment extension

143 28 12

Deep-Learning-for-Clustering-in-Bioinformatics rezacsedu Jupyter Notebook

Deep Learning-based Clustering Approaches for Bioinformatics

143 34 3

usearch12 rcedgar C++

Open-source usearch

142 20 7

best google Rust

Bam Error Stats Tool (best): analysis of error types in aligned reads.

142 14 9

block-aligner Daniel-Liu-c0deb0t Jupyter Notebook

SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive b...

142 10 6

bio-forge TKanX Rust

A high-performance, pure Rust toolkit for standardizing and preparing biomolecular systems (proteins & nucleic acids). It heals missing atoms, resolve...

141 7 1

RNAlysis GuyTeichman Python

Analyze your RNA sequencing data without writing a single line of code

140 14 4

gbdraw satoshikawato Python

A genome diagram generator for microbes and organelles

140 7 2

Kun-peng eric9n Rust

Kun-peng: an ultra-fast, low-memory footprint and accurate taxonomy classifier for all

140 14 10

fqtools alastair-droop C

An efficient FASTQ manipulation suite

139 19 5

pybel pybel Python

🌶️ An ecosystem in Python for working with the Biological Expression Language (BEL)

139 34 8

BWA-MEME kaist-ina C++

BWA-MEME: Faster BWA-MEM2 using learned-index

139 16 7

plascad David-OConnor Rust

Plasmid and primer design software

138 3 4

pandora genular Vue

PANDORA :computer:

138 19 9

MSnbase lgatto R

Base Classes and Functions for Mass Spectrometry and Proteomics

137 46 12

CalliNGS-NF CRG-CNAG Nextflow

GATK RNA-Seq Variant Calling in Nextflow

137 49 8

arpeggio PDBeurope Python

Calculation of interatomic interactions in molecular structures

137 23 5

sinto timoast Python

Tools for single-cell data processing

137 26 1

svtyper hall-lab Python

Bayesian genotyper for structural variants

136 57 9

Multi-BioNER yuzhimanhua Python

Cross-type Biomedical Named Entity Recognition with Deep Multi-task Learning (Bioinformatics'19)

136 27 8

cogent3 cogent3 Python

Comparative Genomics Toolkit 3

136 67 9

pairtools open2c Python

Extract 3D contacts (.pairs) from sequencing alignments

135 38 9

mcp-for-patent-literature patsnap

MCP server for 200M+ patents, scientific literature, chemistry and pharma records. Search prior art and R&D intelligence powered by PatSnap's propriet...

135 3 1

GenoMAS Liu-Hy Python

A minimalist multi-agent framework for rubost automation of scientific analysis workflows, such as gene expression analysis.

135 24 12

apbs Electrostatics C

Software for biomolecular electrostatics and solvation calculations

135 32 12

mygene.info biothings Python

MyGene.info: A BioThings API for gene annotations

134 22 16

ontobio biolink Python

python library for working with ontologies and ontology associations

133 33 18

AgentFigureGallery Dsadd4 Python

Drop-in scientific plotting skill for Claude Code, Codex, Cursor, and other coding agents.

133 1 0

Terpene-Profile-Parser-for-Cannabis-Strains MaxValue Python

Parser and database to index the terpene profile of different strains of Cannabis from online databases

133 18 17

full_spectrum_bioinformatics zaneveld Jupyter Notebook

An open-access bioinformatics text

133 35 3

apbs-pdb2pqr Electrostatics

APBS - software for biomolecular electrostatics and solvation

132 62 16

ctakes apache Java

Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.

132 26 11

GeneFuse OpenGene C

Gene fusion detection and visualization

132 60 12

MSFragger Nesvilab HTML

Ultrafast, comprehensive peptide identification for mass spectrometry–based proteomics

132 12 13

pymol-color-alphafold cbalbin-bio Python

PyMOL extension to color AlphaFold structures by confidence (pLDDT).

132 29 3

BixBench Future-House Python

Benchmark for LLM-based Agents in Computational Biology

132 27 8

blasr PacificBiosciences C++

BLASR: The PacBio® long read aligner

131 79 131

dockstore dockstore Java

An app store for scientific workflows, tools, notebooks, and services

131 29 18

ropebwt3 lh3 C

BWT construction and search

129 9 7

cute-nucleotides Daniel-Liu-c0deb0t Rust

Cute tricks for SIMD vectorized binary encoding and decoding of nucleotides, in Rust.

129 4 4

panseg kreshuklab Python

A tool for cell instance aware segmentation in densely packed 3D volumetric images

129 42 7

SquiggleKit Psy-Fer Python

SquiggleKit: A toolkit for manipulating nanopore signal data

128 23 12

pangraph neherlab C

A bioinformatic toolkit to align genome assemblies into pangenome graphs

128 6 5

seqfu2 telatin Nim

:rocket: seqfu - Sequece Fastx Utilities

128 9 3

philosopher Nesvilab Go

PeptideProphet, PTMProphet, ProteinProphet, iProphet, Abacus, and FDR filtering

128 21 13

MicrobiomeBestPracticeReview grimmlab Shell

Current Challenges and Best Practice Protocols for Microbiome Analysis using Amplicon and Metagenomic Sequencing

128 39 3

bioinformatics

Repositories (1466)