Topic

bioinformatics

Repositories (1438)

scikit-fingerprints
scikit-fingerprints MLCIL Python

Scikit-learn compatible library for molecular fingerprints and chemoinformatics

362
amr
amr ncbi C++

AMRFinderPlus - Identify AMR genes and point mutations, and virulence and stress resistance genes in assembled bacterial nucleotide and protein sequen...

359
awesome-bioinformatics-benchmarks
awesome-bioinformatics-benchmarks j-andrews7

A curated and summarized list of bioinformatics bench-marking papers and resources.

357
rapids-singlecell
rapids-singlecell scverse Python

rapids-singlecell: GPU-accelerated framework for scRNA analysis

357
BioClaw
BioClaw Runchuan-BU TypeScript

AI-Powered Bioinformatics Research Assistant. Built on OpenClaw.

356
pysradb
pysradb saketkc Python

Package for fetching metadata and downloading data from SRA/ENA/GEO

354
miniasm
miniasm lh3 TeX

Ultrafast de novo assembly for long noisy reads (though having no consensus step)

353
scgen
scgen theislab Python

Single cell perturbation prediction

342
immunarch
immunarch immunomind R

🧬 immunarch [R package] – Multi-Modal Immune Repertoire Analytics for Immunotherapy and Vaccine Design

338
Blacklist
Blacklist Boyle-Lab C++

Application for making ENCODE Blacklists

337
drep
drep MrOlm Python

Rapid comparison and dereplication of genomes

336
genometools
genometools genometools C

GenomeTools genome analysis system.

335
DRAM
DRAM WrightonLabCSU Python

Distilled and Refined Annotation of Metabolism: A tool for the annotation and curation of function for microbial and viral genomes

331
abyss
abyss BirolLab C++

:microscope: Assemble large genomes using short reads

328
pyGeno
pyGeno tariqdaouda Python

Personalized Genomics and Proteomics. Main diet: Ensembl, side dishes: SNPs

327
awesome-cancer-variant-resources
awesome-cancer-variant-resources seandavi

A community-maintained repository of cancer clinical knowledge bases and databases focused on cancer variants.

327
homebrew-bio
homebrew-bio brewsci Ruby

:beer::microscope: Bioinformatics formulae for the Homebrew package manager (macOS and Linux)

324
octopus
octopus luntergroup C++

Bayesian haplotype-based mutation calling

323
bioperl-live
bioperl-live bioperl Perl

Core BioPerl 1.x code

316
bionode
bionode bionode JavaScript

Modular and universal bioinformatics

313
somalier
somalier brentp Nim

fast sample-swap and relatedness checks on BAMs/CRAMs/VCFs/GVCFs... "like damn that is one smart wine guy"

312
canvasXpress
canvasXpress neuhausi R

CanvasXpress: A JavaScript Library for Data Analytics with Full Audit Trail Capabilities.

311
marsilea
marsilea Marsilea-viz Python

Declarative creation of composable visualization for Python (Complex heatmap, Upset plot, Oncoprint and more~)

310
awesome-nanopore
awesome-nanopore GoekeLab

A curated list of awesome nanopore analysis tools.

309
parallel-fastq-dump
parallel-fastq-dump rvalieris Python

parallel fastq-dump wrapper

306
deepsomatic
deepsomatic google

DeepSomatic is an analysis pipeline that uses a deep neural network to call somatic variants from tumor-normal and tumor-only sequencing data.

303
kaiju
kaiju bioinformatics-centre C

Fast taxonomic classification of metagenomic sequencing reads using a protein reference database

302
tools
tools nf-core Python

Python package with helper tools for the nf-core community.

302
Ribbon
Ribbon MariaNattestad JavaScript

A genome browser designed for complex structural variants and long reads.

301
sequenceserver
sequenceserver wurmlab JavaScript

Intuitive graphical web interface for running BLAST bioinformatics tool (i.e. have your own custom NCBI BLAST site!)

300
biocypher
biocypher biocypher Python

A unifying framework for biomedical research knowledge graphs

299
PyWGCNA
PyWGCNA mortazavilab Jupyter Notebook

PyWGCNA is a Python package designed to do Weighted Gene Correlation Network analysis (WGCNA)

298
PPanGGOLiN
PPanGGOLiN labgem Python

Build a partitioned pangenome graph from microbial genomes

297
pyfastx
pyfastx lmdu C

a python package for fast random access to sequences from plain and gzipped FASTA/Q files

293
smudgeplot
smudgeplot KamilSJaron C

Inference of ploidy and heterozygosity structure using whole genome sequencing data

293
gcp-for-bioinformatics
gcp-for-bioinformatics lynnlangit Jupyter Notebook

GCP for Bioinformatics Researchers

293
sortmerna
sortmerna sortmerna C++

SortMeRNA: next-generation sequence filtering and alignment tool

292
bionumpy
bionumpy bionumpy Python

Python library for array programming on biological datasets. Documentation available at: https://bionumpy.github.io/bionumpy/

290
mag
mag nf-core Nextflow

Assembly and binning of metagenomes

289
sage
sage lazear Rust

Proteomics search & quantification so fast that it feels like magic

288
hgvs
hgvs biocommons Python

Python library to parse, format, validate, normalize, and map sequence variants according to HGVS Nomenclature (https://hgvs-nomenclature.org/).

288
medical-research-skills
medical-research-skills aipoch Python

Hundreds of agent skills for medical research, including protocol design, data analysis, evidence insights, and academic writing.

285
bioinformatics-workflows
bioinformatics-workflows GoekeLab Python

minimal example implementations for bioinformatics workflow managers

285
wgsim
wgsim lh3 C

Reads simulator

283
jbrowse-components
jbrowse-components GMOD TypeScript

Source code for JBrowse 2, a modern React-based genome browser

282
DeepECG
DeepECG ismorphism Python

ECG classification programs based on ML/DL methods

281
snp-sites
snp-sites sanger-pathogens C

Finds SNP sites from a multi-FASTA alignment file

278
muscle
muscle rcedgar C++

Multiple sequence and structure alignment with top benchmark scores scalable to thousands of sequences. Generates replicate alignments, enabling asses...

278
ProteinFlow
ProteinFlow adaptyvbio Python

Versatile computational pipeline for processing protein structure data for deep learning applications.

277
serratus
serratus ababaian Jupyter Notebook

Ultra-deep search for novel viruses

276