Most popular bioinformatics repositories and open source projects

diffxpy

Differential expression analysis for single-cell RNA-seq data.

22   161   161  

IMGTHLA

Github for files currently published in the IPD-IMGT/HLA FTP Directory...

59   161   161  

pegasus

Pegasus Workflow Management System - Automate, recover, and debug scie...

68   157   157  

readfq

Fast multi-line FASTA/Q reader in several programming languages

63   156   156  

sgas

SGAS: Sequential Greedy Architecture Search (CVPR'2020) https://www.de...

27   154   154  

p2rank

P2Rank: Protein-ligand binding site prediction tool based on machine l...

28   153   153  

bigsnpr

R package for the analysis of massive SNP arrays.

40   152   152  

hts-nim

nim wrapper for htslib for parsing genomics data files

25   151   151  

kmer-cnt

Code examples of fast and simple k-mer counters for tutorial purposes

11   151   151  

pyGenomeViz

A genome visualization python package for comparative genomics

12   149   149  

graphtyper

Population-scale genotyping using pangenome graphs

21   148   148  

jbrowse-components

Source code for JBrowse 2, a modern React-based genome browser

50   148   148  

TOBIAS

Transcription factor Occupancy prediction By Investigation of ATAC-seq...

30   147   147  

cgranges

A C/C++ library for fast interval overlap queries (with a "bedtools co...

17   144   144  

indra

INDRA (Integrated Network and Dynamical Reasoning Assembler) is an aut...

55   142   142  

shinyCircos

an R/shiny application for creation of Circos plot interactively

47   142   142  

GenomicSQLite

Genomics Extension for SQLite

8   141   141  

pymzML

pymzML - an interface between Python and mzML Mass spectrometry Files

90   141   141  

bioinformatics-workbook

Bioinformatics Workbook repository

76   141   141  

chromap

Fast alignment and preprocessing of chromatin profiles

17   141   141  

ngless

NGLess: NGS with less work

24   140   140  

awesome-vdj

📚 Tools and databases for analyzing HLA and VDJ genes.

22   140   140  

biowasm

WebAssembly modules for genomics

13   139   139  

PyComplexHeatmap

PyComplexHeatmap: A Python package to plot complex heatmap (clustermap...

17   139   139  

ariba

Antimicrobial Resistance Identification By Assembly

50   138   138  

InSilicoSeq

:rocket: A sequencing simulator

31   138   138  

docker-builds

:package: :whale: Dockerfiles and documentation on tools for public he...

96   138   138  

MutScan

Detect and visualize target mutations by scanning FastQ files directly

39   137   137  

metaeuk

MetaEuk - sensitive, high-throughput gene discovery and annotation for...

25   136   136  

awesome-nanopore

A curated list of awesome nanopore analysis tools.

27   136   136  

YouTubeTutorials

80   136   136  

awesome-small-molecule-ml

A curated list of resources for machine learning for small-molecule dr...

24   134   134  

bio

Bioinformatics library for .NET

46   133   133  

svtools

Tools for processing and analyzing structural variants.

55   133   133  

MethylDackel

A (mostly) universal methylation extractor for BS-seq experiments.

40   133   133  

BioSequences.jl

Biological sequences for the julia language

47   132   132  

alphapept

A modular, python-based framework for mass spectrometry. Powered by nb...

29   132   132  

ComputationalGenomicsManual

Robs manual for the computational genomics and bioinformatics class.

51   132   132  

blasr

BLASR: The PacBio® long read aligner

79   131   131  

psmc

Implementation of the Pairwise Sequentially Markovian Coalescent (PSMC...

56   131   131  

pydna

Clone with Python! Data structures for double stranded DNA & simulatio...

39   131   131  

OBOFoundry.github.io

Metadata and website for the Open Bio Ontologies Foundry Ontology Regi...

198   130   130  

mag

Assembly and binning of metagenomes

81   130   130  

rasusa

Randomly subsample sequencing reads to a specified coverage

14   130   130  

metaGEM

:gem: An easy-to-use workflow for generating context specific genome-s...

33   130   130  

SigProfilerExtractor

SigProfilerExtractor allows de novo extraction of mutational signature...

45   129   129  

harmonypy

🎼 Integrate multiple high-dimensional datasets with fuzzy k-means and...

21   129   129  

weblogo

WebLogo 3: Sequence Logos redrawn

37   128   128  

VariantSpark

machine learning for genomic variants

44   127   127  

Multi-BioNER

Cross-type Biomedical Named Entity Recognition with Deep Multi-task Le...

27   127   127