Topic

bioinformatics

Repositories (1438)

APPT
APPT Bindwell Python

Affinity Protein-Protein Transformers—State of the art protein-protein binding affinity in seconds!

108
alphadia
alphadia MannLabs Jupyter Notebook

modular & open DIA search

108
configs
configs nf-core Nextflow

Config files used to define parameters specific to compute environments at different Institutions

108
TileDB-VCF
TileDB-VCF TileDB-Inc C++

Efficient variant-call data storage and retrieval library using the TileDB storage library.

108
QUBEKit
QUBEKit qubekit Python

Quantum Mechanical Bespoke Force Field Derivation Toolkit

107
ClairS
ClairS HKU-BAL Python

ClairS - a deep-learning method for long-read somatic small variant calling

107
swne
swne yanwu2014 R

Similarity Weighted Nonnegative Embedding (SWNE), a method for visualizing high dimensional datasets

107
ntHash
ntHash BirolLab C++

Fast hash function for DNA/RNA sequences

107
xpclr
xpclr hardingnj Python

Code to compute the XP-CLR statistic to infer natural selection

106
BioStructures.jl
BioStructures.jl BioJulia Julia

A Julia package to read, write and manipulate macromolecular structures

106
awesome-molecular-docking
awesome-molecular-docking Thinklab-SJTU

We would like to maintain a list of resources which aim to solve molecular docking and other closely related tasks.

106
aviary
aviary rhysnewell Python

A hybrid assembly and MAG recovery pipeline (and more!)

106
Clair
Clair HKU-BAL Python

Clair: Exploring the limit of using deep neural network on pileup data for germline variant calling

105
What_the_Phage
What_the_Phage replikation Nextflow

WtP: Phage identification via nextflow and docker or singularity

105
ganon
ganon pirovc Python

ganon2 classifies genomic sequences against large sets of references efficiently, with integrated download and update of databases (refseq/genbank), t...

105
funcscan
funcscan nf-core Nextflow

(Meta-)genome screening for functional and natural product gene sequences

105
ska.rust
ska.rust bacpop Rust

Split k-mer analysis – version 2

105
fastq.bio
fastq.bio robertaboukhalil Svelte

An interactive web tool for quality control of DNA sequencing data

105
havengrc
havengrc kindlyops JavaScript

☁️Haven GRC - easier governance, risk, and compliance 👨‍⚕️👮‍♀️🦸‍♀️🕵️‍♀️👩‍🔬

105
rustybam
rustybam vollgerlab Rust

bioinformatics toolkit in rust

104
fqgrep
fqgrep fulcrumgenomics Rust

Grep for FASTQ files

104
DeepMicrobes
DeepMicrobes MicrobeLab Python

DeepMicrobes: taxonomic classification for metagenomics with deep learning

104
GFF3toolkit
GFF3toolkit NAL-i5K Python

Python programs for processing GFF3 files

104
PhaBOX
PhaBOX KennthShang Python

Local version of the virus identification and analysis web server (tool set)

103
obi
obi obi-ontology Python

The Ontology for Biomedical Investigations

102
bio_scripts
bio_scripts shenwei356 Perl

Practical, reusable scripts for bioinformatics

102
BlazeSeq
BlazeSeq MoSafi2 Mojo

High-Performance FASTQ Parsing for Mojo — Zero-Copy to GPU

102
saber
saber BaderLab Python

Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress....

102
HoneyBADGER
HoneyBADGER JEFworks-Lab R

HMM-integrated Bayesian approach for detecting CNV and LOH events from single-cell RNA-seq data

102
trackplot
trackplot ygidtu Python

trackplot is a tool for visualizing various next-generation sequencing (NGS) data, including DNA-seq, RNA-seq, single-cell RNA-seq and full-length seq...

102
referenceseeker
referenceseeker oschwengers Python

Rapid determination of appropriate reference genomes.

101
veba
veba jolespin Python

A modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes

101
atacseq_pipeline
atacseq_pipeline epigen Python

Ultimate ATAC-seq Data Processing, Quantification and Annotation Snakemake Workflow and MrBiomics Module.

101
shapeit5
shapeit5 odelaneau C++

Segmented HAPlotype Estimation and Imputation Tool

101
biopython-coronavirus
biopython-coronavirus chris-rands Jupyter Notebook

Biopython Jupyter Notebook tutorial to characterize a small genome

101
pyliftover
pyliftover konstantint Python

Pure-python implementation of UCSC liftOver genome coordinate conversion

101
VerifyBamID
VerifyBamID Griffan mupad

VerifyBamID2: A robust tool for DNA contamination estimation from sequence reads using ancestry-agnostic method.

100
CharGer
CharGer ding-lab Python

Characterization of Germline variants

100
bcalm
bcalm GATB Python

compacted de Bruijn graph construction in low memory

100
Drug-Drug-Interaction-Prediction
Drug-Drug-Interaction-Prediction rezacsedu Jupyter Notebook

Drug-Drug Interaction Prediction Based on Knowledge Graph Embeddings and Convolutional-LSTM Network

100
Packages
Packages BioArchLinux Shell

Aim to be the bioinformatics repository with more and newer packages https://doi.org/10.1093/bioinformatics/btaf106

100
USearchMolecules
USearchMolecules ashvardanian Python

Searching for structural similarities across billions of molecules in milliseconds

100
BUSCO_phylogenomics
BUSCO_phylogenomics jamiemcg Python

BUSCO_Phylogenomics | Pipeline to construct species phylogenies using BUSCO proteins

99
Peptides
Peptides dosorio R

An R package to calculate indices and theoretical physicochemical properties of peptides and protein sequences.

99
GenomicsDB
GenomicsDB GenomicsDB C++

High performance data storage for importing, querying and transforming variants.

99
quantQ
quantQ hanssmail q

The repository for the Machine Learning and Big Data with kdb+/q book by Novotny et al.

99
SingleRust
SingleRust SingleRust Rust

Single Rust: Pioneering single-cell analysis with Rust's concurrency for scalable, high-throughput pipelines. 🧬🚀

99
ga4gh-server
ga4gh-server ga4gh Python

Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24

99
clusterflow
clusterflow ewels Perl

A pipelining tool to automate and standardise bioinformatics analyses on cluster environments.

99
STalign
STalign JEFworks-Lab HTML

Python tool for alignment of spatial transcriptomics (ST) data using diffeomorphic metric mapping

98