ENCODE Software
All software used or developed by the ENCODE Consortium
Showing 50 of 350 results
Number of displayed results:
- CellBender — sourceCellBender is a software package for eliminating technical artifacts from high-throughput single-cell RNA sequencing (scRNA-seq) data.Software type: other
- Distal Regulation E-G correlation — sourceCompute correlation metrics between DNase-seq signal at cCREs with DNase-seq signal at gene promoters or RNA expression levels of genes.
- Distal regulation ENCODE-rE2G — sourceTrain ENCODE-rE2G models on CRISPR enhancer screen data and apply to generate genome-wide predictions of enhancer-gene regulatory connections.
- mex_gene_archive — sourcemex_gene_archive is a minimal file format designed to meet the needs of archiving sparse gene matrices in a format compatible with the ENCODE 4 Data Coordination Center.Software type: other
- OpenMiChrom — sourceUsed to create an ensemble of 3D structures with chromatin dynamics simulation software with input data from the Sequence Annotations (bed file) from PyMEGABASE.
- PyMEGABASE — sourcePyMEGABASE is used to generate sequence annotations at the compartment and subcompartment level for physical modeling annotations.
- PROcapNet Model Zoo Pipeline — sourceSoftware for BPNet models using PRO-cap data.Software type: machine learning
- TF ChIP-seq BPNet Model Zoo Pipeline — sourcePlaceholder description.Software type: machine learning
- ATAC-seq DNase-seq ChromBPNet Model Zoo Pipeline — sourcePlaceholder description.Software type: machine learning
- ChromBPNet — sourceChromBPNet is a fully convolutional neural network that uses dilated convolutions with residual connections to enable large receptive fields with efficient parameterization.Software type: machine learning
- pyrangesGenomicRanges for Python.
- pandasPandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.
- SwanSwan is a Python library designed for the analysis and visualization of transcriptomes.Software type: other
- ABC-Enhancer-Gene-Prediction — sourceCell type specific enhancer-gene predictions using ABC model (Fulco, Nasser et al, Nature Genetics 2019)
- EPIraction — sourceThe EPIraction algorithm uses Tikhonov-regularized least squares models to predict the interacting promoter-enhancer pairs.
- AnalyzeSpearATACSoftware used to analyze Greenleaf lab's SpearATAC (perturbation followed by snATAC-seq) data.
- CerberusCerberus software for long-read RNA-seq analysisSoftware type: other
- CRISPRi-FlowFISHSoftware for the analysis of CRISPRi-FlowFISH data from Engreitz lab.
- GraphReg — sourceGraphReg (Chromatin interaction aware gene regulatory modeling with graph attention networks) is a graph neural network based gene regulation model which integrates DNA sequence, 1D epigenomic data (such as chromatin accessibility and histone modifications), and 3D chromatin conformation data (such as Hi-C, HiChIP, Micro-C, HiCAR) to predict gene expression in an informative way.
- HiCDCPlus — sourceThe package HiCDCPlus provides methods to determine significant and differential chromatin interactions by use of a negative binomial generalized linear model, as well as implementations for TopDom to call topologically associating domains (TADs), and Juicer eigenvector to find the A/B compartments. This vignette explains the use of the package and demonstrates typical workflows on HiC and HiChIP data.
- ZeroneZerone discretizes several ChIP-seq replicates simultaneously and resolves conflicts between them. Publication available at: doi: 10.1093/bioinformatics/btw336
- GEM-ToolsGEM-Tools is a C API and a Python module to support and simplify usage of the GEM Mapper.
- Fastx Toolkit — sourceThe FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing.
- TRACETranscription Factor Footprinting Using DNase I Hypersensitivity Data and DNA Sequence
- 3d-dna — sourceWe begin with a series of iterative steps whose goal is to eliminate misjoins in the input scaffolds. Each step begins with a scaffold pool (initially, this pool is the set of input scaffolds themselves). The scaffolding algorithm is used to order and orient these scaffolds. Next, the misjoin correction algorithm is applied to detect errors in the scaffold pool, thus creating an edited scaffold pool. Finally, the edited scaffold pool is used as an input for the next iteration of the misjoin correction algorithm. The ultimate effect of these iterations is to reliably detect misjoins in the input scaffolds without removing correctly assembled sequence. After this process is complete, the scaffolding algorithm is applied to the revised input scaffolds, and the output – a single “megascaffold” which concatenates all the chromosomes – is retained for post-processing.
- bioraddbg ATAC-seq MACS2 — sourceThis Docker container provides an easy to use Docker interface to MACS2 for peak calling with settings tailored for Bio-Rad Single Cell ATAC-seq chemistry.
- bioraddbg ATAC-seq filter beads — sourceThis Docker container provides an easy to use Docker interface to a bead filtration tool with settings tailored for Bio-Rad Single Cell ATAC-seq chemistry. This container takes in .BAM files and performs "knee calling" to compute a bead barcode whitelist and jaccard index threshold for bead-to-droplet merging.
- bioraddbg ATAC-seq BWA — sourceThis Docker container provides an easy to use Docker interface to the BWA alignment tool with settings tailored for Bio-Rad ATAC-Seq chemistry.
- bioraddbg ATAC-seq deconvolute — sourceThis Docker container provides an easy to use Docker interface to BAP tool with settings tailored for Bio-Rad ATAC-seq chemistry.
- guppy_basecaller — sourceOnt-Guppy is a basecalling software available to Oxford Nanopore customers. For more information, please see https://nanoporetech.com/
- polyAsite_workflow — sourcePipeline to infer poly(A) site clusters through processing of 3' end sequencing libraries prepared according to various protocols.
- gencode_utr_fix — sourceThis package fixes UTR features in the third columns of Gencode GTF by converting UTR annotation into five_prime_utr and three_prime_utr similar to Ensembl.