ENCODE Software

All software used or developed by the ENCODE Consortium

Showing 50 of 199 results

List Report

Number of displayed results:

25 50 100 200

Distal Regulation E-G correlation — source
Compute correlation metrics between DNase-seq signal at cCREs with DNase-seq signal at gene promoters or RNA expression levels of genes.
Software
released
FUNCODE — source
Scripts for computing Functional Conservation of DNA Elements (FUNCODE) scores from ENCODE DNase-seq, ATAC-seq and Histone ChIP-seq data
Software
released
Distal regulation ENCODE-rE2G — source
Train ENCODE-rE2G models on CRISPR enhancer screen data and apply to generate genome-wide predictions of enhancer-gene regulatory connections.
Software
released
mex_gene_archive — source
mex_gene_archive is a minimal file format designed to meet the needs of archiving sparse gene matrices in a format compatible with the ENCODE 4 Data Coordination Center.
Software type: other
Software
released
CNDBTools — source
Used to generate the in-silico Hi-C map for each chromosome.
Software
released
OpenMiChrom — source
Used to create an ensemble of 3D structures with chromatin dynamics simulation software with input data from the Sequence Annotations (bed file) from PyMEGABASE.
Software
released
PyMEGABASE — source
PyMEGABASE is used to generate sequence annotations at the compartment and subcompartment level for physical modeling annotations.
Software
released
PROcapNet Model Zoo Pipeline — source
Software for BPNet models using PRO-cap data.
Software type: machine learning
Software
released
ProCapNet — source
Software for BPNet models using PRO-cap data.
Software type: machine learning
Software
released
TF ChIP-seq BPNet Model Zoo Pipeline — source
Placeholder description.
Software type: machine learning
Software
released
BPNet — source
BPNet is a python package with a CLI to train and interpret base-resolution deep neural networks trained on functional genomics data such as ChIP-nexus or ChIP-seq.
Software type: machine learning
Software
released
Swan
Swan is a Python library designed for the analysis and visualization of transcriptomes.
Software type: other
Software
released
seqFISH+ — source
Pipeline to process seqFISH data.
Software type: quantification, other
Software
released
ABC-Enhancer-Gene-Prediction — source
Cell type specific enhancer-gene predictions using ABC model (Fulco, Nasser et al, Nature Genetics 2019)
Software
released
EPIraction — source
The EPIraction algorithm uses Tikhonov-regularized least squares models to predict the interacting promoter-enhancer pairs.
Software
released
AnalyzeSpearATAC
Software used to analyze Greenleaf lab's SpearATAC (perturbation followed by snATAC-seq) data.
Software
released
gRNA_to_log2FC — source
Script for computing log2fc bigwig from gRNA counts
Software
released
GT_Scan — source
GT-Scan is a web-based tool that scans a user-defined genomic region for candidate targets and ranks them in terms of the number of exact or approximate off-targets in the genome.
Software
released
Cerberus
Cerberus software for long-read RNA-seq analysis
Software type: other
Software
released
LAPA — source
Alternative polyadenylation detection from diverse data sources such as 3'-seq, long-read and short-reads.
Software type: other
Software
released
CRISPRi-FlowFISH
Software for the analysis of CRISPRi-FlowFISH data from Engreitz lab.
Software
released
CRISPy — source
CRISPy is a lightweight versatile pipeline for CRISPR-screening analysis.
Software type: quantification
Software
released
GraphReg — source
GraphReg (Chromatin interaction aware gene regulatory modeling with graph attention networks) is a graph neural network based gene regulation model which integrates DNA sequence, 1D epigenomic data (such as chromatin accessibility and histone modifications), and 3D chromatin conformation data (such as Hi-C, HiChIP, Micro-C, HiCAR) to predict gene expression in an informative way.
Software
released
HiCDCPlus — source
The package HiCDCPlus provides methods to determine significant and differential chromatin interactions by use of a negative binomial generalized linear model, as well as implementations for TopDom to call topologically associating domains (TADs), and Juicer eigenvector to find the A/B compartments. This vignette explains the use of the package and demonstrates typical workflows on HiC and HiChIP data.
Software
released
TRACE
Transcription Factor Footprinting Using DNase I Hypersensitivity Data and DNA Sequence
Software
released
psf-to-bedpe — source
Quick script that converts psf to bedpe.
Software
released
3d-dna — source
We begin with a series of iterative steps whose goal is to eliminate misjoins in the input scaffolds. Each step begins with a scaffold pool (initially, this pool is the set of input scaffolds themselves). The scaffolding algorithm is used to order and orient these scaffolds. Next, the misjoin correction algorithm is applied to detect errors in the scaffold pool, thus creating an edited scaffold pool. Finally, the edited scaffold pool is used as an input for the next iteration of the misjoin correction algorithm. The ultimate effect of these iterations is to reliably detect misjoins in the input scaffolds without removing correctly assembled sequence. After this process is complete, the scaffolding algorithm is applied to the revised input scaffolds, and the output – a single “megascaffold” which concatenates all the chromosomes – is retained for post-processing.
Software
released
chromVar — source
chromVAR is an R package for the analysis of sparse chromatin accessibility data from single cell or bulk ATAC or DNAse-seq data.
Software
released
ArchR — source
R package for single-cell ATAC-seq data analysis
Software
released
DELTA — source
Tool to produce chromatin stripes, long range chromatin interactions, and topologically associated domains for Hi-C data.
Software type: other
Software
released
SLICE — source
Subcompartment Landscape Identification via Clustering Enrichments
Software type: other
Software
released
LR-splitpipe — source
Demultiplexing and debarcoding tool designed for LR-Split-seq data.
Software
released
PINTS — source
Yu lab repository for signal generation and peak calling scripts
Software
released
sQTLseekeR — source
sQTLseekeR is a package to detect splicing QTLs (sQTLs), which are variants associated with change in the splicing pattern of a gene. In sQTLSeeker, splicing patterns are modeled by the relative expression of the transcripts of a gene. The most recent version of sQTLseekeR can be employed to detect genetic variant associated to any multivariate phenotype
Software type: variant annotation
Software
released
ggsashimi — source
a command-line tool for the visualization of splicing events across multiple samples. Given a specified genomic region, ggsashimi creates sashimi plots for individual RNA-seq experiments as well as aggregated plots for groups of experiments. It uses popular bioinformatics file formats, it is annotation-independent, and allows the visualization of splicing events even for large genomic regions by scaling down the genomic segments between splice sites. It is implemented in python, and internally generates R code for plotting.
Software type: visualization
Software
released
MPRAmodel — source
Tool to analyze counts and generate processed files
Software type: quantification, file format conversion
Software
released
MPRAcount — source
Tool to process Tag-seq data and generate the count matrix
Software type: quantification
Software
released
MPRAmatch — source
Tool to identify barcode-oligo pairs
Software type: utility
Software
released
Library sequencing match — source
House script that was matching the guides (from an input list) to the fastq files as returned by deep sequencing
Software type: quantification
Software
released
FORGE2 — source
FORGE2 identifies tissue- or cell type-specific signal by analysing a minimum set of 5 single nucleotide polymorphisms (SNPs) for overlap with epigenetic data peaks compared to matched background SNPs and provides both graphical and tabular outputs.
Software type: integrated analysis
Software
released
eFORGE — source
eFORGE identifies tissue or cell type-specific signal by analysing a minimum set of 5 differentially methylated positions (DMPs) for overlap with DNase I hypersensitive sites (DHSs) compared to matched background DMPs and provides both graphical and tabulated outputs.
Software type: integrated analysis
Software
released
GenomeStudio — source
Software developed by Illumina for analysis of microarray data.
Software type: other
Software
released
CRISPR screen peak calling — source
Takes CASA output and makes ENCODE sandard element quantification file
Software type: file format conversion
Software
released
CRISPR screen track builder — source
Takes guide quantification and builds a browser track perturbation signal file
Software type: quantification
Software
released
merge_bcs — source
This Jupyter notebook merges files of barcodes to create a pass list of barcodes in common between the input files.
Software
released
ptools_bin — source
A data-sanitization software allowing raw functional genomics reads to be shared while minimizing privacy leakage, enabling principled privacy-utility trade-offs.
Software type: other
Software
released
SCREEN — source
SCREEN is a web-based visualizer for the ENCODE Registry of cCREs. Users can search for cCREs by genomic region or by associated features such as genes and SNPs, and can also visualize associated underlying annotations from the ground and integrative levels of the ENCODE Encyclopedia such as gene expression, TF ChIP-seq peaks, chromatin states, and cCRE-target gene links. Additionally, users can access ENCODE data on the functional characterization of cCREs.
Software
released
POSSUM — source
PCA Of Sparse, SUper Massive Matrices (POSSUM) contains R and C/C++ functions for very fast eigenvector calculation
Software
released
apricot — source
apricot implements submodular optimization for the purpose of summarizing massive data sets into minimally redundant subsets that are still representative of the original data. These subsets are useful for both visualizing the modalities in the data and for training accurate machine learning models with just a fraction of the examples and compute.
Software
released
CRADLE — source
CRADLE (Correcting Read counts and Analysis of DifferentiaLly Expressed regions) is a package that was developed to analyze STARR-seq data. CRADLE removes technical biases from sonication, PCR, mappability and G-quadruplex sturcture, and generates bigwig files with corrected read counts. CRADLE then uses those corrected read counts and detects both activated and repressed enhancers. CRADLE will help find enhancers with better accuracy and credibility.
Software
released

ENCODE Software

Software type

Award

Lab

Showing 50 of 199 results