ENCODE Software

All software used or developed by the ENCODE Consortium

Showing 14 of 14 results

VALIS — source
High performance WebGL genome visualization
Software
released
Croo — source
Croo is a Python package for organizing outputs from Cromwell. It parses metadata.json, a file that Cromwell writes as output, and builds an organized directory containing a copy (or a soft link) of each output file, as described in the output definition JSON file specified by --out-def-json.
Software type: framework
Software
released
Caper — source
Caper (Cromwell Assisted Pipeline ExecutoR) is a wrapper Python package for Cromwell. Caper is based on Unix and cloud platform CLIs (curl, gsutil and aws) and provides an easier way of running Cromwell in server and run modes by automatically composing the necessary input files for Cromwell. Caper also supports automatic file transfer between local and cloud storage (local path, s3://, gs:// and http(s)://). You can use these URIs in an input JSON file or for the WDL file itself.
Software type: framework
Software
released
Check Files — source
Files are checked to confirm that their MD5 sums (for both the gzipped and ungzipped content) match the submitted metadata, and are also run through the validateFiles program from Jim Kent's source utilities.
Software
released
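
The MD5 comparison described in the Check Files entry can be illustrated with a minimal Python sketch. This is not the Check Files code itself: the function names and the two submitted-checksum parameters are hypothetical, and the validateFiles step is not covered.

```python
import gzip
import hashlib


def md5_of_stream(stream, chunk_size=1 << 20):
    """Return the hex MD5 digest of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def md5_pair(path):
    """Return (MD5 of the gzipped bytes, MD5 of the ungzipped content)."""
    with open(path, "rb") as raw:
        compressed_md5 = md5_of_stream(raw)
    with gzip.open(path, "rb") as unzipped:
        content_md5 = md5_of_stream(unzipped)
    return compressed_md5, content_md5


def matches_metadata(path, submitted_md5, submitted_content_md5):
    """Compare both digests with the values from the submitted metadata.

    The two submitted_* arguments are hypothetical names for the checksums
    recorded at submission time.
    """
    compressed_md5, content_md5 = md5_pair(path)
    return (compressed_md5 == submitted_md5
            and content_md5 == submitted_content_md5)
```

Reading in chunks keeps memory use flat for large sequencing files, which is why the sketch avoids loading a whole file at once.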
SnoVault — source
The ENCODE DCC has created a general purpose software system, known as SnoVault, that supports metadata and file submission, a database used for metadata storage, web pages for displaying the metadata and a robust API for querying the metadata.
Software type: database
Software
released
encodeD — source
Metadata database for ENCODE project
Software type: database
Software
released
FASTQ read-name correction
A script resolving FASTQ read-name inconsistencies
Software
released
xsv — source
xsv is a command line program for indexing, slicing, analyzing, splitting and joining CSV files.
Software
released
bsseq — source
This R package is the reference implementation of the BSmooth algorithm for analyzing whole-genome bisulfite sequencing (WGBS) data.
Software
released
gemBS — source
gemBS is a high performance bioinformatic pipeline designed for high-throughput analysis of DNA methylation data from whole genome bisulfite sequencing (WGBS). It combines GEM3, a high performance read aligner, and bs_call, a high performance variant and methylation caller, into a streamlined and efficient pipeline for bisulfite sequence analysis.
Software
released
bedSort — source
UCSC Genome Browser tool for sorting .bed files by chrom,chromStart.
Software type: other
Software
released
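
The sort order named in the bedSort entry (chrom, then chromStart) can be sketched in a few lines of Python. This is an illustration only, not the UCSC tool: the function name is hypothetical, and the sketch assumes tab-separated lines, plain lexicographic ordering of chromosome names, and numeric ordering of start positions.

```python
def sort_bed_lines(lines):
    """Sort BED lines by chrom (as a string), then chromStart (as an integer).

    Hypothetical helper; blank lines are dropped, every other line is assumed
    to be a tab-separated BED record.
    """
    records = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        fields = line.split("\t")
        # fields[0] is chrom, fields[1] is chromStart in the BED format.
        records.append((fields[0], int(fields[1]), line))
    records.sort(key=lambda record: (record[0], record[1]))
    return [record[2] for record in records]
```

Converting chromStart to an integer matters: compared as text, "100" would sort before "20".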
Sambamba — source
Sambamba is a high performance, highly parallel, robust and fast tool (and library), written in the D programming language, for working with SAM and BAM files. Because of its efficiency, it is an important workhorse running in many sequencing centres around the world today.
Software
released
kallisto
kallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads. It is based on the novel idea of pseudoalignment for rapidly determining the compatibility of reads with targets, without the need for alignment. On benchmarks with standard RNA-Seq data, kallisto can quantify 30 million human reads in less than 3 minutes on a Mac desktop computer using only the read sequences and a transcriptome index that itself takes less than 10 minutes to build. Pseudoalignment of reads preserves the key information needed for quantification, and kallisto is therefore not only fast, but also as accurate as existing quantification tools. In fact, because the pseudoalignment procedure is robust to errors in the reads, in many benchmarks kallisto significantly outperforms existing tools.
Software
released
dbGaP SRA to fastq
Converts dbGaP-protected raw data from SRA format to FASTQ format.
Software
released
