Release notes

December 27th, 2021

Data Cruncher default environment update

The default environment for Data Cruncher interactive analyses has been updated to include more up-to-date versions of Python (upgraded to 3.9) and R (upgraded to 4.1).

GDC Datasets version update

As of December 17, GDC datasets available through the Data Browser and the API correspond to GDC Data Release 30.0.

Recently published apps

GRIDSS/PURPLE/LINX Workflow, used for somatic genomic rearrangement detection and classification on WGS data. This workflow takes a pair of matched tumor/normal BAM files and produces allele-specific copy number of every base of the genome, overall sample purity and ploidy, annotated SV clusters and gene fusion predictions. Moreover, it outputs detailed visualisations of the rearrangements in the tumor genome via integrated Circos plots showing copy number changes, clustered SVs, derivative chromosome predictions and impacted genes.
PURPLE CNV Calling Workflow, used for somatic CNV calling and purity and ploidy estimation on WGS data. It is based on PURPLE 2.51, and consists of two additional tools – AMBER and COBALT. The workflow first calculates B-allele frequency (BAF) with AMBER and read depth ratios with COBALT, which is then used by PURPLE to estimate the purity, ploidy and copy number profile of a tumor sample.

Metadata editing using manifest files just got easier

Seven Bridges Platform provides the capability to modify metadata for multiple files in a project by using the Export metadata manifest and Edit metadata with manifest options in the File Browser. This release brings some major improvements to this feature:

Support for different manifest file formats. Besides CSV, we have added support for the TSV file format.
Use either file name or ID to identify a file. Files whose metadata is being edited can be specified using only file ID or file name (along with path) in the manifest file used with the Edit metadata with manifest option.
Support for folders. The name column can contain file path within the project (along with the file name) if the file is in a folder instead of the project root.
Better file naming and placement. A manifest file generated using the Export metadata manifest action is named in a user-friendly manner, in the manifest__YYYYMMDD_HHMMSS format. Also, an exported manifest file is generated in the project and made available as any other Platform file, meaning it can be downloaded, copied into another project, used in a task, etc.
Added file size info. Manifest files exported via the Export metadata manifest option now contain file size information.
Handling of non-standard characters. It is possible to have a comma or tab (or any other character) in a manifest file used with the Edit metadata with manifest option. Similarly, these characters will be properly formatted in a manifest file generated by the Export metadata manifest action.

Learn more:

Recently published apps

We have just published and upgraded versions (from 2.17 to 2.22) of minimap2, a sequence alignment program that aligns DNA or mRNA sequences against a reference database, and minimap2 build index, a reference indexer for minimap2 aligner.

This week’s publishing streak also includes METAL, a tool for meta-analysis genome-wide association scans. METAL can combine either (a) test statistics and standard errors or (b) p-values across studies (taking sample size and direction of effect into account). A METAL analysis is a convenient alternative to a direct analysis of merged data from multiple studies.

Recently published apps

We have just published Picard FastqToSam, a tool that converts FASTQ files to an unaligned SAM or BAM file, and a set of seven Delly tools:

Delly CNV for calling copy-number variants
Delly Call, a structural variants caller
Delly LR, a structural variants caller for long reads data
Delly Sansa Annotate for annotating structural variants
Delly Classify for classifying somatic or germline copy-number variants
Delly Filter, a tool that filters structural variants
Delly Merge for merging of structural variants in BCF format

Recently published apps

We have just published the following apps:

CrossMap, a tool that converts genomic coordinates between different assemblies, and CrossMap Viewchain that prints the chain file for two assemblies in a human-readable format.
VerifyBamID2 that estimates contamination of DNA samples from read data, accounting for ancestry information.

Recently published apps

We have just published DRAGMAP, the open source DRAGEN mapper/aligner that can be used to align single or paired-end reads (FASTQ) or an input BAM file. The app is available in the Public Apps gallery.

Recently published apps

We have just updated the content of our public app galleries with new GATK releases:

GATK Pre-Processing For Variant Discovery 4.2.0.0 workflow is used to prepare data for variant calling analysis. The workflow consists of two major segments: alignment to reference genome and data cleanup operations that correct technical biases. Resulting BAM files are ready for variant calling analysis and can be further processed by other BROAD best practice pipelines, like Generic Germline Short Variant Per-Sample Calling workflow, Somatic CNVs workflow, and Somatic SNVs + INDELs workflow.
GATK Generic Germline Short Variant Per-Sample Calling 4.2.0.0 workflow that calls germline variants in a WGS sample with GATK HaplotypeCaller, starting from an analysis-ready BAM file.

And six GATK 4.2.0.0 tools:

GATK GatherBQSRReports tool that gathers scattered BQSR recalibration reports into a single file.
GATK BaseRecalibrator tool that generates a recalibration table based on various covariates for input mapped read data.
GATK ApplyBQSR tool that recalibrates the base quality scores of an input BAM or CRAM file containing reads.
GATK HaplotypeCaller tool for calling germline SNPs and indels from input BAM file(s) via local re-assembly of haplotypes.
GATK VariantFiltration tool used for filtering variants in a VCF file based on INFO and/or FORMAT annotations.
GATK MergeVcfs, used for combining multiple variant files.

Recently published apps

We’ve just published four tools from the OncoGEMINI 1.0.0 toolkit:

OncoGEMINI Bottleneck that identifies somatic variants with increasing allele frequency in longitudinal data.
OncoGEMINI Loh, a command tool that performs loss of heterozygosity analysis.
OncoGEMINI Truncal that recovers variants that appear in all tumor samples, but are absent in the normal sample.
OncoGEMINI Unique tool for identifying somatic variants unique to a subset of samples.

Billing information just got more informative and organized

The following improvements have been made on the Billing page available to Enterprise and Division administrators:

The Billing page has been redesigned and now consists of three sections: Billing information, Instance limits and Payment information.
The start date from which costs are calculated is now displayed for the current billing period.
Additional charges and credits information is now renamed to Charges and Refunds and grouped in the Additional subsection.
Total Platform charges now sum up costs for Analysis, Storage and Additional charges.

December 27th, 2021

Data Cruncher default environment update

GDC Datasets version update

December 17th, 2021

Recently published apps

December 6th, 2021

Metadata editing using manifest files just got easier

Recently published apps

November 29th, 2021

Recently published apps

November 8th, 2021

Recently published apps

November 1st, 2021

Recently published apps

October 25th, 2021

Recently published apps

September 20th, 2021

Recently published apps

August 30th, 2021

Billing information just got more informative and organized

Request sent