Info • Mesostigma viride NIES-296

Status

The Mesostigma viride NIES-296 genome sequence and gene models were not determined by the JGI; they were downloaded from NCBI (GCA_009746045.1, GSE123852) on July 18, 2020. To keep this genome comparable to those sequenced by the JGI, we applied filters to remove, where present: 1) transposable elements, 2) pseudogenes, 3) alternative transcripts and overlapping models, 4) alleles on secondary scaffolds, and 5) unsupported short models. This removed 14,453 models and produced the FilteredModels1 (GeneCatalog) gene track. All published models remain available in the ExternalModels track. Please note that this copy of the genome is not maintained by NCBI and is therefore not automatically updated. A copy of this genome is incorporated into PhycoCosm to allow comparative analyses with other algal genomes sequenced by the JGI. The JGI Annotation Pipeline was used to add functional annotation to the genes.

The sequenced sample is likely diploid, and this is reflected in an assembly and annotation with significant separation of alleles. Many scaffolds are very similar to larger scaffolds and are predicted to constitute an alternate, or secondary, haplotype. To represent the primary and secondary haplotypes in the Portal, we have created 'primary alleles' and 'secondary alleles' gene model tracks, comprising the models found on each haplotype. The goal of the GeneCatalog (GC) is to produce a non-redundant set of models that captures the full functional repertoire of the genome, so the few secondary alleles that are unique were included in the GC, while all others were excluded.
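
The selection rule described above can be illustrated with a minimal sketch. It is not the JGI pipeline: the `GeneModel` record, its `primary_counterpart` field, and the function name are hypothetical, and the sketch assumes that each secondary model has already been matched (or not) to a primary allele by some upstream comparison.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class GeneModel:
    """Hypothetical record for one gene model (not a JGI data structure)."""
    model_id: str
    haplotype: str  # "primary" or "secondary"
    # ID of the matching primary allele, or None if the model is unique.
    primary_counterpart: Optional[str] = None


def build_gene_catalog(models: List[GeneModel]) -> List[GeneModel]:
    """Keep every primary model, plus secondary models with no primary match."""
    catalog = []
    for model in models:
        if model.haplotype == "primary":
            catalog.append(model)
        elif model.haplotype == "secondary" and model.primary_counterpart is None:
            # A unique secondary allele adds function not seen on the primary haplotype.
            catalog.append(model)
    return catalog


# Toy input: g1b duplicates g1, while g2 exists only on the secondary haplotype.
models = [
    GeneModel("g1", "primary"),
    GeneModel("g1b", "secondary", primary_counterpart="g1"),
    GeneModel("g2", "secondary"),
]
catalog_ids = [m.model_id for m in build_gene_catalog(models)]
print(catalog_ids)
```

In this toy example the redundant secondary allele `g1b` is dropped, while the unique secondary model `g2` is retained alongside the primary model.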

Genome Assembly
Genome Assembly size (Mbp) 441.70
Sequencing read coverage depth 113x PacBio, 162x Illumina
# of contigs 3022
# of scaffolds 2352
# of scaffolds >= 2Kbp 2343
Scaffold N50 41
Scaffold L50 (Mbp) 2.56
# of gaps 670
% of scaffold length in gaps 4.3%
Three largest Scaffolds (Mbp) 14.33, 12.14, 10.09
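
As the units in the table indicate, "Scaffold N50" is reported here as a number of scaffolds (41) and "Scaffold L50" as a length (2.56 Mbp); many other tools use the two labels the other way round. The sketch below shows how such a pair of values is typically derived from a list of scaffold lengths. The function name and the toy lengths are illustrative assumptions, not the code or data used to produce the table.

```python
from typing import List, Tuple


def half_assembly_stats(lengths: List[int]) -> Tuple[int, int]:
    """Return (count, length) for the half-assembly point.

    count  : smallest number of scaffolds, taken largest first, whose
             combined length reaches at least half of the total assembly
             (labelled "Scaffold N50" in the table above).
    length : size of the last scaffold added to reach that point
             (labelled "Scaffold L50" in the table above).
    """
    if not lengths:
        raise ValueError("no scaffold lengths given")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return count, length
    # Unreachable: the running total always reaches the full assembly size.
    raise RuntimeError("half of the assembly was never reached")


# Toy scaffold lengths (arbitrary units): total 150, half 75.
example_lengths = [50, 40, 30, 20, 10]
count, length = half_assembly_stats(example_lengths)
print(count, length)
```

For the toy lengths, the two largest scaffolds (50 + 40 = 90) are enough to cover half of the total, so the count is 2 and the corresponding length is 40.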


Gene Models FilteredModels1
length (bp) of: average median
gene 8184 5100
transcript 2520 1647
exon 401 161
intron 1075 797
description: average median
protein length (aa) 626 424
exons per gene 6.28 4
# of gene models 16232
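
The averages in this table can be cross-checked against each other with simple arithmetic. The sketch below assumes that transcript length is the sum of exon lengths, that gene length is transcript length plus intron length, and that a gene with n exons has n - 1 introns; these are assumptions about how the table was computed, not statements from the source. Variable names are illustrative.

```python
# Averages copied from the FilteredModels1 table above.
avg_exon_bp = 401
avg_intron_bp = 1075
avg_exons_per_gene = 6.28
reported_transcript_bp = 2520
reported_gene_bp = 8184

# Assumption: average transcript length = exons per gene x average exon length.
est_transcript_bp = avg_exons_per_gene * avg_exon_bp

# Assumption: a gene with n exons has n - 1 introns.
est_intron_bp_per_gene = (avg_exons_per_gene - 1) * avg_intron_bp

# Assumption: gene length = transcript length + total intron length.
est_gene_bp = est_transcript_bp + est_intron_bp_per_gene


def relative_difference(estimate: float, reported: float) -> float:
    """Absolute difference as a fraction of the reported value."""
    return abs(estimate - reported) / reported


print(round(est_transcript_bp), round(est_gene_bp))
print(f"{relative_difference(est_transcript_bp, reported_transcript_bp):.2%}")
print(f"{relative_difference(est_gene_bp, reported_gene_bp):.2%}")
```

Under these assumptions the estimates come to roughly 2518 bp per transcript and 8194 bp per gene, each within about 0.2% of the reported averages, which is consistent with rounding in the published values.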


Genome Reference(s)

Funding

This project was not sequenced at the JGI.