Title: A set of 44 transcriptomes from Silene and related genera.
SNIC Project: SNIC 2020/13-69
Project Type: SNIC Small Compute
Principal Investigator: Patrik Cangren <Patrik.cangren@bioenv.gu.se>
Affiliation: Göteborgs universitet
Duration: 2020-08-27 – 2021-09-01
Classification: 10607


Silene is a highly variable genus of flowering plants that has been used to study a range of processes, including plastid evolution, the evolution of sex chromosomes, heavy metal tolerance, speciation, invasive species and introgression. Unfortunately, few large-scale genetic resources are available for the genus, which complicates further research and its use as a model organism. To increase the availability of genetic resources for further studies of the genus, we performed high-throughput sequencing using Illumina MiSeq/HiSeq of 44 individuals from 33 species representing the phylogenetic diversity of the genus Silene. A total of 808 million reads were processed and assembled into 4.4 million transcript sequences. The assembled transcripts are to be annotated and identified using BLASTx against the GenBank non-redundant protein database (nr). From the BLAST results, Gene Ontology (GO) terms are to be assigned using Blast2GO. Due to the massive size of the dataset, the BLAST searches need to be performed on high-performance computing clusters under the Swedish SNIC project. After publication, the final set of assembled genes will be publicly deposited in GenBank.
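
One common way to run a BLAST search of this size on a cluster is to split the assembled transcripts into smaller FASTA files and search each file as a separate job. The following is only a minimal sketch of that idea, not the pipeline used in this project: the function names, the output file naming scheme and the chunk size are all hypothetical.

```python
from typing import Iterable, Iterator, List


def iter_fasta_chunks(lines: Iterable[str], records_per_chunk: int) -> Iterator[List[str]]:
    """Yield lists of FASTA lines, each holding at most records_per_chunk records."""
    if records_per_chunk < 1:
        raise ValueError("records_per_chunk must be at least 1")
    chunk: List[str] = []
    n_records = 0
    for line in lines:
        if line.startswith(">"):
            # A new record starts here; emit the current chunk first if it is full.
            if n_records == records_per_chunk:
                yield chunk
                chunk, n_records = [], 0
            n_records += 1
        chunk.append(line)
    if chunk:
        # Emit the last, possibly smaller, chunk.
        yield chunk


def write_chunks(fasta_path: str, prefix: str, records_per_chunk: int) -> List[str]:
    """Write each chunk to '<prefix>_<index>.fasta' (hypothetical naming) and return the paths."""
    paths: List[str] = []
    with open(fasta_path) as handle:
        for i, chunk in enumerate(iter_fasta_chunks(handle, records_per_chunk)):
            path = f"{prefix}_{i:04d}.fasta"
            with open(path, "w") as out:
                out.writelines(chunk)
            paths.append(path)
    return paths
```

Each resulting file could then be given to its own BLASTx job, and the per-chunk results combined before the GO annotation step. A suitable chunk size depends on the cluster's job limits and is not specified here.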