Skip to content
Snippets Groups Projects
Commit 0ec4bbff authored by Arslan Tariq's avatar Arslan Tariq
Browse files

add Readme file for assys

parent 5ae2126c
Branches main
No related tags found
No related merge requests found
Pipeline #5603 passed
Showing with 15 additions and 3 deletions
# Orthologs database
These are the eudicots_odb10 databases that were used for the [BUSCO](https://busco.ezlab.org/) and [Compleasm](https://github.com/huangnengCSU/compleasm) analysis.
\ No newline at end of file
No preview for this file type
# Genome Annotations
The combination of ab initio gene prediction using the deep learning tool [Helixer](https://github.com/weberlab-hhu/Helixer) and the RNASeq-based [StringTie](https://ccb.jhu.edu/software/stringtie/) predicted transcripts were combined with a high-quality genome annotation with [Mikado](https://mikado.readthedocs.io/en/stable/) to select the best transcript sets.
\ No newline at end of file
No preview for this file type
# Genome assembly
The genome of Anjibaicha, Zijuan, and L618 was initially assembled using [Hifiasm](https://www.nature.com/articles/s41592-020-01056-5) ([v0.19.5] and then purge the assembly using [Purge Haplotigs](https://bitbucket.org/mroachawri/purge_haplotigs/src/master/).
We only had [Hi-C](https://en.wikipedia.org/wiki/Hi-C_(genomic_analysis_technique)) data for Anjibaicha. First, the Hi-C reads were processed using [Arima pipeline](https://github.com/ArimaGenomics/mapping_pipeline) and then removed small contigs which were highly similar to large scaffolds using [Purge Haplotigs](https://bitbucket.org/mroachawri/purge_haplotigs/src/master/).
We didn't have Hi-C data for ZJ and L618, so we used [RagTag](https://github.com/malonge/RagTag) using the AJ chromosome as a reference.
\ No newline at end of file
No preview for this file type
# Functional annotation
The functional annotation for all accession's proteome was performed using [Mercator4](https://www.plabipd.de/mercator_main.html)
\ No newline at end of file
No preview for this file type
# Transposable Elements (TEs) annotation
The advanced Extensive de novo TE Annotator [EDTA](https://github.com/oushujun/EDTA) to identify TEs within the tea genomes. The LTR, TIR, and Helitron modes were run separately, followed by a final EDTA run with `-overwrite 0` and `--cds` options using Mikado's gene model CDS sequence, ensuring accurate and comprehensive annotation of TEs. For pan TE identification, the panEDTA module was used
\ No newline at end of file
No preview for this file type
No preview for this file type
# *Camellia* genomes
De novo genome assemblies of 3 diverse tea accessions, the purple-leaved *assamica* cultivar “Zijuan”, the temperature-sensitive *sinensis* cultivar “Anjibaicha” and the wild accession “L618” were merged with eight existing *Camellia* genomes to generate a new pan genome. To obtain leaf material for the de novo assemblies, plants were grown in the tea germplasm resource nursery of Huazhong Agricultural University, Wuhan, China.
# *Camellia sinensis* genomes
De novo genome assemblies of 3 diverse tea accessions, the purple-leaved *assamica* cultivar “Zijuan”, the temperature-sensitive *sinensis* cultivar “Anjibaicha” and the wild accession “L618” were merged with eight existing *Camellia* genomes to generate a new pan genome. To obtain leaf material for the de novo assemblies, plants were grown in the tea germplasm resource nursery of Huazhong Agricultural University, Wuhan, China.
No preview for this file type
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment