diff --git a/assays/BUSCO_and_Compleasm_db/README.md b/assays/BUSCO_and_Compleasm_db/README.md index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..4ed76233eb59b0823fe60c52582fb95fa15ea4fc 100644 --- a/assays/BUSCO_and_Compleasm_db/README.md +++ b/assays/BUSCO_and_Compleasm_db/README.md @@ -0,0 +1,2 @@ +# Orthologs database +These are the eudicots_odb10 databases that were used for the [BUSCO](https://busco.ezlab.org/) and [Compleasm](https://github.com/huangnengCSU/compleasm) analysis. \ No newline at end of file diff --git a/assays/BUSCO_and_Compleasm_db/isa.assay.xlsx b/assays/BUSCO_and_Compleasm_db/isa.assay.xlsx index ee50f27ce6f178c7d7b8f92309709b874094763c..988bb86c74a2fe8364c10f1086e51d1a925146ed 100644 Binary files a/assays/BUSCO_and_Compleasm_db/isa.assay.xlsx and b/assays/BUSCO_and_Compleasm_db/isa.assay.xlsx differ diff --git a/assays/Genome_annotations/README.md b/assays/Genome_annotations/README.md index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..55d32a7032e271c6cbb8fe46918eda1959c629e0 100644 --- a/assays/Genome_annotations/README.md +++ b/assays/Genome_annotations/README.md @@ -0,0 +1,2 @@ +# Genome Annotations +The combination of ab initio gene prediction using the deep learning tool [Helixer](https://github.com/weberlab-hhu/Helixer) and the RNASeq-based [StringTie](https://ccb.jhu.edu/software/stringtie/) predicted transcripts were combined with a high-quality genome annotation with [Mikado](https://mikado.readthedocs.io/en/stable/) to select the best transcript sets. \ No newline at end of file diff --git a/assays/Genome_annotations/isa.assay.xlsx b/assays/Genome_annotations/isa.assay.xlsx index 370a9f5cf5066fc5ab8238b98e2413cdc2a268a4..5d2e28d19aa98af173ade6b79f305f5a18652028 100644 Binary files a/assays/Genome_annotations/isa.assay.xlsx and b/assays/Genome_annotations/isa.assay.xlsx differ diff --git a/assays/Genome_assemblies/README.md b/assays/Genome_assemblies/README.md index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..74545f8a367cf473c0973dc3fe9234ad69637aaf 100644 --- a/assays/Genome_assemblies/README.md +++ b/assays/Genome_assemblies/README.md @@ -0,0 +1,4 @@ +# Genome assembly +The genome of Anjibaicha, Zijuan, and L618 was initially assembled using [Hifiasm](https://www.nature.com/articles/s41592-020-01056-5) ([v0.19.5] and then purge the assembly using [Purge Haplotigs](https://bitbucket.org/mroachawri/purge_haplotigs/src/master/). +We only had [Hi-C](https://en.wikipedia.org/wiki/Hi-C_(genomic_analysis_technique)) data for Anjibaicha. First, the Hi-C reads were processed using [Arima pipeline](https://github.com/ArimaGenomics/mapping_pipeline) and then removed small contigs which were highly similar to large scaffolds using [Purge Haplotigs](https://bitbucket.org/mroachawri/purge_haplotigs/src/master/). +We didn't have Hi-C data for ZJ and L618, so we used [RagTag](https://github.com/malonge/RagTag) using the AJ chromosome as a reference. \ No newline at end of file diff --git a/assays/Genome_assemblies/isa.assay.xlsx b/assays/Genome_assemblies/isa.assay.xlsx index 33d1a6d796764a33f327a809d37811d225be37ac..46a25d7c0186567d6b9c349651209e0163b9dc8b 100644 Binary files a/assays/Genome_assemblies/isa.assay.xlsx and b/assays/Genome_assemblies/isa.assay.xlsx differ diff --git a/assays/Mercator_Bins/README.md b/assays/Mercator_Bins/README.md index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..5c06ae82ba5c90e656d33a04db1af71491c0e220 100644 --- a/assays/Mercator_Bins/README.md +++ b/assays/Mercator_Bins/README.md @@ -0,0 +1,2 @@ +# Functional annotation +The functional annotation for all accession's proteome was performed using [Mercator4](https://www.plabipd.de/mercator_main.html) \ No newline at end of file diff --git a/assays/Mercator_Bins/isa.assay.xlsx b/assays/Mercator_Bins/isa.assay.xlsx index 2a0e50d1e4ae1df80a7d6ad0057b17d7b5451325..762f18b4e5d8bd707c1ba2be17ccf3866ac1f161 100644 Binary files a/assays/Mercator_Bins/isa.assay.xlsx and b/assays/Mercator_Bins/isa.assay.xlsx differ diff --git a/assays/Transposable_elements/README.md b/assays/Transposable_elements/README.md index e69de29bb2d1d6434b8b29ae775ad8c2e48c5391..f5eaaaeea5c4dd5a33cd1ec31c3e89fbab25b7d2 100644 --- a/assays/Transposable_elements/README.md +++ b/assays/Transposable_elements/README.md @@ -0,0 +1,2 @@ +# Transposable Elements (TEs) annotation +The advanced Extensive de novo TE Annotator [EDTA](https://github.com/oushujun/EDTA) to identify TEs within the tea genomes. The LTR, TIR, and Helitron modes were run separately, followed by a final EDTA run with `-overwrite 0` and `--cds` options using Mikado's gene model CDS sequence, ensuring accurate and comprehensive annotation of TEs. For pan TE identification, the panEDTA module was used \ No newline at end of file diff --git a/assays/Transposable_elements/isa.assay.xlsx b/assays/Transposable_elements/isa.assay.xlsx index ce8bdb7e2cb1ad7db893418b4b3e6f6de92d2d8b..efa32c38e01dec90add0f67a3ca6240c0d1db43d 100644 Binary files a/assays/Transposable_elements/isa.assay.xlsx and b/assays/Transposable_elements/isa.assay.xlsx differ diff --git a/isa.investigation.xlsx b/isa.investigation.xlsx index 3593da2a6158759aa5cf1df3d490853d264fe183..93598ee27697035e980071587f428d2b68800a03 100644 Binary files a/isa.investigation.xlsx and b/isa.investigation.xlsx differ diff --git a/studies/Camellia_sinensis_pangenome/README.md b/studies/Camellia_sinensis_pangenome/README.md index 47423b6daabb037099e940aaf21526bd11de81e8..543071305901c94027bb95b67a3efa0eb4aca8f5 100644 --- a/studies/Camellia_sinensis_pangenome/README.md +++ b/studies/Camellia_sinensis_pangenome/README.md @@ -1,3 +1,3 @@ -# *Camellia* genomes - -De novo genome assemblies of 3 diverse tea accessions, the purple-leaved *assamica* cultivar “Zijuanâ€, the temperature-sensitive *sinensis* cultivar “Anjibaicha†and the wild accession “L618†were merged with eight existing *Camellia* genomes to generate a new pan genome. To obtain leaf material for the de novo assemblies, plants were grown in the tea germplasm resource nursery of Huazhong Agricultural University, Wuhan, China. +# *Camellia sinensis* genomes + +De novo genome assemblies of 3 diverse tea accessions, the purple-leaved *assamica* cultivar “Zijuanâ€, the temperature-sensitive *sinensis* cultivar “Anjibaicha†and the wild accession “L618†were merged with eight existing *Camellia* genomes to generate a new pan genome. To obtain leaf material for the de novo assemblies, plants were grown in the tea germplasm resource nursery of Huazhong Agricultural University, Wuhan, China. diff --git a/studies/Camellia_sinensis_pangenome/isa.study.xlsx b/studies/Camellia_sinensis_pangenome/isa.study.xlsx index 3bf3fc57f75d04467776c4200c644384ced01155..a98834eb8aa8b0620fb39fb9ae8ad678e781c738 100644 Binary files a/studies/Camellia_sinensis_pangenome/isa.study.xlsx and b/studies/Camellia_sinensis_pangenome/isa.study.xlsx differ