From adbceeba24d75e7ca56ae5ef9e87e88bf7afdcf8 Mon Sep 17 00:00:00 2001
From: Viktoria Petrova <vipet103@hhu.de>
Date: Sat, 2 Nov 2024 14:54:25 +0100
Subject: [PATCH] add bacterial genome assembly assay and protocol

---
 .../README.md                                    |   0
 .../dataset/.gitkeep                             |   0
 .../isa.assay.xlsx                               | Bin 0 -> 6911 bytes
 .../protocols/.gitkeep                           |   0
 ...cterialGenomeAssemblyAndAnnotationProtocol.md |   3 +++
 5 files changed, 3 insertions(+)
 create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/README.md
 create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/dataset/.gitkeep
 create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/isa.assay.xlsx
 create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/protocols/.gitkeep
 create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md

diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/README.md b/assays/BacterialGenomeAssemblyAndAnnotation/README.md
new file mode 100644
index 0000000..e69de29
diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/dataset/.gitkeep b/assays/BacterialGenomeAssemblyAndAnnotation/dataset/.gitkeep
new file mode 100644
index 0000000..e69de29
diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/isa.assay.xlsx b/assays/BacterialGenomeAssemblyAndAnnotation/isa.assay.xlsx
new file mode 100644
index 0000000000000000000000000000000000000000..32b9da3d095f128a27995a6d20d3f525ff96bc09
GIT binary patch
literal 6911
zcmai3WmuG3*QOb|L%OBAL`p(n=xzxanxRWVM7leqkq{)LV<-uwI}}hPq>&z4zd>H~
z9M5^bcmJ4cu4mT0*S_wxp0(D#M@<nC2^|g&4gkjsHrJ1z>p;_ohlATgf`cP~JvEee
z1iOL2ZYEmZP9RrfPA>=h@+4(&CpT{JiJ0sP2N2U(OIx-xzjlDQ<OpuUONyOh?jWsw
z&J&-xl$EB`lz=T=XxAsZ`;KaTw8Av0{QZnbbwNV3*>M&JlBUzl_M%w2Jb=zKXJXeX
zY5c?`u;_w!K~M(A*Zi0<2?YgEG?v<`s6mAmCqM3)AbUZm3mQh!N!Q|Y_N-{@$7F^$
z>QTjdZ~MDfKm)NS4-m-k`wky{sCI$>Ol~Z-(rjdS9GXA{xZ~+TR$5mKj}auQV9mA$
zAKK=btu%_Bcqix-1?%<7bZ<1q0M}Z|M^GXGi?Q|welnXbR)fU~VqiWQk08_5bQ!z@
z>eZ!+QnlejA#9hU-9EfCzvb^w{DOvyQ01Mi;=p3i;o4XuX~|<eMCzyv)YBcm#YAHh
zg|FXj0<biP9g+%bzKHJQVV(Xk{+{`^{{FFj5bfr`OELV9=bKgL_t}CskqkW~%9hC0
z{u2qfpOJu(|7)SZzL~gy>|METZEm7<kfSiW0sBo60S*r1mksB&WqHC|l`d{f$z5OB
zm5Uc(c|cwvgr?vV6Tf%|1DL!n`epa?Z@b}fl9<sps4>plYgPkWL!!}}(Hc++O!n7g
zgvbNKp~D>m-))?6STRsS=}~!|3NSNPEw7a5fhB#dvHC>vy(mxOPjlVt#gx<7VhH<H
z2W+HRMkY7HJUU{IN-K8V&ievZgG%a$C1;x%UtRHx3)q7V9XH~4%<^fh3@-bbEtDw3
zpLtL2W`^jl=CP`ct{LmhKrY3Kv*kzga_cj#@ia?RmIe&k-6`gb(bh(y6r~z|G*P?=
z4y5<gM|*t9JAPMa)EDlSk$<v=2xG*{{ugEcG(fvC_%~*Q7^-xxnQekGOZHbYPe&I!
zb4N$J>o?(+XxlDsT$$Y~^r{mB4+9`eyerEyBf}HKpxSyRdj)GDhpQ89a(N0s9%@c^
z*@yJ^@>@eE!eOdPlc<%stU%m#<u|RngueW97go64vxZ_wvFJ<;A0|DwgedseC|&#*
z;*~{NoCf!0fxOsq%CceWhsxyRukH;@_PrW<<jJfHI%AlaN8H3vt*SvEAJ(CH!LszU
zMS2d3V)Fc6)cP4Ph#@;<kc%u!!~$8)*n&E4W?5J9o$Kcw*Yg}v?AL~vIFUME+@vms
zStRya!p}~L2bUALO=x%eXb<kJ@Sn>Ly)U`ry}!Ik{ojekz9HzE8dqx&$nDQGZHQ2B
z4uTQc2_u~JAGSXsJlA|Cr>Hw-aN`CnoUp|$kP8bj&?1z2sMjfE@H=Edw@C@g5FXYi
z)?XY$gcVmRa;I?Og~`RSFWNE>GuA4iv8BjM&I3ExY2(+MN-f&idPhPpUK52!f&6+5
zrXmAd0Mg)ooU_F7*dn~nhk$kE2V{1k?iA#o!(!6Y$z)>r>c%N+NH>k`Q8@IzQf<zr
zG4;`XOWW=Xe>1&n_NqC>$ewV)Gr!?(7%@jpTAR&^>2PgKJL(M2RU>sp<0guXJ&0{5
z@_E9m<*gCOBb$1M{hA!J;nVb;K%r9=m-4969Zct^1b!Do`k6iQtcSoQ5(-_h@qEr+
zj<Q|mxW>UxLnoYljv}QpL(uZ8ifnPTCOh_T>Oi`>hVtzm5@%cv6(R<2F{pYSco5WF
z@kp@K$*3~0VB;$hxut%(-7(=}KjOE8j+hI89jP~G`WRv2>U$r@eH02_QtEz&SR4?Q
z>m}`G=Sy1A&3)5MPs&aR)XObX2xoCf?DMfO7uI4Sm81BuVbC5i6&K|ihqRYsYy;_9
zxs(@MYy-s_h${AbY^rz5RPfsBx63B`OXD~sBA+GQHJK*qG>JAzZuNcK0^AlC>-=@v
zUb;Pj!0_f!n)nVOG$tT*pb>%zQ`#$kz&M(3j;RzJfEV4j(6kdCkbv&{+En1#U|?LE
zD7P67R(0q(bhN}0&jROoC-28BflsT0>HYoWjjV0u$%xr!PV`45vCVd`Q9OgSYqh6K
zzflN$s1Fu@`07D<k{IfK_=WU`D_j;1o(G=i%)Pm_Q%VNDr0<`yWm*e8@3xe#@vhe!
z$87SNtWJnkBBdaFFLBqX%3v1tSW&0mIqPF^D)i|fO&djfzr<~p-xOFTvDiKbSUZ%1
zwL`+2%yqRka{*asy1CeZtz3U*ZPEi(Sfvf#y^LV?OsPVYdd0zNEh~>%1f~#vF>OfC
zI2@)~``u?8jn-7VC#I@>rlp%p84R5c!#OIWEIcshO-3kjsVj0|HJR=?OD3cUu;q^7
zPNuAG=^MshrO8hcPF4m$><SxgM-f!#u~|F>hYbPF8x0YjRC%i?JbEuJCpHSJa}^@`
z>j*XOWhjg#_&e@@gTyhG)i79`*(SF$v08+0VwlKUu6H9mGmIr>33G4g`k`(f{W+qg
zH10w=vUs{JPeK;rQeLT+bZD+u_pX%IYQCPl>KIj|XDasyNmCIqk9ve#&-(ci+IGC-
z`dk?}4}D!A&4<B?Ct|cMowHcvt_!?WZ6$x#N{1>m<V24An2%_{TVFIa(z}UFusW(F
zN@yRzV=L|NuO1Bj@s_xm7^gtGcI4$3QIyFj`~fEMFlSXK--Dd1mI^<L6XBYV)-Q2V
zMf#J6znrFGceC)^pT+_y^+-H`qTPn)zZP>hYmfuzSIxhPV#2V%-6&WO?WVaIYG6&_
z#_Z?k&(gD_uj)7_Nce*_@LPq2;XzI=*lfUKLeM7g-OeaI{RHu2Aiq3oxH=)}>;xww
zidz>aqQ{&Wsg5bK&UQ5pL2l2qUznJ}clueajCz|fGiSJr0{`9p_a{`;Cf%=RRHbbx
z9&-XB4J^+h9LR(ZKE}x`r+7P#atSC&%M)VZf$+fTeC&f;GZpz9cfR0~(@=(UO!6YK
z6dR^6`jS3eLeE$NWJi<Y#0POw8%oEhNZQGlM>x!s;l}s!(-8)aw~mFKejsIF9V$Ga
z3a*rou;?hh>`=8DR4n+gSde)cmcGX<6ocoG&OH*Bg}GP4j};LVgvv3YB2nbNQ(8xu
zUo9zkcScphwW+DYNkif5>8TUTx^RW29oMlg?^Q<QqgLImsg!dKxC1{u8~v9xkM=X~
z38MDDAp7*`NrQpT-o2|SEZ;fh?}WbY!JdM9S8WZ|X4pvrpRiiyY&@Dv{XK&#9lmUN
zK(;1{e9_UF!<opf=gPHPdu&tV@LtXtculyUANNzVN$EGO_pi|F6WY;=n77~rOg70U
z^v2b&P|;F5QRT115j+l!LIkw1M5G0nLCjBfSbgt4Rhb2fMbvZmD_BXleINy%l6J0H
zuo|v%jM}c!aGA6S`&?db@9$cvW@n0hcNae?niq_C=iM|T;deNKV?y@y2XuL>cwuv&
zF81m1fNtgY(?jLWLa8l?Rb{q>&)L<?)2mA@L`PS*v``%RuwZA@98ynx0LG+J0L~di
z0TV%n5RDK{197`g68>{KZE2VL8{5tc3f^FBFM4r}Mk*xH6dA-8${=*7RGwKgBS_|4
zpV8UZ?lC=yd$EU&5CG-YDs-A+KC&41K&E&c<BgPSPGG|rshl2JAD`e`J$h&WZchcI
zdrj$0JU<og%jaR!42Ukw$5L|)NbQ4eNN=Yi96WZ(DIdCTLQeH$JL@I2it#mjqJ-4g
z)n9?4q+ZB<qq(y#Pc_(BNopj&cSNdbc@&}t$Qq=|VR3Vm4jF^^r4CCG^)aTe$K4OB
z60II!dKA!ElEN1rC<-k~lMJJKoTI>jR`e~d0LNcaJrA-=z`o#MxbPga04HumVbI+W
zfzMrHcpYGtbs&h7ygj=rLr$6HRTc!ren*&<OZ)C!Yletu2CvwLA-qMhLs`ubN5MPW
z@z^&3yeuVl{4tDDXde+pEs=datn9+ALh^C<>4rR+Y<rKsBux~Ji*2eVX#u^eZH^hB
zGGW7o?h=wfJx0I0*qJ-ApWS1h4UYNre(_9s*6k0T!^G`554c0a%<Z;8EMwj(U!A2<
zrN<_uA+*zvHEqyJkg#6fGh*;@+5biL+Djr)5|d5B!lYw_V!#PCyInJL43sC6Ays<_
zq<#3b%r*jg0OD}fA26Ec%_&RKPv+A3&Y@b$$nN`<k1mO-_fF4!Iu2i??JF~ClnE8q
zTHh{ytlWD!DsQ9j0q0Px4lwARKZjE`d*LF|r8lS@8_WYj0N$-p^E7zUv&czq`SR<*
zQXvz7rim<c+>Qr3_{HEuyW^=jNa1;{5V1D`xb*0f!6mVTp_B~&c=Jp?X(!TrZ1RnD
z(n5D<zc(`WT|`F5F5|n8krq*C#gs0S`nXzr7PC2di<-A1EWMT2Iz`LldV@27bR;40
z6jWPXB->k-n!B43DGyusd3+rlR#C`G#FmkuMpMWF;xFn4@h&HjhMdsPWfDqoDdjjO
zQeCCnRV~w9Xd&`Qci8=s$|V@^U7si5nv3nCkd-$kv$<a!GS1XWNDSz2sl0oM2L-+D
zEyWn&Wt=Uh2B?Uzql}}z-Rl-E;AxNSPy@GSk~+x;85y|BdvP}E>KG~N9+tl{sxPZ&
z&YyrAQcd#Z9r<G8qj0w&CTZimlsyX#=lxZru=@gMZ3;gIcH9#);tQ`(-xzJ}k^ZUG
zX3H-BLzewTRH+yPY1Q)5VjVXRBhv?Q85KIF4Zu7hi9LS3SOYd)-J;fLLkABpr1*tk
zTY(}C3$FJnd6!V)lX!h5c{!4gU$w}1?s6RF?bK6NE<Is{EJA@QQ&oys4aOXG#tjDF
znm~}b5>bSgy)^8QkMn+~Ryo6~snCtncRhZM%ypHg?6rCZsD>p@Pn=CE2u<e6k=PI4
z=xhkR9#ScvEUUj@ES|GTdhw`Sb91`oNyEbN{KJ|bz5FqnZzd|9sFcqzPxWW(y4ZS1
zwU4n+WdRF!Cg-BFC;a&v1j>B(A>5ArTeI!AJ?^G?A>R*p<N+I=5n&As>pvrio3}ma
z=j>6KWDJ{_2}5>I*w_~|5wY0A18i*~v`R8q1rj~B0u;*55${aID_`v1N5RK;pDlv+
zzpsqVIXvfU+9;6Z8ss;6O^B*1{!odidBrw$sWM|e<ME_kDUh|+oJIT%%7O)T!aFJ?
zpIJUjJY22BrPjLRgt(#)d<q$C#(c>1$$|+6>n|v6Y`d&KHY#Yujh;NFW6|}ZVF&F?
z;3^Ec8u|pM2-NuwAUA|&2gkFgoZK(O`h@N@i_B+`%NeHZRW_<v86p3O^wWC(Blx8C
z(j{b{9rp*sLBs@TZyf27@%FpS7WFl3K;&BMI{F{{a_@%@SQWOvp2xc9+}ocZh2anv
ztTEV!<i^@0oJ+#0K_QYS)QR%I<wL}yOCG=j3AwSyog75Ic8vsmYI%-iUtDNaDx^dK
z`UvFUboF~~qoj`50%___0w_v3C$ZaEd7(5>!%Rn5lb>ugJGC>R%gmr~)8t^ZkX>j+
zT{9DS)ZoK9JIlRKHnPFeR8~>&rWj1!iIB>LQVb7W+Npk{*$dttr~OZxv9zRQUGRKv
z6hATr-z-GS8oT#N<XuRS93@7~D-@kZ`jDGT<vI)mf8E3hd8Y4?IO9Lr{N6<$nUwLh
zHQV+of7%ku_d($2{=KB*UXxnNnxfOUSz5Rdva%YS=pDmJvh1ZC&)2b$$Hr7oT%QV@
zSkAB?5RcjJ$kxnFJ;~y_9BsdHhuSXeTfPrPi38Eo3`9jz4oN5*M2v_$zw-al0c|>w
zCB4YW?>ITIli0K5$?`co6*)Ly2F&c+yOVq!OFvnUZVyroH07n{;7C<+>fzjR9M%?Q
zSpN>fTk*oTIB%<2xJ}T1AHpmgAFH`II{kSG?EBgU3WAN&Ot5P?E-a6KnVC5`U60XO
z2@f3SuE*#LqL6RVnPBD+NgbX7@+W##a0+wF9er->S}92J0tr{=eb`GPTPhH-1e}o^
zmeM)twFoMLk6+lTjFR#t9i<$u+VEo`j=DLr4=~$MNJMp(!Wh8EsC5do=H`)FD@dxz
z*6GVYATNi9WTYt6dpu|oevNkrWKCwb$C);REjF-K-4gF*dL%Hd>x@oV+^s$GgWLd>
zZ*Oek8>wryj)}S3G~8OwB+hh!WZvS^*4tc77jdn7){(0M`r?}!ecmPYC=?a0Z@?XR
ztY_0|CxEVDK~n;osl!saE?Odsr)%ZLkxL}F*uu%{wEBDOD)pK?wj|++{@5GA1~N>H
z#jDI~pPTA)j$HUHoNbZ_<fl`^?hjyZ$kjaS{z3C-I9v{e=W#8P#4|w5F762R1)D!*
zGeCS-O=S~ablZ7PWfOADbwuj9h)p`8*k)hqfPuDk$vHu<gyo)NlSqEKMSMxMRkL<w
zDjZ%s@p0?xQ2wP}$}(f753tR+@jK#eU~ejqCeVDu1QxMnSoOjCYs4Nqx`2L$Ok?0$
z32ZrGqg!}RAn7MVEAP-#BA<+IJC$L_^(Em$N-O`$?v^M(=Ii9#@bI=5waVIiIvoEF
zk&>yX?w4UmxsAjc4ptVe20<qbPQ6K{O*-84M8doyU=1*S1xJg4Lq|Rmi5LA>e6XcO
zs5L&2a^b<-`Chi9yF#<Xm}0etwxkO+9HY*JpT-g5&_bf7L-IB#6|7ji9Br@#9WjxL
z<D}uyDWm0h#O(0w7b}SQ2ZO8Q^zJmWip95{uUl}j54dQlkd^F86j2YPHds!ol|Ip?
zkWWF+cEZ4{8=AJQbSu$#693veDNTC105U4~ZSNS1V$Kc6m%YQ84eR`V-3ra(NL6U*
zq>k_iwwfPH$eXEp+Y-J58Y}G7@AKqnOI0E#=cz!*r+U5sz@<up$Md_Zeg{V_266Tj
z(ZyyIp<`Y|stFs?yiFHnK2(!ASdri2pWe^LY(7Kz!Fz%H_h?vhNc$VXqTvRM2F^_%
z_^TEbouGVu3k|uxg`SjiJcpl}RRF|!JfH|*JJ9fqOyTO;(EA|<mW#(F;dI{ajy1Wj
zU#>bGM*nd9vvzgbnntc>+Zb54uVJS6V>w|`hk$8I7Z57%Vr)_-?k8&BDghYw)avsC
z<_+>{_|!_eQjhK{Y`qNb2G4^C>7=B6x-FDF($@wC13eCun3tcDwmH;wqE?DTa%R!u
zVKL~a`cPWu>mO5R279e{DtVKWR!u!D_74J@X*YemM{Q=;xTWbV7Vmw5>QEPv?VGV0
zbxwR%yg)Y+u{7_}YFqdA<Zu!>Xb;olm3aZT<VMvPS_aTXd0Ap#m&RkUZ6YKKBHI?H
ze|3n@rOy9B%4(eL^^5h(qJh^f%mrT((3TY^<<!JNL+8~g{S0|3GAxA&z*+@p;sJp3
zeFBCU3IQ7`Gz=Djg{X{GTR0Gb!ZY<U3vT+2qFye?cjSq~=NM{=@Cbx(|6L@9MdL5)
zUnKM28~n3mUTeEI>$)mNz#@FBNB-UEpIYxN3J$K~7v`rR`Tse)mPBvXwX+UbD*Y5l
zZ$16<bk`E;&AJ9G-T?mnfPYG+w?Nom3oy4ZKR2+yChh+`@3pdcv##B<z*HF6K7Z30
zZ;yUcvi?*WZ_#jY+W)}*)*XK{cC*W$s?YCTa8Q4q;2#>$?;ZXZg%yCGJKV|?e?R`8
zLd9>#Q-6j1mMwn!{r6$<_XY{DngQd^_1Eb58~VSF%!mJ%Zv7i}<MGe7{@bI$Ut#}g
z@_(ZLY%#ylm@v8R*NAZ2aQ=k<SrdQ5Ul9FDm0Pu5O%eI$yM_xJKi<JIlAh$+{Qm&6
CKUOXP

literal 0
HcmV?d00001

diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/protocols/.gitkeep b/assays/BacterialGenomeAssemblyAndAnnotation/protocols/.gitkeep
new file mode 100644
index 0000000..e69de29
diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md b/assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md
new file mode 100644
index 0000000..7a78805
--- /dev/null
+++ b/assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md
@@ -0,0 +1,3 @@
+## Bacterial genome assembly and annotation
+
+Paired-end Illumina reads were first length-trimmed and quality-filtered using Trimmomatic (Bolger, A. M. et al., 2014). Reads were then assembled using the A5 assembly pipeline (Tritt, A. et al., 2012), which uses the IDBA algorithm (Peng, Y. et al., 2012) to assemble error-corrected reads. Detailed assembly statistics and corresponding metadata can be found in Supplementary Data 2. Genomes with multi-modal *k*-mer and GC content distributions, or with multiple instances of marker genes from diverse taxonomic groups, were flagged as not originating from clonal cultures. These samples were processed using a metagenome binning approach (Pasolli, E. et al., 2019). Briefly, contigs from each metagenome sample were clustered using METABAT2 (Kang, D. D. et al., 2019), and the completeness and contamination of each metagenome-assembled genome (MAG) were then assessed using CheckM (Parks, D. H. et al., 2015). Only bins with completeness scores greater than 75% and contamination rates lower than 5% were retained and added to the collection (designated MAG in the column ‘type’ of Supplementary Data 2). Functional annotation of genes was conducted using Prokka together with a custom database based on Kyoto Encyclopedia of Genes and Genomes (KEGG) orthologue groups (Kanehisa, M. et al., 2014), downloaded from the KEGG FTP server in November 2019. Hits to sequences in the database were filtered using an E value threshold of 10 × 10⁻⁹ and a minimum coverage of 80% of the length of the query sequence.
\ No newline at end of file
-- 
GitLab