From adbceeba24d75e7ca56ae5ef9e87e88bf7afdcf8 Mon Sep 17 00:00:00 2001 From: Viktoria Petrova <vipet103@hhu.de> Date: Sat, 2 Nov 2024 14:54:25 +0100 Subject: [PATCH] add bacterial genome assembly assay and protocol --- .../README.md | 0 .../dataset/.gitkeep | 0 .../isa.assay.xlsx | Bin 0 -> 6911 bytes .../protocols/.gitkeep | 0 ...cterialGenomeAssemblyAndAnnotationProtocol.md | 3 +++ 5 files changed, 3 insertions(+) create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/README.md create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/dataset/.gitkeep create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/isa.assay.xlsx create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/protocols/.gitkeep create mode 100644 assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/README.md b/assays/BacterialGenomeAssemblyAndAnnotation/README.md new file mode 100644 index 0000000..e69de29 diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/dataset/.gitkeep b/assays/BacterialGenomeAssemblyAndAnnotation/dataset/.gitkeep new file mode 100644 index 0000000..e69de29 diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/isa.assay.xlsx b/assays/BacterialGenomeAssemblyAndAnnotation/isa.assay.xlsx new file mode 100644 index 0000000000000000000000000000000000000000..32b9da3d095f128a27995a6d20d3f525ff96bc09 GIT binary patch literal 6911 zcmai3WmuG3*QOb|L%OBAL`p(n=xzxanxRWVM7leqkq{)LV<-uwI}}hPq>&z4zd>H~ z9M5^bcmJ4cu4mT0*S_wxp0(D#M@<nC2^|g&4gkjsHrJ1z>p;_ohlATgf`cP~JvEee z1iOL2ZYEmZP9RrfPA>=h@+4(&CpT{JiJ0sP2N2U(OIx-xzjlDQ<OpuUONyOh?jWsw z&J&-xl$EB`lz=T=XxAsZ`;KaTw8Av0{QZnbbwNV3*>M&JlBUzl_M%w2Jb=zKXJXeX zY5c?`u;_w!K~M(A*Zi0<2?YgEG?v<`s6mAmCqM3)AbUZm3mQh!N!Q|Y_N-{@$7F^$ z>QTjdZ~MDfKm)NS4-m-k`wky{sCI$>Ol~Z-(rjdS9GXA{xZ~+TR$5mKj}auQV9mA$ zAKK=btu%_Bcqix-1?%<7bZ<1q0M}Z|M^GXGi?Q|welnXbR)fU~VqiWQk08_5bQ!z@ 
z>eZ!+QnlejA#9hU-9EfCzvb^w{DOvyQ01Mi;=p3i;o4XuX~|<eMCzyv)YBcm#YAHh zg|FXj0<biP9g+%bzKHJQVV(Xk{+{`^{{FFj5bfr`OELV9=bKgL_t}CskqkW~%9hC0 z{u2qfpOJu(|7)SZzL~gy>|METZEm7<kfSiW0sBo60S*r1mksB&WqHC|l`d{f$z5OB zm5Uc(c|cwvgr?vV6Tf%|1DL!n`epa?Z@b}fl9<sps4>plYgPkWL!!}}(Hc++O!n7g zgvbNKp~D>m-))?6STRsS=}~!|3NSNPEw7a5fhB#dvHC>vy(mxOPjlVt#gx<7VhH<H z2W+HRMkY7HJUU{IN-K8V&ievZgG%a$C1;x%UtRHx3)q7V9XH~4%<^fh3@-bbEtDw3 zpLtL2W`^jl=CP`ct{LmhKrY3Kv*kzga_cj#@ia?RmIe&k-6`gb(bh(y6r~z|G*P?= z4y5<gM|*t9JAPMa)EDlSk$<v=2xG*{{ugEcG(fvC_%~*Q7^-xxnQekGOZHbYPe&I! zb4N$J>o?(+XxlDsT$$Y~^r{mB4+9`eyerEyBf}HKpxSyRdj)GDhpQ89a(N0s9%@c^ z*@yJ^@>@eE!eOdPlc<%stU%m#<u|RngueW97go64vxZ_wvFJ<;A0|DwgedseC|&#* z;*~{NoCf!0fxOsq%CceWhsxyRukH;@_PrW<<jJfHI%AlaN8H3vt*SvEAJ(CH!LszU zMS2d3V)Fc6)cP4Ph#@;<kc%u!!~$8)*n&E4W?5J9o$Kcw*Yg}v?AL~vIFUME+@vms zStRya!p}~L2bUALO=x%eXb<kJ@Sn>Ly)U`ry}!Ik{ojekz9HzE8dqx&$nDQGZHQ2B z4uTQc2_u~JAGSXsJlA|Cr>Hw-aN`CnoUp|$kP8bj&?1z2sMjfE@H=Edw@C@g5FXYi z)?XY$gcVmRa;I?Og~`RSFWNE>GuA4iv8BjM&I3ExY2(+MN-f&idPhPpUK52!f&6+5 zrXmAd0Mg)ooU_F7*dn~nhk$kE2V{1k?iA#o!(!6Y$z)>r>c%N+NH>k`Q8@IzQf<zr zG4;`XOWW=Xe>1&n_NqC>$ewV)Gr!?(7%@jpTAR&^>2PgKJL(M2RU>sp<0guXJ&0{5 z@_E9m<*gCOBb$1M{hA!J;nVb;K%r9=m-4969Zct^1b!Do`k6iQtcSoQ5(-_h@qEr+ zj<Q|mxW>UxLnoYljv}QpL(uZ8ifnPTCOh_T>Oi`>hVtzm5@%cv6(R<2F{pYSco5WF z@kp@K$*3~0VB;$hxut%(-7(=}KjOE8j+hI89jP~G`WRv2>U$r@eH02_QtEz&SR4?Q z>m}`G=Sy1A&3)5MPs&aR)XObX2xoCf?DMfO7uI4Sm81BuVbC5i6&K|ihqRYsYy;_9 zxs(@MYy-s_h${AbY^rz5RPfsBx63B`OXD~sBA+GQHJK*qG>JAzZuNcK0^AlC>-=@v zUb;Pj!0_f!n)nVOG$tT*pb>%zQ`#$kz&M(3j;RzJfEV4j(6kdCkbv&{+En1#U|?LE zD7P67R(0q(bhN}0&jROoC-28BflsT0>HYoWjjV0u$%xr!PV`45vCVd`Q9OgSYqh6K zzflN$s1Fu@`07D<k{IfK_=WU`D_j;1o(G=i%)Pm_Q%VNDr0<`yWm*e8@3xe#@vhe! 
z$87SNtWJnkBBdaFFLBqX%3v1tSW&0mIqPF^D)i|fO&djfzr<~p-xOFTvDiKbSUZ%1 zwL`+2%yqRka{*asy1CeZtz3U*ZPEi(Sfvf#y^LV?OsPVYdd0zNEh~>%1f~#vF>OfC zI2@)~``u?8jn-7VC#I@>rlp%p84R5c!#OIWEIcshO-3kjsVj0|HJR=?OD3cUu;q^7 zPNuAG=^MshrO8hcPF4m$><SxgM-f!#u~|F>hYbPF8x0YjRC%i?JbEuJCpHSJa}^@` z>j*XOWhjg#_&e@@gTyhG)i79`*(SF$v08+0VwlKUu6H9mGmIr>33G4g`k`(f{W+qg zH10w=vUs{JPeK;rQeLT+bZD+u_pX%IYQCPl>KIj|XDasyNmCIqk9ve#&-(ci+IGC- z`dk?}4}D!A&4<B?Ct|cMowHcvt_!?WZ6$x#N{1>m<V24An2%_{TVFIa(z}UFusW(F zN@yRzV=L|NuO1Bj@s_xm7^gtGcI4$3QIyFj`~fEMFlSXK--Dd1mI^<L6XBYV)-Q2V zMf#J6znrFGceC)^pT+_y^+-H`qTPn)zZP>hYmfuzSIxhPV#2V%-6&WO?WVaIYG6&_ z#_Z?k&(gD_uj)7_Nce*_@LPq2;XzI=*lfUKLeM7g-OeaI{RHu2Aiq3oxH=)}>;xww zidz>aqQ{&Wsg5bK&UQ5pL2l2qUznJ}clueajCz|fGiSJr0{`9p_a{`;Cf%=RRHbbx z9&-XB4J^+h9LR(ZKE}x`r+7P#atSC&%M)VZf$+fTeC&f;GZpz9cfR0~(@=(UO!6YK z6dR^6`jS3eLeE$NWJi<Y#0POw8%oEhNZQGlM>x!s;l}s!(-8)aw~mFKejsIF9V$Ga z3a*rou;?hh>`=8DR4n+gSde)cmcGX<6ocoG&OH*Bg}GP4j};LVgvv3YB2nbNQ(8xu zUo9zkcScphwW+DYNkif5>8TUTx^RW29oMlg?^Q<QqgLImsg!dKxC1{u8~v9xkM=X~ z38MDDAp7*`NrQpT-o2|SEZ;fh?}WbY!JdM9S8WZ|X4pvrpRiiyY&@Dv{XK	lmUN zK(;1{e9_UF!<opf=gPHPdu&tV@LtXtculyUANNzVN$EGO_pi|F6WY;=n77~rOg70U z^v2b&P|;F5QRT115j+l!LIkw1M5G0nLCjBfSbgt4Rhb2fMbvZmD_BXleINy%l6J0H zuo|v%jM}c!aGA6S`&?db@9$cvW@n0hcNae?niq_C=iM|T;deNKV?y@y2XuL>cwuv& zF81m1fNtgY(?jLWLa8l?Rb{q>&)L<?)2mA@L`PS*v``%RuwZA@98ynx0LG+J0L~di z0TV%n5RDK{197`g68>{KZE2VL8{5tc3f^FBFM4r}Mk*xH6dA-8${=*7RGwKgBS_|4 zpV8UZ?lC=yd$EU&5CG-YDs-A+KC&41K&E&c<BgPSPGG|rshl2JAD`e`J$h&WZchcI zdrj$0JU<og%jaR!42Ukw$5L|)NbQ4eNN=Yi96WZ(DIdCTLQeH$JL@I2it#mjqJ-4g z)n9?4q+ZB<qq(y#Pc_(BNopj&cSNdbc@&}t$Qq=|VR3Vm4jF^^r4CCG^)aTe$K4OB z60II!dKA!ElEN1rC<-k~lMJJKoTI>jR`e~d0LNcaJrA-=z`o#MxbPga04HumVbI+W zfzMrHcpYGtbs&h7ygj=rLr$6HRTc!ren*&<OZ)C!Yletu2CvwLA-qMhLs`ubN5MPW z@z^&3yeuVl{4tDDXde+pEs=datn9+ALh^C<>4rR+Y<rKsBux~Ji*2eVX#u^eZH^hB zGGW7o?h=wfJx0I0*qJ-ApWS1h4UYNre(_9s*6k0T!^G`554c0a%<Z;8EMwj(U!A2< zrN<_uA+*zvHEqyJkg#6fGh*;@+5biL+Djr)5|d5B!lYw_V!#PCyInJL43sC6Ays<_ 
zq<#3b%r*jg0OD}fA26Ec%_&RKPv+A3&Y@b$$nN`<k1mO-_fF4!Iu2i??JF~ClnE8q zTHh{ytlWD!DsQ9j0q0Px4lwARKZjE`d*LF|r8lS@8_WYj0N$-p^E7zUv&czq`SR<* zQXvz7rim<c+>Qr3_{HEuyW^=jNa1;{5V1D`xb*0f!6mVTp_B~&c=Jp?X(!TrZ1RnD z(n5D<zc(`WT|`F5F5|n8krq*C#gs0S`nXzr7PC2di<-A1EWMT2Iz`LldV@27bR;40 z6jWPXB->k-n!B43DGyusd3+rlR#C`G#FmkuMpMWF;xFn4@h&HjhMdsPWfDqoDdjjO zQeCCnRV~w9Xd&`Qci8=s$|V@^U7si5nv3nCkd-$kv$<a!GS1XWNDSz2sl0oM2L-+D zEyWn&Wt=Uh2B?Uzql}}z-Rl-E;AxNSPy@GSk~+x;85y|BdvP}E>KG~N9+tl{sxPZ& z&YyrAQcd#Z9r<G8qj0w&CTZimlsyX#=lxZru=@gMZ3;gIcH9#);tQ`(-xzJ}k^ZUG zX3H-BLzewTRH+yPY1Q)5VjVXRBhv?Q85KIF4Zu7hi9LS3SOYd)-J;fLLkABpr1*tk zTY(}C3$FJnd6!V)lX!h5c{!4gU$w}1?s6RF?bK6NE<Is{EJA@QQ&oys4aOXG#tjDF znm~}b5>bSgy)^8QkMn+~Ryo6~snCtncRhZM%ypHg?6rCZsD>p@Pn=CE2u<e6k=PI4 z=xhkR9#ScvEUUj@ES|GTdhw`Sb91`oNyEbN{KJ|bz5FqnZzd|9sFcqzPxWW(y4ZS1 zwU4n+WdRF!Cg-BFC;a&v1j>B(A>5ArTeI!AJ?^G?A>R*p<N+I=5n&As>pvrio3}ma z=j>6KWDJ{_2}5>I*w_~|5wY0A18i*~v`R8q1rj~B0u;*55${aID_`v1N5RK;pDlv+ zzpsqVIXvfU+9;6Z8ss;6O^B*1{!odidBrw$sWM|e<ME_kDUh|+oJIT%%7O)T!aFJ? zpIJUjJY22BrPjLRgt(#)d<q$C#(c>1$$|+6>n|v6Y`d&KHY#Yujh;NFW6|}ZVF&F? 
z;3^Ec8u|pM2-NuwAUA|&2gkFgoZK(O`h@N@i_B+`%NeHZRW_<v86p3O^wWC(Blx8C z(j{b{9rp*sLBs@TZyf27@%FpS7WFl3K;&BMI{F{{a_@%@SQWOvp2xc9+}ocZh2anv ztTEV!<i^@0oJ+#0K_QYS)QR%I<wL}yOCG=j3AwSyog75Ic8vsmYI%-iUtDNaDx^dK z`UvFUboF~~qoj`50%___0w_v3C$ZaEd7(5>!%Rn5lb>ugJGC>R%gmr~)8t^ZkX>j+ zT{9DS)ZoK9JIlRKHnPFeR8~>&rWj1!iIB>LQVb7W+Npk{*$dttr~OZxv9zRQUGRKv z6hATr-z-GS8oT#N<XuRS93@7~D-@kZ`jDGT<vI)mf8E3hd8Y4?IO9Lr{N6<$nUwLh zHQV+of7%ku_d($2{=KB*UXxnNnxfOUSz5Rdva%YS=pDmJvh1ZC&)2b$$Hr7oT%QV@ zSkAB?5RcjJ$kxnFJ;~y_9BsdHhuSXeTfPrPi38Eo3`9jz4oN5*M2v_$zw-al0c|>w zCB4YW?>ITIli0K5$?`co6*)Ly2F&c+yOVq!OFvnUZVyroH07n{;7C<+>fzjR9M%?Q zSpN>fTk*oTIB%<2xJ}T1AHpmgAFH`II{kSG?EBgU3WAN&Ot5P?E-a6KnVC5`U60XO z2@f3SuE*#LqL6RVnPBD+NgbX7@+W##a0+wF9er->S}92J0tr{=eb`GPTPhH-1e}o^ zmeM)twFoMLk6+lTjFR#t9i<$u+VEo`j=DLr4=~$MNJMp(!Wh8EsC5do=H`)FD@dxz z*6GVYATNi9WTYt6dpu|oevNkrWKCwb$C);REjF-K-4gF*dL%Hd>x@oV+^s$GgWLd> zZ*Oek8>wryj)}S3G~8OwB+hh!WZvS^*4tc77jdn7){(0M`r?}!ecmPYC=?a0Z@?XR ztY_0|CxEVDK~n;osl!saE?Odsr)%ZLkxL}F*uu%{wEBDOD)pK?wj|++{@5GA1~N>H z#jDI~pPTA)j$HUHoNbZ_<fl`^?hjyZ$kjaS{z3C-I9v{e=W#8P#4|w5F762R1)D!* zGeCS-O=S~ablZ7PWfOADbwuj9h)p`8*k)hqfPuDk$vHu<gyo)NlSqEKMSMxMRkL<w zDjZ%s@p0?xQ2wP}$}(f753tR+@jK#eU~ejqCeVDu1QxMnSoOjCYs4Nqx`2L$Ok?0$ z32ZrGqg!}RAn7MVEAP-#BA<+IJC$L_^(Em$N-O`$?v^M(=Ii9#@bI=5waVIiIvoEF zk&>yX?w4UmxsAjc4ptVe20<qbPQ6K{O*-84M8doyU=1*S1xJg4Lq|Rmi5LA>e6XcO zs5L&2a^b<-`Chi9yF#<Xm}0etwxkO+9HY*JpT-g5&_bf7L-IB#6|7ji9Br@#9WjxL z<D}uyDWm0h#O(0w7b}SQ2ZO8Q^zJmWip95{uUl}j54dQlkd^F86j2YPHds!ol|Ip? 
zkWWF+cEZ4{8=AJQbSu$#693veDNTC105U4~ZSNS1V$Kc6m%YQ84eR`V-3ra(NL6U* zq>k_iwwfPH$eXEp+Y-J58Y}G7@AKqnOI0E#=cz!*r+U5sz@<up$Md_Zeg{V_266Tj z(ZyyIp<`Y|stFs?yiFHnK2(!ASdri2pWe^LY(7Kz!Fz%H_h?vhNc$VXqTvRM2F^_% z_^TEbouGVu3k|uxg`SjiJcpl}RRF|!JfH|*JJ9fqOyTO;(EA|<mW#(F;dI{ajy1Wj zU#>bGM*nd9vvzgbnntc>+Zb54uVJS6V>w|`hk$8I7Z57%Vr)_-?k8&BDghYw)avsC z<_+>{_|!_eQjhK{Y`qNb2G4^C>7=B6x-FDF($@wC13eCun3tcDwmH;wqE?DTa%R!u zVKL~a`cPWu>mO5R279e{DtVKWR!u!D_74J@X*YemM{Q=;xTWbV7Vmw5>QEPv?VGV0 zbxwR%yg)Y+u{7_}YFqdA<Zu!>Xb;olm3aZT<VMvPS_aTXd0Ap#m&RkUZ6YKKBHI?H ze|3n@rOy9B%4(eL^^5h(qJh^f%mrT((3TY^<<!JNL+8~g{S0|3GAxA&z*+@p;sJp3 zeFBCU3IQ7`Gz=Djg{X{GTR0Gb!ZY<U3vT+2qFye?cjSq~=NM{=@Cbx(|6L@9MdL5) zUnKM28~n3mUTeEI>$)mNz#@FBNB-UEpIYxN3J$K~7v`rR`Tse)mPBvXwX+UbD*Y5l zZ$16<bk`E;&AJ9G-T?mnfPYG+w?Nom3oy4ZKR2+yChh+`@3pdcv##B<z*HF6K7Z30 zZ;yUcvi?*WZ_#jY+W)}*)*XK{cC*W$s?YCTa8Q4q;2#>$?;ZXZg%yCGJKV|?e?R`8 zLd9>#Q-6j1mMwn!{r6$<_XY{DngQd^_1Eb58~VSF%!mJ%Zv7i}<MGe7{@bI$Ut#}g z@_(ZLY%#ylm@v8R*NAZ2aQ=k<SrdQ5Ul9FDm0Pu5O%eI$yM_xJKi<JIlAh$+{Qm&6 CKUOXP literal 0 HcmV?d00001 diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/protocols/.gitkeep b/assays/BacterialGenomeAssemblyAndAnnotation/protocols/.gitkeep new file mode 100644 index 0000000..e69de29 diff --git a/assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md b/assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md new file mode 100644 index 0000000..7a78805 --- /dev/null +++ b/assays/BacterialGenomeAssemblyAndAnnotation/protocols/BacterialGenomeAssemblyAndAnnotationProtocol.md @@ -0,0 +1,3 @@ +## Bacterial genome assembly and annotation + +Paired-end Illumina reads were first subjected to length-trimming and quality-filtering using Trimmomatic (Bolger, A. M. et al., 2014). Reads were assembled using the A5 assembly pipeline (Tritt, A. et al, 2012), which uses the IDBA algorithm (Peng, Y. et al., 2012) to assemble error-corrected reads. 
Detailed assembly statistics and corresponding metadata can be found in Supplementary Data 2. Genomes with multi-modal *k*-mer and GC content distributions or multiple instances of marker genes from diverse taxonomic groups were flagged as not originating from clonal cultures. These samples were processed using a metagenome binning approach (Pasolli, E. et al., 2019). Briefly, contigs from each metagenome sample were clustered using MetaBAT 2 (Kang, D. D. et al., 2019), followed by an assessment of completeness and contamination of each metagenome-assembled genome using CheckM (Parks, D. H. et al., 2015). Only bins with completeness scores above 75% and contamination rates below 5% were retained and added to the collection (Supplementary Data 2, designated metagenome-assembled genome (MAG) in the column ‘type’). Functional annotation of genes was conducted using Prokka and a custom database based on Kyoto Encyclopedia of Genes and Genomes (KEGG) orthologue groups (Kanehisa, M. et al., 2014) downloaded from the KEGG FTP server in November 2019. Hits to sequences in the database were filtered using an *E*-value threshold of 10 × 10⁻⁹ and a minimum coverage of 80% of the length of the query sequence. \ No newline at end of file -- GitLab