{"id":1650,"date":"2024-01-12T15:35:50","date_gmt":"2024-01-12T13:35:50","guid":{"rendered":"https:\/\/bioinfo2.ugr.es\/bioinfo\/?page_id=1650"},"modified":"2024-01-31T18:31:11","modified_gmt":"2024-01-31T16:31:11","slug":"customizar_base_de_datos","status":"publish","type":"page","link":"https:\/\/bioinfo2.ugr.es\/bioinfo\/customizar_base_de_datos\/","title":{"rendered":"customizar_base_de_datos"},"content":{"rendered":"\n<p>Mediante un Blast local podemos buscar secuencias dentro de una base de datos customizada<\/p>\n<h2>Aplicaciones:<\/h2>\n<ul>\n<li>Buscar la existencia de un gen dentro de un genoma (frecuentemente dentro de un ensambldo <em>de novo<\/em>)<\/li>\n<li>Anotar genes predichos<\/li>\n<\/ul>\n<h2>Paso 1: Generar una base de datos<\/h2>\n<p>En este caso, generamos una base de datos usando el ensamblado a26.fa<\/p>\n<pre><br \/>makeblastdb -in \/home\/biocomp\/bioinfo_genomas\/a26_1000.fa -parse_seqids -title \"26_1000\" -out a26 -dbtype nucl<\/pre>\n<h2>Paso 2: Buscar una secuencia dentro del ensamblado<br \/><br \/><\/h2>\n<p>Descargar la secuencia de un gene de la accession: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/gene\/?term=NC_007795\" target=\"_blank\" rel=\"noopener\">NC_007795<\/a><\/p>\n<p>Por ejemplo la beta-galactosidasa: <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/nuccore\/NC_007795.1?from=2273155&amp;to=2274567&amp;report=fasta&amp;strand=true\" target=\"_blank\" rel=\"noopener\">CDS<\/a> o <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/protein\/YP_500916.1\" target=\"_blank\" rel=\"noopener\">proteina<\/a><\/p>\n<h3>Paso 3: Ejectutar Blastn<\/h3>\n<p>Siendo beta-gal.fa el fichero del gen que hemos descargado:<\/p>\n<pre>blastn -query beta-gal.fa -db a26 -max_target_seqs 5 -outfmt 6 -perc_identity 90 -evalue 1E-10 <br \/><br \/><\/pre>\n<p><a href=\"https:\/\/www.metagenomics.wiki\/tools\/blast\/blastn-output-format-6\" target=\"_blank\" rel=\"noopener\">Formato de salida (outfmt 6)<\/a>\u00a0<\/p>\n<h3>Resultado<\/h3>\n<p>NC_007795.1:c2274567-2273155 NODE_2_length_304732_cov_60.346849 100.000 1413 0 0 11413 296103 297515 0.0 2610<\/p>\n<p>&#8211;&gt; el gen de la beta-galactosidasa se ubica en el contig &#8216;NODE_2&#8217; entre las coordenadas 296103-297515<\/p>\n<h4>Parametros adicionales del blast<\/h4>\n<p>-qcov_hsp_perc &#8216;n\u00famero&#8217; &#8211;&gt; cubertura m\u00ednima de la secuencia de entrada<\/p>\n<h4>Customizar la salida del blast<\/h4>\n<p>-outfmt &#8220;6 qseqid qlen&#8221; &#8211;&gt; ID y longitud de la secuencia de entrada.<\/p>\n<p>Abajo una lista de todas las columnas posibles:<\/p>\n<p>qseqid means Query Seq-id<br \/>qgi means Query GI<br \/>qacc means Query accesion<br \/>qaccver means Query accesion.version<br \/>qlen means Query sequence length<br \/>sseqid means Subject Seq-id<br \/>sallseqid means All subject Seq-id(s), separated by a &#8216;;&#8217;<br \/>sgi means Subject GI<br \/>sallgi means All subject GIs<br \/>sacc means Subject accession<br \/>saccver means Subject accession.version<br \/>sallacc means All subject accessions<br \/>slen means Subject sequence length<br \/>qstart means Start of alignment in query<br \/>qend means End of alignment in query<br \/>sstart means Start of alignment in subject<br \/>send means End of alignment in subject<br \/>qseq means Aligned part of query sequence<br \/>sseq means Aligned part of subject sequence<br \/>evalue means Expect value<br \/>bitscore means Bit score<br \/>score means Raw score<br \/>length means Alignment length<br \/>pident means Percentage of identical matches<br \/>nident means Number of identical matches<br \/>mismatch means Number of mismatches<br \/>positive means Number of positive-scoring matches<br \/>gapopen means Number of gap openings<br \/>gaps means Total number of gaps<br \/>ppos means Percentage of positive-scoring matches<br \/>frames means Query and subject frames separated by a &#8216;\/&#8217;<br \/>qframe means Query frame<br \/>sframe means Subject frame<br \/>btop means Blast traceback operations (BTOP)<br \/>staxid means Subject Taxonomy ID<br \/>ssciname means Subject Scientific Name<br \/>scomname means Subject Common Name<br \/>sblastname means Subject Blast Name<br \/>sskingdom means Subject Super Kingdom<br \/>staxids means unique Subject Taxonomy ID(s), separated by a &#8216;;&#8217;<br \/>(in numerical order)<br \/>sscinames means unique Subject Scientific Name(s), separated by a &#8216;;&#8217;<br \/>scomnames means unique Subject Common Name(s), separated by a &#8216;;&#8217;<br \/>sblastnames means unique Subject Blast Name(s), separated by a &#8216;;&#8217;<br \/>(in alphabetical order)<br \/>sskingdoms means unique Subject Super Kingdom(s), separated by a &#8216;;&#8217;<br \/>(in alphabetical order)<br \/>stitle means Subject Title<br \/>salltitles means All Subject Title(s), separated by a &#8216;&lt;&gt;&#8217;<br \/>sstrand means Subject Strand<br \/>qcovs means Query Coverage Per Subject<br \/>qcovhsp means Query Coverage Per HSP<br \/>qcovus means Query Coverage Per Unique Subject (blastn only)<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n\n\n","protected":false},"excerpt":{"rendered":"<p>Mediante un Blast local podemos buscar secuencias dentro de una base de datos customizada Aplicaciones: Buscar la existencia de un gen dentro de un genoma (frecuentemente dentro de un ensambldo de novo) Anotar genes predichos Paso 1: Generar una base de datos En este caso, generamos una base de datos usando el ensamblado a26.fa makeblastdb &hellip; <a href=\"https:\/\/bioinfo2.ugr.es\/bioinfo\/customizar_base_de_datos\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/pages\/1650"}],"collection":[{"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/comments?post=1650"}],"version-history":[{"count":11,"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/pages\/1650\/revisions"}],"predecessor-version":[{"id":1852,"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/pages\/1650\/revisions\/1852"}],"wp:attachment":[{"href":"https:\/\/bioinfo2.ugr.es\/bioinfo\/wp-json\/wp\/v2\/media?parent=1650"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}