<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[下载SRA数据过程记录]]></title><description><![CDATA[<h1>SRA数据下载</h1>
<h2>拆解说明</h2>
<p dir="auto">1） 首先肯定是找到一个SRA号啦</p>
<p dir="auto">2） 登录 <a href="https://sra-explorer.info/" rel="nofollow ugc">https://sra-explorer.info/</a>? 网站，输入SRA号进行搜索</p>
<p dir="auto"><img src="/assets/uploads/files/1608887881134-27b7b0c9-aadc-4762-a210-e8ebe754d8b5-image.png" alt="27b7b0c9-aadc-4762-a210-e8ebe754d8b5-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">3）筛选需要下载的数据，并保存到 datasets 里</p>
<p dir="auto"><img src="/assets/uploads/files/1608888242148-79263f8e-449e-4e0a-a152-2947c6d43e66-image.png" alt="79263f8e-449e-4e0a-a152-2947c6d43e66-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">4）拷贝脚本命令</p>
<p dir="auto"><img src="/assets/uploads/files/1608888520678-b84c056e-88d5-4e3a-aa3e-c5b170c76324-image.png" alt="b84c056e-88d5-4e3a-aa3e-c5b170c76324-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">5） 到装有aspera的服务器上执行</p>
<pre><code>ascp -QT -l 300m -P33001 -i $HOME/.aspera/connect/etc/asperaweb_id_dsa.openssh era-fasp@fasp.sra.ebi.ac.uk:vol1/fastq/SRR828/SRR828261/SRR828261.fastq.gz . &amp;&amp; mv SRR828261.fastq.gz SRR828261_The_population_structure_and_recent_colonization_history_of_Oregon_threespine_stickleback_determined_using_RAD-seq.fastq.gz

ascp -QT -l 300m -P33001 -i $HOME/.aspera/connect/etc/asperaweb_id_dsa.openssh era-fasp@fasp.sra.ebi.ac.uk:vol1/fastq/SRR828/SRR828262/SRR828262.fastq.gz . &amp;&amp; mv SRR828262.fastq.gz SRR828262_The_population_structure_and_recent_colonization_history_of_Oregon_threespine_stickleback_determined_using_RAD-seq.fastq.gz

</code></pre>
<h2>脚本批量下载</h2>
<pre><code class="language-sh"># vim list.txt ,将SRR号粘贴到文件里，每行一个SRR号

srr_no=$(cat list.txt)
# 指定下载文件夹路径
base_dir="/data_raid1/gene_data/data_archive/radseq_sra070979"


for f in $srr_no
do
        filename=$base_dir/${f}.fastq.gz
        if [ ! -f $filename ];then
                ascp -QT -l 300m -P33001 -i \
                        ~/.aspera/connect/etc/asperaweb_id_dsa.openssh \
                        era-fasp@fasp.sra.ebi.ac.uk:vol1/fastq/${f:0:6}/${f}/${f}.fastq.gz . 
                echo "download over ${f}"
                if [ ! -f ${f}.fastq.gz ];then
                        echo "ascp raise error"
                else
                        mv ${f}.fastq.gz /data_raid1/gene_data/data_archive/radseq_sra070979/
                        echo "mv success"
                fi
        fi
done

</code></pre>
]]></description><link>http://an.forum.genostack.com/topic/145/下载sra数据过程记录</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 12:31:36 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/145.rss" rel="self" type="application/rss+xml"/><pubDate>Fri, 25 Dec 2020 09:40:32 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 下载SRA数据过程记录 on Thu, 28 Jan 2021 01:56:07 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="http://an.forum.genostack.com/uid/3">@ice-melt</a></p>
<p dir="auto">sra数据查询下载</p>
<p dir="auto">地址： <a href="https://www.ncbi.nlm.nih.gov/sra" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/sra</a><br />
使用说明： <a href="https://www.jianshu.com/p/680e8d720516" rel="nofollow ugc">https://www.jianshu.com/p/680e8d720516</a></p>
]]></description><link>http://an.forum.genostack.com/post/351</link><guid isPermaLink="true">http://an.forum.genostack.com/post/351</guid><dc:creator><![CDATA[ice-melt]]></dc:creator><pubDate>Thu, 28 Jan 2021 01:56:07 GMT</pubDate></item><item><title><![CDATA[Reply to 下载SRA数据过程记录 on Thu, 21 Jan 2021 08:44:57 GMT]]></title><description><![CDATA[<h2>使用 aspera 下载NCBI ftp中的数据</h2>
<pre><code>## 示例1，来自王通的纳米孔宏基因课件                                                                   
.aspera/connect/bin/ascp -i .aspera/connect/etc/asperaweb_id_dsa.openssh \
--overwrite=diff -QTr -l 6000m anonftp@ftp.ncbi.nlm.nih.gov:blast/db/swissprot.tar.gz ./
</code></pre>
<pre><code>## 示例2，来自 https://www.jianshu.com/p/0bff79fcde3d                                                     
ascp -QT \
 -i ~/.aspera/connect/etc/asperaweb_id_dsa.openssh \
 -k1 -l 300m \
 anonftp@ftp.ncbi.nlm.nih.gov:/blast/db/FASTA/nt.gz ./ 
</code></pre>
<pre><code>## 示例3，下载烟草参考基因组时使用的命令
ascp -i ~/.aspera/connect/etc/asperaweb_id_dsa.openssh --overwrite=diff -QTr -l 6000m \
anonftp@ftp.ncbi.nlm.nih.gov:genomes/all/GCF/000/715/135/GCF_000715135.1_Ntab-TN90/GCF_000715135.1_Ntab-TN90_genomic.fna.gz
</code></pre>
<p dir="auto">注意点：</p>
<ul>
<li>需要正确给出 aspera-license ，在<code>-i</code>后面接license文件，这里是<br />
<code>asperaweb_id_dsa.openssh</code></li>
<li>ftp 账号要写正确
<ul>
<li>NCBI  账号是：<code>anonftp@ftp.ncbi.nlm.nih.gov</code></li>
<li>EBI 公共账号： <code>era-fasp</code></li>
</ul>
</li>
<li>ftp 地址后面接冒号，然后是ftp文件的具体位置</li>
</ul>
]]></description><link>http://an.forum.genostack.com/post/335</link><guid isPermaLink="true">http://an.forum.genostack.com/post/335</guid><dc:creator><![CDATA[ice-melt]]></dc:creator><pubDate>Thu, 21 Jan 2021 08:44:57 GMT</pubDate></item></channel></rss>