SARS-CoV-2 discovery and surveillance
-
命令介绍:
1. 质量控制 (srynobio/fastp:latest) fastp -i SRR11954304.fastq.gz -o SRR11954304_fastp.fastq 2. 去除宿主 (trinityrnaseq/trinityrnaseq:latest) bowtie2 -p 15 -x /ceph_disk2/data/tmp/index -U SRR12045797.fastq -S a.sam samtools view -bS -F 4 a.sam > a.bam samtools sort a.bam -o a.sorted.bam samtools index a.sorted.bam samtools fastq a.sorted.bam > a.fq 3. 转录组组装软件(trinityrnaseq/trinityrnaseq:latest) Trinity --seqType fq --max_memory 100G --single a.fq 4. 分类 (ncbi/blast:2.12.0) (nanozoo/diamond:latest) diamond makedb --in Trinity.fasta -d Trinity diamond blastx -d Trinity -q Trinity.fasta -o t1 makeblastdb -in Trinity.fasta -dbtype prot -title Trinity -out Trinity blastx -db test -query Trinity.fasta -out test.out blastn -db test -query Trinity.fasta -out test.out 5. 多序列比对:(staphb/mafft:latest) MAFFT mafft --maxiterate 100 sample.fasta > result.fa python trans.py result.fa result.phy 6. 系统发育和进化分析(biocontainers/raxml:v8.2.12dfsg-1-deb_cv1) RAxML raxmlHPC -d -p 12345 -m GTRGAMMAI -s result.phy -n txt参考文献: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7929396/#ref33