生物信息分析

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

免疫组学

2
主题

3
帖子

A

https://www.sc-best-practices.org/air_repertoire/ir_profiling.html
56a55e24-8f0e-4b43-a73b-0ee7beaf8d4e-image.png

VDJ过程
adc29db5-511a-48a1-bd0f-e9269ac0a751-image.png
细胞与基因疗法 cell and gene therapy

3
主题

4
帖子

A

Definition of Gene Therapy in the EU
In the EU, the definition of a Gene Therapy Medicinal Product (GTMP) is provided
in Directive 2009/120/EC amending Directive 2001/83/EC, part IV of Annex I.
A GTMP means a biological medicinal product that has the following
characteristics:
(a) it contains an active substance that contains or consists of a recombinant
nucleic acid used in or administered to human beings with a view to
regulating, repairing, replacing, adding or deleting a genetic sequence;
(b) its therapeutic, prophylactic or diagnostic effect relates directly to the
recombinant nucleic acid sequence it contains, or to the product of the
genetic expression of this sequence.
GTMPs do not include vaccines against infectious diseases.
Hazel Aranha, Humberto Vega-Mercado - Handbook of Cell and Gene Therapy_ From Proof-of-Concept through Manufacturing to Commercialization-CRC Press (2023).pdf
代谢组学

5
主题

9
帖子

A

核磁在代谢组学中的应用

1.核磁共振技术原理
https://www.youtube.com/watch?v=pUWcXvw1Rsg
https://www.bilibili.com/video/BV1CU4y1E7xL/?spm_id_from=333.788.recommend_more_video.-1(有中文翻译比较好)
司法

3
主题

8
帖子

A

https://bitbucket.org/rirgabiss/mhinngs/src/master/
MHinNGS is a tool for analysis of microhaplotypes (MHs) in singleend sequencing data obtained through a massive parallel sequencing plattform (MPS). The tool identifies the reads with the MHs and calls the genotypes of the MHs according to the criteria and positions specified in the configuration file. It also searches for additional variants in the region defined by the start and the stop positions specified in the configuration file.
这个软件的输入是参考序列和原始单端测序的fastq 包括了比对过程依赖的第三方软件包括
python 3.6 including pip
samtools version 1.9
hisat2 version 2.1.0
stringtie version 2.0.3
agrep T.R.E. version 0.8.0
不支持双端序列
表观遗传学

1
主题

4
帖子

A

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3828144/
Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data
3d1f0fc6-d2ed-4539-9cde-0c03bde51949-image.png
合成生物学

4
主题

39
帖子

A

https://github.com/mesoscope
植物基因组学

13
主题

44
帖子

M

参考

https://gatk.broadinstitute.org/hc/en-us/articles/360035531572-Evaluating-the-quality-of-a-germline-short-variant-callset
蛋白组学

10
主题

25
帖子

A

https://www.sinobiological.com/resource/protein-review/fc-fusion-proteins
46c1ca94-e2a9-44bd-b53e-593a82afcd3c-image.png
临床生物信息

10
主题

28
帖子

A

1689181129856.gif
肿瘤NGS数据分析

6
主题

7
帖子

A

https://civicdb.org/pages/about

登录以发表

A

构建本地nt/nr数据库
• anneng

19

0
赞同

19
帖子

76
浏览

A

如果想把一个fasta文件中的序列都当作一个物种对待那么可以使用taxid参数
合并两个数据库：
makeblastdb -in mysequences.fna -dbtype nucl -title "some sequences I found" -out mysequences -parse_seqids
blastdb_aliastool -dblist nt mysequences -dbtype nucl -title "nt database + my own sequences" -out ntandmore
如果有多个fasta 文件每个文件是一个物种可以先分别建库然后用blastdb_aliastool合并
A

eQTL
• anneng

1

0
赞同

1
帖子

5
浏览

尚无回复
A

Stacks分析RAD-Seq数据的内部原理
• anneng

2

0
赞同

2
帖子

21
浏览

A

编译调试版本
./configure CFLAGS='-g -O0' CXXFLAGS='-g -O0'
A

使用bcl2fastq处理illumina下机数据
• anneng

1

0
赞同

1
帖子

7
浏览

尚无回复
A

wdl仓库
• anneng

1

0
赞同

1
帖子

4
浏览

尚无回复
A

使用snps来构建进化树
• anneng

1

0
赞同

1
帖子

6
浏览

尚无回复
A

idseq 病原鉴定流程
• anneng

1

0
赞同

1
帖子

5
浏览

尚无回复
A

生殖道微生物相关研究
• anneng

1

0
赞同

1
帖子

4
浏览

尚无回复
A

物种的中文名字问题
• anneng

2

0
赞同

2
帖子

9
浏览

A

http://www.sp2000.org.cn/
中国物种名录
A

通过序列比对的方式构建进化树
• anneng

1

0
赞同

1
帖子

11
浏览

尚无回复
A

使用KrakenUniq进行病原分析
• anneng

9

0
赞同

9
帖子

36
浏览

I

@anneng 在使用KrakenUniq进行病原分析中说：

krakenuniq --report-file res_archaea.tsv --db archaea_db --threads 10 test_archaea.fna

krakenuniq构建数据库参数说明
构建过程中需要6步，可以重复运行，每次运行会检查之前的文件确定是否需要运行当前步骤
不指定--jellyfish-hash-size会使用全部内存，内存不够会报错，建议根据服务器内存大小指定该参数
Usage: krakenuniq-build [task option] [options] Task options (exactly one can be selected -- default is build): --download-taxonomy Download NCBI taxonomic information --download-library TYPE Download partial library (TYPE = one of "refseq/bacteria", "refseq/archaea", "refseq/viral"). Use krakenuniq-download for more options. --add-to-library FILE Add FILE to library --build Create DB from library (requires taxonomy d/l'ed and at least one file in library) --rebuild Create DB from library like --build, but remove existing non-library/taxonomy files before build --clean Remove unneeded files from a built database --shrink NEW_CT Shrink an existing DB to have only NEW_CT k-mers --standard Download and create default database, which contains complete genomes for archaea, bacteria and viruses from RefSeq, as well as viral strains from NCBI. Specify --taxids-for-genomes and --taxids-for-sequences separately, if desired. --help Print this message --version Print version information Options: --db DBDIR Kraken DB directory (mandatory except for --help/--version) --threads # Number of threads (def: 1) --new-db NAME New Kraken DB name (shrink task only; mandatory for shrink task) --kmer-len NUM K-mer length in bp (build/shrink tasks only; def: 31) --minimizer-len NUM Minimizer length in bp (build/shrink tasks only; def: 15) --jellyfish-hash-size STR Pass a specific hash size argument to jellyfish when building database (build task only) --jellyfish-bin STR Use STR as Jellyfish 1 binary. --max-db-size SIZE Shrink the DB before full build, making sure database and index together use <= SIZE gigabytes (build task only) --shrink-block-offset NUM When shrinking, select the k-mer that is NUM positions from the end of a block of k-mers (default: 1) --work-on-disk Perform most operations on disk rather than in RAM (will slow down build in most cases) --taxids-for-genomes Add taxonomy IDs (starting with 1 billion) for genomes. Only works with 3-column seqid2taxid map with third column being the name --taxids-for-sequences Add taxonomy IDs for sequences, starting with 1 billion. Can be useful to resolve classifications with multiple genomes for one taxonomy ID. --min-contig-size NUM Minimum contig size for inclusion in database. Use with draft genomes to reduce contamination, e.g. with values between 1000 and 10000. --library-dir DIR Use DIR for reference sequences instead of DBDIR/library. --taxonomy-dir DIR Use DIR for taxonomy instead of DBDIR/taxonomy. Experimental: --uid-database Build a UID database (default no) --lca-database Build a LCA database (default yes) --no-lca-database Do not build a LCA database --lca-order DIR1 Impose a hierarchical order for setting LCAs. --lca-order DIR2 The directories must be specified relative to the libary directory ... (DBDIR/library). When setting the LCAs, k-mers from sequences in DIR1 will be set first, and only unset k-mers will be set from DIR2, etc, and final from the whole library. Use this option when including low-confidence draft genomes, e.g use --lca-order Complete_Genome --lca-order Chromosome to prioritize more complete assemblies. Keep in mind that this option takes considerably longer. 使用krakenuniq分析数据命令参数说明 Usage: krakenuniq --report-file FILENAME [options] <filename(s)> Options: --db NAME Name for Kraken DB (default: none) --threads NUM Number of threads (default: 1) --fasta-input Input is FASTA format --fastq-input Input is FASTQ format --gzip-compressed Input is gzip compressed --bzip2-compressed Input is bzip2 compressed --hll-precision INT Precision for HyperLogLog k-mer cardinality estimation, between 10 and 18 (default: 12) --exact Compute exact cardinality instead of estimate (slower, requires memory proportional to cardinality!) --quick Quick operation (use first hit or hits) --min-hits NUM In quick op., number of hits req'd for classification NOTE: this is ignored if --quick is not specified --unclassified-out FILENAME Print unclassified sequences to filename --classified-out FILENAME Print classified sequences to filename --output FILENAME Print output to filename (default: stdout); "off" will suppress normal output --only-classified-output Print no Kraken output for unclassified sequences --preload Loads DB into memory before classification --paired The two filenames provided are paired-end reads --check-names Ensure each pair of reads have names that agree with each other; ignored if --paired is not specified --help Print this message --version Print version information Experimental: --uid-mapping Map using UID database If none of the *-input or *-compressed flags are specified, and the file is a regular file, automatic format detection is attempted.
A

通过分析genbank的数据来挖掘病原
• anneng

1

0
赞同

1
帖子

1
浏览

尚无回复
I

KrakenUniq建库日志记录
• ice-melt

2

0
赞同

2
帖子

7
浏览

I

@ice-melt
问题：xargs: cat: terminated by signal 13 Found jellyfish v1.1.12 Kraken build set to minimize disk writes. Finding all library files Found 12 sequence files (*.{fna,fa,ffn,fasta,fsa}) in the library directory. Creating k-mer set (step 1 of 6)... Using jellyfish Hash size not specified, using '343468319717' /cromwell-executions/data/public_data/mngs/miniconda3/envs/tax_classifier/libexec/build_db.sh: line 46: 42974 Killed jellyfish count -m 31 -s 343468319717 -C -t 10 -o database /dev/fd/63 xargs: cat: terminated by signal 13
内存不够时需要指定 --jellyfish-hash-size 20000M
I

KrakenUniq 流程记录
• ice-melt

1

0
赞同

1
帖子

5
浏览

尚无回复
A

在病原分析中去掉宿主信息
• anneng

2

0
赞同

2
帖子

19
浏览

A

http://www.metagenomics.wiki/tools/short-read/remove-host-sequences
https://www.microbiologyresearch.org/content/journal/mgen/10.1099/mgen.0.000393?crawler=true
A

USP vs EP 美国药典和欧洲药典
• anneng

1

0
赞同

1
帖子

5
浏览

尚无回复
A

进化树的构建方法比较
• anneng

15

0
赞同

15
帖子

21
浏览

A

https://paleogenomics-course.readthedocs.io/en/latest/8_Filtering_SNPs.html
A

使用mega构建进化树
• anneng

1

0
赞同

1
帖子

5
浏览

尚无回复
A

烟草的基本信息
• anneng

2

0
赞同

2
帖子

9
浏览

A

https://www.nature.com/articles/ncomms4833
这篇文章中提到的烟草基因组来自这几个品系：
burley tobacco, TN90;
flue cured tobacco, K326;
Oriental tobacco, Basma Xanthi;
wild tobacco, N. otophora
而且提到有usda的National Plant Germplasm System网站记录了1600多个栽培的品种 http://www.ars-grin.gov/npgs/
A

molecular marker　分子标记
• anneng

1

0
赞同

1
帖子

5
浏览

尚无回复

11 / 15

构建本地nt/nr数据库 • anneng

eQTL • anneng

Stacks分析RAD-Seq数据的内部原理 • anneng

使用bcl2fastq处理illumina下机数据 • anneng

wdl仓库 • anneng

使用snps来构建进化树 • anneng

idseq 病原鉴定流程 • anneng

生殖道微生物相关研究 • anneng

物种的中文名字问题 • anneng

通过序列比对的方式构建进化树 • anneng

使用KrakenUniq进行病原分析 • anneng

通过分析genbank的数据来挖掘病原 • anneng

KrakenUniq建库日志记录 • ice-melt

KrakenUniq 流程记录 • ice-melt

在病原分析中去掉宿主信息 • anneng

USP vs EP 美国药典和欧洲药典 • anneng

进化树的构建方法比较 • anneng

使用mega构建进化树 • anneng

烟草的基本信息 • anneng

molecular marker 分子标记 • anneng

构建本地nt/nr数据库
• anneng

eQTL
• anneng

Stacks分析RAD-Seq数据的内部原理
• anneng

使用bcl2fastq处理illumina下机数据
• anneng

wdl仓库
• anneng

使用snps来构建进化树
• anneng

idseq 病原鉴定流程
• anneng

生殖道微生物相关研究
• anneng

物种的中文名字问题
• anneng

通过序列比对的方式构建进化树
• anneng

使用KrakenUniq进行病原分析
• anneng

通过分析genbank的数据来挖掘病原
• anneng

KrakenUniq建库日志记录
• ice-melt

KrakenUniq 流程记录
• ice-melt

在病原分析中去掉宿主信息
• anneng

USP vs EP 美国药典和欧洲药典
• anneng

进化树的构建方法比较
• anneng

使用mega构建进化树
• anneng

烟草的基本信息
• anneng

molecular marker　分子标记
• anneng