<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[单细胞分析]]></title><description><![CDATA[单细胞分析]]></description><link>http://an.forum.genostack.com/category/32</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 09:44:04 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/category/32.rss" rel="self" type="application/rss+xml"/><pubDate>Wed, 16 Oct 2024 02:51:31 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[单细胞大模型]]></title><description><![CDATA[<p dir="auto"><a href="https://medium.com/helical-ai/single-cell-bio-foundation-models-a-beginners-overview-3730a0731bdd" rel="nofollow ugc">https://medium.com/helical-ai/single-cell-bio-foundation-models-a-beginners-overview-3730a0731bdd</a></p>
]]></description><link>http://an.forum.genostack.com/topic/1098/单细胞大模型</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1098/单细胞大模型</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 16 Oct 2024 02:51:31 GMT</pubDate></item><item><title><![CDATA[细胞类型注释]]></title><description><![CDATA[<p dir="auto"><a href="https://www.sciencedirect.com/science/article/pii/S2215016123001966" rel="nofollow ugc">https://www.sciencedirect.com/science/article/pii/S2215016123001966</a><br />
Negative markers combined with positive markers can increase the specificity of cell type identification, reducing the likelihood of misclassification and improving the overall accuracy of the analysis<br />
[3]<br />
. By default, the signature genes are expected to be highly expressed in one cell type compared to all other cell types. However, depending on the underlying data, these canonical markers may not be enough to segregate cell types with similar expression profiles (e.g., sub-groups of T-cells in the human blood). When this occurs, genes that are expected not to be detected in a specific cell type (e.g., CD8A in CD4 T cells) can be utilized to improve segregation. Therefore, genes that are characteristically lowly expressed in one cell type compared to the other cell types are introduced as the “negative markers”.</p>
]]></description><link>http://an.forum.genostack.com/topic/1090/细胞类型注释</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1090/细胞类型注释</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 30 Aug 2024 07:59:44 GMT</pubDate></item><item><title><![CDATA[merge vs integrate]]></title><description><![CDATA[<p dir="auto"><a href="https://www.perplexity.ai/search/seurat-merge-vs-integrate-4wIsp710SlSXnWASiWuvjg" rel="nofollow ugc">https://www.perplexity.ai/search/seurat-merge-vs-integrate-4wIsp710SlSXnWASiWuvjg</a><br />
The functions merge() and IntegrateData() in Seurat serve distinct purposes in handling single-cell RNA sequencing (scRNA-seq) data, particularly when working with multiple datasets.<br />
merge()<br />
The merge() function is primarily used to concatenate two or more Seurat objects. This method combines the raw count matrices of the specified objects into a single object. Key characteristics include:<br />
Concatenation of Counts: merge() combines the raw count data from the input objects. If there are unique genes in either dataset, those genes are included in the merged object, resulting in a comprehensive count matrix that reflects all genes present across the datasets.<br />
Loss of Normalization: By default, merging erases any previously normalized and scaled data matrices, meaning that it does not account for differences in data distributions between the datasets. If normalization is desired, it must be performed after merging.<br />
Use Cases: This function is suitable for datasets with minimal batch effects or when combining technical replicates, where the primary goal is to create a larger dataset from similar conditions.<br />
IntegrateData()<br />
In contrast, IntegrateData() is designed for more complex scenarios where datasets may have inherent differences, such as batch effects or biological variability. Its main features include:<br />
Anchor-Based Integration: This function utilizes a method based on identifying "anchor" cells that are shared across datasets. This approach allows for the integration of datasets while accounting for differences in gene expression profiles, enabling better comparison of shared cell populations.<br />
Focus on Shared Variance: The result of IntegrateData() is a single integrated dataset that retains only the genes shared among the datasets, which facilitates comparative analysis of gene expression profiles across different conditions or batches.<br />
Use Cases: It is recommended for situations where datasets are expected to have substantial biological differences or batch effects, such as datasets generated from different platforms or experimental conditions.<br />
Summary<br />
In summary, use merge() when you want to combine datasets with minimal differences and retain all gene information, while IntegrateData() is preferred for integrating datasets with significant variability to ensure that shared cell populations are accurately represented and analyzed. The choice between the two depends on the experimental design and the nature of the datasets being analyzed.</p>
]]></description><link>http://an.forum.genostack.com/topic/1088/merge-vs-integrate</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1088/merge-vs-integrate</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 15 Aug 2024 06:09:05 GMT</pubDate></item><item><title><![CDATA[新冠单细胞数据分析样例 covid19_sc]]></title><description><![CDATA[<p dir="auto">conda install -c conda-forge fftw<br />
由于jupyter的环境是在conda里面 所以当报一些包找不到时 用conda安装 用apt安装 可能路径不对 还是找不到</p>
]]></description><link>http://an.forum.genostack.com/topic/1087/新冠单细胞数据分析样例-covid19_sc</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1087/新冠单细胞数据分析样例-covid19_sc</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 12 Aug 2024 05:40:53 GMT</pubDate></item><item><title><![CDATA[STOmics DB]]></title><description><![CDATA[<p dir="auto">We constructed the front-end framework with Vue.js (version 2.6.14) and built the backend using Django (version 2.2) and Python (version 3.7.4). STOmicsDB used <strong>PostgreSQL</strong> (version 9.6) to store the metadata of publications, and datasets. We used <strong>Elasticsearch</strong> (version 7.16.2) as the search engine in the resource center of STOmicsDB. We employed <strong>MongoDB (version 4.2) and Cirrocumulus</strong> to manage and visualize curated datasets. We used Redis (v5.0.4) as the cache to store and manage the data in memory. For task queue management, we applied <strong>RabbitMQ</strong> (v3.8.13). Nginx (v1.20.1) was used as the reverse proxy server. Currently, STOmicsDB supports the following browsers: Google Chrome (v80.0 and above), Opera (v62.0 and above), Safari (v12.0 and above) and Firefox (v80.0 and above).</p>
]]></description><link>http://an.forum.genostack.com/topic/1027/stomics-db</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1027/stomics-db</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 21 Dec 2023 09:47:41 GMT</pubDate></item><item><title><![CDATA[单细胞 TCR BCR分析]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434330/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434330/</a><br />
Research progress on application of single-cell TCR/BCR sequencing technology to the tumor immune microenvironment, autoimmune diseases, and infectious diseases<br />
单细胞的几个应用场景：<br />
Single-cell sequencing technologies mainly include<br />
single-cell transcriptome sequencing,<br />
single-cell assay for transposase accessible chromatin with high-throughput sequencing,<br />
single-cell immune profiling (single-cell T-cell receptor [TCR]/B-cell receptor [BCR] sequencing), and single-cell transcriptomics.<br />
93a86f70-df5d-4805-8d55-6d1ba7368a24-image.png</p>
]]></description><link>http://an.forum.genostack.com/topic/1002/单细胞-tcr-bcr分析</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1002/单细胞-tcr-bcr分析</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 27 Nov 2023 05:36:55 GMT</pubDate></item><item><title><![CDATA[细胞群内成对分组间差异表达分析]]></title><description><![CDATA[<p dir="auto">File "/home/bioinfo/workspace/miniconda3/envs/snakemake2/lib/python3.11/site-packages/click/core.py", line 754, in invoke<br />
return __callback(*args, **kwargs)<br />
^^^^^^^^^^^^^^^^^^^^^^^^^^^<br />
File "/home/bioinfo/workspace/miniconda3/envs/snakemake2/lib/python3.11/site-packages/click/decorators.py", line 26, in new_func<br />
return f(get_current_context(), *args, **kwargs)<br />
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^<br />
File "/home/bioinfo/ms_jk_small_tool/分组间差异分析+报告展示/代码/细胞群内成对分组间差异表达分析/workflow/scripts/split_violin_plot.py", line 110, in split_violin_plot<br />
marker_gene = top_up.append(top_down)<br />
^^^^^^^^^^^^^<br />
File "/home/bioinfo/workspace/miniconda3/envs/snakemake2/lib/python3.11/site-packages/pandas/core/generic.py", line 5989, in <strong>getattr</strong><br />
return object.<strong>getattribute</strong>(self, name)<br />
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^<br />
AttributeError: 'DataFrame' object has no attribute 'append'. Did you mean: '_append'?<br />
<a href="https://stackoverflow.com/questions/75956209/error-dataframe-object-has-no-attribute-append" rel="nofollow ugc">https://stackoverflow.com/questions/75956209/error-dataframe-object-has-no-attribute-append</a></p>
]]></description><link>http://an.forum.genostack.com/topic/986/细胞群内成对分组间差异表达分析</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/986/细胞群内成对分组间差异表达分析</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 21 Aug 2023 02:19:46 GMT</pubDate></item><item><title><![CDATA[Accelerating Single-cell Bioinformatics with N-dimensional Arrays in the Cloud]]></title><description><![CDATA[<p dir="auto"><a href="https://github.com/lasersonlab/single-cell-experiments" rel="nofollow ugc">https://github.com/lasersonlab/single-cell-experiments</a></p>
项目说明
<p dir="auto">theis lab	# scanpy<br />
laserson lab	# single-cell-experiments (zappy,zarr,ndarray.scala)</p>

支持读取csv,adata,zarr,zarr_gcs(gcs,g3fs,谷歌亚/马逊云端数据)格式的单细胞数据
读取数据后依赖zarr包拆分数据成块(缺点:数据经过重复读取,每次数据读取都是全加载)
adata 数据取矩阵(.X属性的值)数据通过指定块大小后按下标索引map到不同的块对象,即PairedRDD(此时的value是zarr,可能为压缩格式,参考代码 zarr_spark.py#read_zarr_chunk|get_chunk_indices)
对RDD进行计算(参考代码anndata_spark.py#log1p)

该项目衍生的问题：

目前该项目无维护，源代码未指明依赖版本关系，无法运行
项目分析过程无法交互展示，必须定义流程过程和控制参数

]]></description><link>http://an.forum.genostack.com/topic/547/accelerating-single-cell-bioinformatics-with-n-dimensional-arrays-in-the-cloud</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/547/accelerating-single-cell-bioinformatics-with-n-dimensional-arrays-in-the-cloud</guid><dc:creator><![CDATA[ice-melt]]></dc:creator><pubDate>Mon, 14 Feb 2022 03:56:40 GMT</pubDate></item><item><title><![CDATA[10X的数据格式]]></title><description><![CDATA[<p dir="auto"><a href="https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/output/matrices" rel="nofollow ugc">https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/output/matrices</a></p>
<p dir="auto"><a href="https://math.nist.gov/MatrixMarket/formats.html" rel="nofollow ugc">https://math.nist.gov/MatrixMarket/formats.html</a><br />
Market Exchange Format (MEX)</p>
]]></description><link>http://an.forum.genostack.com/topic/492/10x的数据格式</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/492/10x的数据格式</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 15 Dec 2021 10:01:54 GMT</pubDate></item><item><title><![CDATA[单细胞数据库]]></title><description><![CDATA[<p dir="auto">the NIH Human Biomolecular Atlas Program</p>
]]></description><link>http://an.forum.genostack.com/topic/474/单细胞数据库</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/474/单细胞数据库</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 01 Dec 2021 05:47:50 GMT</pubDate></item><item><title><![CDATA[bbrowser的细胞类型注释工具talk2data]]></title><description><![CDATA[<p dir="auto"><a href="https://talk2data.bioturing.com/predict" rel="nofollow ugc">https://talk2data.bioturing.com/predict</a></p>
]]></description><link>http://an.forum.genostack.com/topic/473/bbrowser的细胞类型注释工具talk2data</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/473/bbrowser的细胞类型注释工具talk2data</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 01 Dec 2021 02:54:04 GMT</pubDate></item><item><title><![CDATA[bbrowser产品文档]]></title><description><![CDATA[<p dir="auto"><a href="/assets/uploads/files/1637837945329-blog.bioturing.com.pdf">blog.bioturing.com.pdf</a></p>
]]></description><link>http://an.forum.genostack.com/topic/458/bbrowser产品文档</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/458/bbrowser产品文档</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 25 Nov 2021 10:59:07 GMT</pubDate></item><item><title><![CDATA[normalization 归一化]]></title><description><![CDATA[<p dir="auto"><a href="https://kb.10xgenomics.com/hc/en-us/articles/115004583806-How-are-the-UMI-counts-normalized-before-PCA-and-differential-expression-" rel="nofollow ugc">https://kb.10xgenomics.com/hc/en-us/articles/115004583806-How-are-the-UMI-counts-normalized-before-PCA-and-differential-expression-</a></p>
]]></description><link>http://an.forum.genostack.com/topic/457/normalization-归一化</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/457/normalization-归一化</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 25 Nov 2021 09:55:29 GMT</pubDate></item><item><title><![CDATA[scanpy详细研究]]></title><description><![CDATA[<p dir="auto">gene_symbols和gene_ids<br />
当读取数据的时候 默认设置用基因名还是ID作为列头 实际上scanpy会把这两个的对应关系作为第一个var保存下来 下面是使用gene_ids打开mtx的情况<br />
e67b672e-505c-4b92-a65b-8785fc609bf9-image.png</p>
]]></description><link>http://an.forum.genostack.com/topic/456/scanpy详细研究</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/456/scanpy详细研究</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 24 Nov 2021 08:44:49 GMT</pubDate></item><item><title><![CDATA[单细胞教程]]></title><description><![CDATA[<p dir="auto"><a href="https://bookdown.org/ytliu13207/SingleCellMultiOmicsDataAnalysis/" rel="nofollow ugc">https://bookdown.org/ytliu13207/SingleCellMultiOmicsDataAnalysis/</a></p>
]]></description><link>http://an.forum.genostack.com/topic/455/单细胞教程</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/455/单细胞教程</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 24 Nov 2021 01:51:50 GMT</pubDate></item><item><title><![CDATA[单细胞文献]]></title><description><![CDATA[<p dir="auto"><a href="https://www.nature.com/articles/s12276-020-0409-x" rel="nofollow ugc">https://www.nature.com/articles/s12276-020-0409-x</a><br />
An era of single-cell genomics consortia</p>
]]></description><link>http://an.forum.genostack.com/topic/453/单细胞文献</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/453/单细胞文献</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 22 Nov 2021 06:43:45 GMT</pubDate></item><item><title><![CDATA[Falco  Falco: A quick and flexible single-cell RNA-seq processing framework on the cloud]]></title><description><![CDATA[<p dir="auto"><a href="https://github.com/VCCRI/Falco/" rel="nofollow ugc">https://github.com/VCCRI/Falco/</a></p>
]]></description><link>http://an.forum.genostack.com/topic/452/falco-falco-a-quick-and-flexible-single-cell-rna-seq-processing-framework-on-the-cloud</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/452/falco-falco-a-quick-and-flexible-single-cell-rna-seq-processing-framework-on-the-cloud</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 22 Nov 2021 06:29:15 GMT</pubDate></item><item><title><![CDATA[chanzuckerberg基金会两个Spark单细胞项目]]></title><description><![CDATA[<p dir="auto">Accelerating Cross-Sample Analysis of Single-Cell Genomic Data with Adam and Apache Spark<br />
Project Goal<br />
To build computational tools that enable researchers to harness distributed computing to enable machine learning and interactive data exploration across raw single-cell data.</p>
<p dir="auto">Results &amp; Resources<br />
The Joseph lab’s primary goal was to support the Apache Spark ecosystem to extend their work on hyper scalable workflows and visualization. They pursued a wide number of projects:</p>
<p dir="auto">ADAM, a library and command line tool to parallelize genomic data analysis across cluster and cloud computing environments.<br />
Mango, a distributed visualization tool for visualizing and manipulating large genomic sequencing datasets in a Jupyter notebook.<br />
Modin, a drop-in replacement for pandas that allows users to interpret large datasets in table format with high throughput and low latency.</p>
]]></description><link>http://an.forum.genostack.com/topic/451/chanzuckerberg基金会两个spark单细胞项目</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/451/chanzuckerberg基金会两个spark单细胞项目</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 22 Nov 2021 06:02:11 GMT</pubDate></item><item><title><![CDATA[Elbow-and-Jackstraw-plots 用来查看PCA]]></title><description><![CDATA[<p dir="auto"><a href="https://www.researchgate.net/figure/figure-supplement-4-Elbow-and-Jackstraw-plots-used-for-determination-of-principal_fig9_345454907" rel="nofollow ugc">https://www.researchgate.net/figure/figure-supplement-4-Elbow-and-Jackstraw-plots-used-for-determination-of-principal_fig9_345454907</a><br />
<img src="/assets/uploads/files/1637376545385-caee9b86-f158-4a9d-8300-88abd16c9c14-image.png" alt="caee9b86-f158-4a9d-8300-88abd16c9c14-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/449/elbow-and-jackstraw-plots-用来查看pca</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/449/elbow-and-jackstraw-plots-用来查看pca</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sat, 20 Nov 2021 02:49:07 GMT</pubDate></item><item><title><![CDATA[单细胞浏览器分析]]></title><description><![CDATA[<p dir="auto"><a href="https://github.com/lilab-bcb/cirrocumulus" rel="nofollow ugc">https://github.com/lilab-bcb/cirrocumulus</a></p>
]]></description><link>http://an.forum.genostack.com/topic/448/单细胞浏览器分析</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/448/单细胞浏览器分析</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 19 Nov 2021 11:43:27 GMT</pubDate></item><item><title><![CDATA[单细胞数据分析概要]]></title><description><![CDATA[<p dir="auto"><a href="/assets/uploads/files/1637909875091-%E5%8D%95%E7%BB%86%E8%83%9E%E5%A4%A7%E6%95%B0%E6%8D%AE%E8%A7%A3%E5%86%B3%E6%96%B9%E6%A1%88.pptx">单细胞大数据解决方案.pptx</a></p>
]]></description><link>http://an.forum.genostack.com/topic/350/单细胞数据分析概要</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/350/单细胞数据分析概要</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 21 Jul 2021 03:58:45 GMT</pubDate></item></channel></rss>