搜索
查看: 2589|回复: 0

差异分析中foldchange和显著性P值阈值的选择问题-biostar

[复制链接]

365

主题

512

帖子

1713

积分

管理员

Rank: 9Rank: 9Rank: 9

积分
1713
发表于 2016-11-30 16:18:12 | 显示全部楼层 |阅读模式
值得大家看看,我当一个搬运工吧!
https://www.biostars.org/p/6498/
https://www.biostars.org/p/163407/
https://www.biostars.org/p/157210/
https://www.biostars.org/p/172360/
https://www.biostars.org/p/157357/
https://www.biostars.org/p/125165/

what is your experimental design regarding your analysis ? For instance the case samples are cancer samples ? If so, you could expect a lot of DE between cancer and control samples. Or the case samples represent some drug treatment ? Nevertheless, if you could give us more information about your procedure(specific microarray platform, if you have performed any non-specific filtering prior DE testing etc). Generally,

I believe that there is no general characterization as "correct" or acceptable, as it depends on the biology of your system and your samples/analysis. Also, limma is capable of taking into account the imbalance between the case and the control samples.


Finally, just to pinpoint that generally you should not consider DEG genes only with a adjusted p-value threshold and just use the cutoff to get a first pool of DEG candidates. You can use the functions treat & topTreat from limma to test differential expression against a minimum log-fold change cutoff. It will reduce the number of your deg genes, and also probably will represent a more biologically meaningful subset of genes for your analysis.

Yes, cases are un-treated APL (M3 leukemia) vs normal BM samples. Thanks for your finally suggestion. Also, are there any similar functions for package like RankProd (Rank product analysis)?

In this case, the big number of DEG genes(based on the adjusted p-value criterion only) can be explained by the generally large differences between your leukemia samples and normal ones. Regarding the RankProd analysis, im familiar with it but i would not reccomended it, as it is usually for a few samples and in simple words just "ranks the log-fcs". Limma is far more powerful and with many more capabilities for handling any issue like small samples sizes or inbalanced studies.

In fact, you could use treat() after lmFit(), instead of eBayes step-and then use topTreat() like topTable to return the subset of the DEG candidates. Check ?treat() from the limma package. For instance you can use an lfc=0.5, to give you genes with at least higher lfc-in this case bigger than 1.5 fold change.

Finally, i would also consider non-specific intensity filtering to remove low expressed probes in most samples-which would be uniformative for any further analysis.



上一篇:psych包中的corr.test()做矩阵的相关性检验
下一篇:有没有好的关于肿瘤以及血液游离癌症DNA测序方面 文献推荐
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

QQ|手机版|小黑屋|生信技能树 ( 粤ICP备15016384号  

GMT+8, 2019-9-16 09:08 , Processed in 0.036828 second(s), 26 queries .

Powered by Discuz! X3.2

© 2001-2013 Comsenz Inc.