第7页 - 关于covariates的讨论汇总 - 话题女王

全部话题 - 话题: covariates

l**********1
发帖数: 5204

Sure
plus
one Australian University of Adelaide 2010 PhD Thesis
Title: Bayesian networks for high-dimensional data with complex mean
structure.
by Kasza, JE
Abstract: In a microarray experiment, it is expected that there will be
correlations between the expression levels of different genes under study.
These correlation structures are of great interest from both biological and
statistical points of view. From a biological perspective, the
identification of correlation structures can le... 阅读全帖

g**********y
发帖数: 423

来自主题: Biology版 - NGS_Illumina类

欢迎讨论，特别是各种程序的调用参数。。。
http://dl.dropbox.com/u/62547840/NGS_Illumina.pm
http://dl.dropbox.com/u/62547840/NGS_Illumina.pl
screen output:
Illumina 1.3+ fastq format: ASCII(min, max) = (66, 102)
2012/08/25 11:41:15 START maq ill2sanger Run1_testicular-28T_lane2_read1_
sequence.txt Testis_T28_read1_sanger.fq
2012/08/25 11:42:57 SUCCESS after running 0 hours 1 minutes 42 seconds
2012/08/25 11:42:57 START maq ill2sanger Run1_testicular-28T_lane2_read2_
sequence.txt Testis_T28_read2_sanger.fq
2012/08/25... 阅读全帖

e*******e
发帖数: 1837

来自主题: Biology版 - 请教一个NGS的问题

covaris

K****n
发帖数: 5970

来自主题: Biology版 - 求教一个生物统计学得问题

这个事儿是这样的
首先我是写网页的，我也不会统计
但是我倒着往上评论一下
pca吧，要算哪个点或者哪几个点的线性变换是principle component，那你必须要以每
个点为一个dimension算covariance matrix，现在你每个点就一个数据点，直接算不出
来。要算出来可以用hyperparameter，可以考虑gaussian process，但是我觉得对你有
难度，而且这个方法好像不太典型。
mixture model是对的，在你这种情况下也是必要的，但是它解决不了帮你分组这个问
题。
还是二楼说得最实在。你可以找点儿outlier detection的常用算法，把你怀疑的点都
打成outlier，然后分组，然后上mixture model。

l**********1
发帖数: 5204

来自主题: Biology版 - 求教一个生物统计学得问题

no s/he is quitted bio PD now on IT world just and her/his spouse now still
on wet bio field (from her/his older vast post here you can got that
conclusion)
>

>>
发信人: KeeVan (Kevin), 信区: Biology
这个事儿是这样的
首先我是写网页的，我也不会统计
但是我倒着往上评论一下
pca吧，要算哪个点或者哪几个点的线性变换是principle component，那你必须要以每
个点为一个dimension算covariance matrix，现在你每个点就一个数据点，直接算不出
来。要算出来可以用hyperparameter，可以考虑gaussian process，但是我觉得对你有
难度，而且这个方法好像不太典型。
mixture model是对的，在你这种情况下也是必要的，但是它解决不了帮你分组这个问
题。
还是二楼说得最实在。你可以找点儿outlier detection的常用算法，把你怀疑的点都... 阅读全帖

l*****i
发帖数: 1163

来自主题: Biology版 - Chip-Seq 样品制备求助

1，2，3使用10m细胞，Covaris超声30min，三个样品的Bioanalyzer Ultra Sensitive
DNA分析图谱。
4，5，6是对应于1，2，3，使用Millipore的Magana Chip kit，抗体CHIP后得到的结果。
请问为什么会得到这么烂的CHIP DNA？
超声片段的图谱有问题么（大小在300－400之间）？
多谢

k*****n
发帖数: 323

来自主题: Biology版 - Chip-Seq 样品制备求助

1－3，我没用过covaris，而且检测sonication效果的话一般都是琼脂糖胶。不过根据
你的bioanalyzer的结果，如果是library preparation之前的话，size偏大且范围偏窄
。这不是个好兆头。如果是pcr后的文库的话，大小还算合适
至于4－6，chip下来的dna量太少，没有任何检测size的意义。

o*p
发帖数: 177

来自主题: Biology版 - 问个统计学的问题，这种改用什么test

I would do:
repeated measures ANOVA
covariance matrix: AR1
test: group by time (linear or quadratic trend) interaction

N******n
发帖数: 3003

来自主题: Biology版 - 生物信息学其实粗分可以分两种

顺便帮我看看这个问题：
标题: 问个简单的matrix变换的问题，包子答谢
发信站: BBS 未名空间站 (Thu Apr 10 15:27:16 2014, 美东)
有一个symmetric matrix, 我是平方得来的，应当是positive definite: 看了
eigvalue 都是正的，但是在matlab上用cholcov: Cholesky-like covariance
decomposition
出不来：
[T,err] = cholcov(sigma)
T是空的。不知道为什么，谢谢
A=
sigma =
0.4382 -0.3990 -1.2396 -1.0880 0.9084 1.5463 -0.6721 0.
4587
-0.3990 0.5448 1.3429 1.1029 -1.0068 -1.7077 0.6971 -0.
4047
-1.2396 1.3429 15.4717 10.6047 -10.6348 -18.3169 6.8... 阅读全帖

c*****u
发帖数: 439

来自主题: Biology版 - 问个问题，关于细胞富集的。谢谢。

我用老鼠组织做了fix 然后sorting，因为我要的细胞所占比例很低，所以最后sorting
下来，细胞核所在溶液体积很大。而我恰好因为要用covaris 这个机器做sonication,
最大体积只可以130ul。
请问什么方法可以最大程度的把我要的细胞核保留下来，并且缩小溶液的体积？不知
道low retention tube 怎么样。
谢谢

N******n
发帖数: 3003

来自主题: Biology版 - 谁装过python的limix package (转载)

/Developer/SDKs/MacOSX10.5.sdk/usr/include/architecture/i386/math.h:477:17:
note: 'lroundl' declared here
extern long int lroundl(long double);
^
In file included from ./src/limix/covar/combinators.cpp:15:
In file included from ./src/limix/covar/combinators.h:19:
In file included from src/limix/covar/covariance.h:18:
In file included from src/limix/types.h:18:
In file included from External/Eigen/Dense:1:
In file included from External/Eigen/Core:28:
In file included from /Applic... 阅读全帖

y***s
发帖数: 4225

来自主题: Biology版 - 一个统计问题：怎么评判哪种测量方法好

ANCOVA, 测量方法 being the independent variable, 观测值 being the outcome,
实际值being the covariate.

w*e
发帖数: 740

来自主题: Biology版 - 大家有用过Clontech的smart-seq v4 这个rna-seq kit么？

以前用的nugen的，这次像富集mRNA,想试一下Clontech的smart-seq v4 这个rna-seq
kit。大家如何？拿到full-length cDNA 以后，大家是用nextera还是covaris＋low
input library prep kit （clontech）方法去做最后的library？
谢谢

k*****n
发帖数: 323

来自主题: Biology版 - 请教ChIP-seq用Dynabeads的问题

try washing your Dynabeads with PBS 0.5% BSA 3 times, then incubate
antibodies in PBS-BSA for two hour. After IP, make sure resuspend beads
totally during each washing step. I never using sperm DNA. I would suggest
you using Pol II monoantibody 4H8 as positive control. Several steps are
critical for ChIP, 1, crosslinking, always using fresh FA, like 16% FA from
Thermo. for cells 15 min crosslinking with rotation would be enough, quench
with 0.125 M glycine for 1 min. 2 sonication, try Branson so... 阅读全帖

发帖数: 1

来自主题: Biology版 - 请教ChIPseq的技术问题

1. 现在做ChIPseq是不是都得做重复？记得前几年的ChIPseq，都是一次实验，就可以。
2. 现在做ChIPseq，spiked－in 是必须的吗？
3. biorupter和covaris，哪个好？为什么？
谢谢！

发帖数: 1

来自主题: Biology版 - covaris和biorupter哪个好？

优缺点是什么？
只用过biorupter。
有人两个都用过吗？

d********m
发帖数: 3662

来自主题: Biology版 - 请出几道题，用来鉴别一个人是否生物博后

what's about the first one? I work extensively with covariance matrices but
I didn't actually note the so called special properties of eigenvectors of
symmetric matrix.

r**********e
发帖数: 587

来自主题: Biology版 - 请出几道题，用来鉴别一个人是否生物博后

Eigenvectors of symmetric matrix are orthoganoal to each other.
covariance matrices 就是 symmetric；eigenvector也就是PCA的principal
component啦
btw好奇我觉得你们数学都很强的样子，是还在做bioinformatics还是直接CS了？

but

s******9
发帖数: 283

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

买个不错的二手就5000刀。真是不一样，国内舍得花这么多钱在这类小型仪器上。

Z******5
发帖数: 435

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

这玩意在国内可是宝贝呀，哪有二手的卖。再说国内设备都是单位固定资产，也不允许
买卖。
目前单位有两台ME220，用来打断DNA或者染色质，效果非常好，重复性也不错。但就是
通量太低，做染色质打断实验，有时候一天的时间都耗费在超声打断这上面了。
最近单位平台建设刚好有点钱，准备申购一台高通量的，发现E220不错。从目前收集到
的信息看，供应商报的14万美金的价格不是一般的黑，买两台都绰绰有余了。

f*******e
发帖数: 628

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

2011 年 quote 的 E220 是 $127k, 包括一些配件，安装和培训。

z*t
发帖数: 863

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

试试bioruptor pico benchtop大小通量也高

s******9
发帖数: 283

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

买个不错的二手就5000刀。真是不一样，国内舍得花这么多钱在这类小型仪器上。

Z******5
发帖数: 435

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

f*******e
发帖数: 628

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

2011 年 quote 的 E220 是 $127k, 包括一些配件，安装和培训。

z*t
发帖数: 863

来自主题: Biology版 - 急——紧急咨询超声破碎仪Covaris E220及E220evolution的价格

试试bioruptor pico benchtop大小通量也高

l**********t
发帖数: 5754

来自主题: Business版 - 一个概念性的问题，请求帮助~~~~~ (转载)

my 2 cents.
assuming the regression C = beta * A + error is liner,
beta =COV(A, C)/VAR(A) > 0 so COV(A,C) >0,
also, COV(A, B) > 0 as given,
question: is it possible to have COV(B, C)/VAR(B) < 0, or COV(B, C) <0
I think it is possible to have a negative COV(B,C),given COV(A,C) and COV(A,
B) are positive, as long as the covariance/correlation matrix of A, B, C is
still positive definite. For example, I can have
COR(A,B) = 0.5
COR(A,C) = 0.5
COR(B,C) = -0.1

s*****r
发帖数: 59

来自主题: Business版 - mean/ std

this statistic value is called something similiar to correlation covariance,
or so. but could find it in my book...

s*********o
发帖数: 14

来自主题: Business版 - 求助time-series model

请教各位大侠，有没有这样的time-series model:
Y_t=m(X_t, theta1)+s(X_t, theta2)*u_t
where X_t is some exogenous covariates, m and s are two functions of X_t and
/or X_t-1, X_t-2 ..., theta1 and theta2 are some parameters, u_t is error
term.
谢谢大家先

M****n
发帖数: 84

来自主题: Business版 - 想换个工作，请问薪水的期望值应该多少？ (转载)

【以下文字转载自 Quant 讨论区】
发信人: Merton (Robert), 信区: Quant
标题: 想换个工作，请问薪水的期望值应该多少？
发信站: BBS 未名空间站 (Thu Nov 5 23:44:52 2009, 美东)
请大伙指教。简单情况：06年Business Phd. 近了一个buyside institutional fund做
了几年。主要是equity quant, 不涉及derivatives。日常工作主要是和matlab, sas,
sql, python，S+, excel vba 这些东西打交道，不会java c++. 工作内容是building
factors, portfolio optimization, covariance matrix construction, performance
attribution. 现在在一个地方呆就了想换个环境多学点东西。当然工资涨些就更好了
。一般的数据库，软件之类的当然都是多少会些。还是想换到一个buyside mutual
fund或者institutional fund里面去。要去i

d***e
发帖数: 104

来自主题: Chemistry版 - Paper Help

Principal covariates regression : Part I. Theory
Sijmen de Jong and Henk A. L. Kiers
http://www.sciencedirect.com/science?_ob=ArticleURL&_udi=B6TFP-44GGKN3-BJ&_user=10&_rdoc=1&_fmt=&_orig=search&_sort=d&_docanchor=&view=c&_searchStrId=1129214264&_rerunOrigin=google&_acct=C000050221&_version=1&_urlVersion=0&_userid=10&md5=10e6e7205503ae288
Please send to l*******[email protected] or let me know where to download. Thanks!

c****n
发帖数: 21367

来自主题: ChineseMed版 - 靠，我算服了中医饭了

恰恰不是运气，自然界的常见系统生长一般都是random multiplication来刻画的
random multiplication导致power law，power law导致covariance dispersion
然后导致逼近形式的entropy concentration
对于线性逼近，当信息聚集在少数几个特征方向上的时候，
必然是这几个特征向量能还原大部分信息。
这个你可以问问做图像处理的同学，任意一个图像，随便找个空间，
比如Fourier空间，拿DFT一变换，然后去掉95%的高频分量再重构，
还是能看到大致的。这是有损压缩的理论基础。当然这已经是太高层的
应用了。要看到本质，空间可以不同（维数、平直性、etc），变换
可以不同（Foureier, Laplacian, Fokk-Planck, wavelet etc），
分解和重构方式可以不同，这些都是应用。其本质在于我们生活里的
信息都是power law的，不然搞这么多分解一点意义都没有。
如果信息是各向同性的或者比较均匀，那你换个空间表示有啥用嘛？:)
对于复杂系统，power law是无处不在的，呵呵。

c****n
发帖数: 21367

来自主题: ChineseMed版 - 中医的发展与继承

另外，个人觉得中医不是hypothesis-driven。写点理由，请多多批评指正啊。
中医并没有拿哪个命题过来做null test，而且其测试不是planned priori的。
基本上应该是exploratory。但我不同意exploratory是subjective的说法。
实际上，大数据量的dataset一抽象，模型自己就出来了。最简单的例子
就是PCA，取empirical covariance矩阵的特征方向来刻画dataset。
可以达到最小维数的目的。
抽象的说，就是先从数据中直接算出坐标轴，根据算出来的坐标轴来刻画
因果关系，而不是先假设一个因果关系，然后去检验了再依据检验结果
制订坐标轴来刻画dataset。中医用的是前一种方法。
放到生物学里面的例子就是。别先假设存在一个白化病基因是遗传物质然后
分显性隐性，再通过null test找基因什么的。拿几千几万个人的数据来，
数据分析一下就看到爹妈是什么情况子代会是什么情况了。算出来的结果不会
导致人类发现基因，但是会直接得出显+隐=显，隐+隐=隐，显+显=显的关系
（粗糙的这么说遗传关系，只是举个例子）。
然后说关于这

w**d
发帖数: 2334

来自主题: Computation版 - [转载] Also a simulation problem.

【以下文字转载自 Mathematics 讨论区】
【原文由 wind 所发表】
1) original problem:
a random field( process) w(x), x \in (0, pi).
The mean is zero everywhere.
Covariance function c(x1,x2) = exp(-|x1-x2|/b ), b>0
We also know the PDF of w(x) for any fixed x.
How to do the simulation?
2) simplified(approximated) version:
Use Karhunen-Loeve expansion of w(x), to obtain an approximation:
w(x) = f(x) alpha1 + g(x) alpha2
then both alpha1 and alpha2 have the given PDF, and are uncorrelated.
After this, the prob

o******d
发帖数: 1552

来自主题: Computation版 - 让我头痛死了的问题

去看看关于统计方面的书，特别是modelling，你需要做些统计，然后选定
一个model,当然，model里面的参数需要定出。你的主要观测是response,
那些影响因素是covariates .据个例子，比如牛二，你要确定两者的关系，
比如用线性方程去model,就是妞儿了。

r****y
发帖数: 1437

来自主题: Computation版 - 怎么解这个方程组阿

maybe this works
let a priori covariance matrix as
Sa = diag{0.25, 0.25, ...}
first guess [0.5, 0.5, ...]
minimize
(y-Kx)'(y-Kx) + x'Sax

x*********l
发帖数: 594

来自主题: Computation版 - gsl_fit_linear

里面这个stride什么意思？
Function: int gsl_fit_linear (const double * x, const size_t xstride, const
double * y, const size_t ystride, size_t n, double * c0, double * c1, double
* cov00, double * cov01, double * cov11, double * sumsq)
This function computes the best-fit linear regression coefficients (c0,
c1) of the model Y = c_0 + c_1 X for the datasets (x, y), two vectors of
length n with strides xstride and ystride. The variance-covariance matrix
for the parameters (c0, c1) is estimated from the s

g****y
发帖数: 199

来自主题: Computation版 - matlab如何生成一个特殊的随机矩阵

you can use mvnrnd:
Random numbers from multivariate normal distribution
Syntax
R = mvnrnd(mu,sigma)
r = mvnrnd(mu,sigma,cases)
Description
R = mvnrnd(mu,sigma)
returns an n-by-d matrix R of random vectors chosen from the multivariate
normal distribution with
mean mu, and covariance sigma. mu is
an n-by-d matrix, and mvnrnd generates
each row of R using the corresponding row of mu. sigma is
a d-by-d symmetric positive semi-definite
matrix, or a d-by-d-by-n array.
If sigma is an array, mvnrnd g

s**b
发帖数: 169

来自主题: Computation版 - a question about random number generator

not my major, just curious,
covariance only can determine PDF?

J**Y
发帖数: 34

来自主题: Economics版 - Re: Help, anyone has experience with estimating multinomial probit mod

My experience is that the values of latent variable from truncated normal
are easily to explode. You may want to add some truncations into your code.
Additionally, since you normalize the 1st diagnol element of covariance matrix
as 1, you can not still assume this matrix is wishart distributed. You can
check a recent paper (2000) in J. of Econometrics to see how to estimate
this kind of identified MNP model by Bayesian.
An easy way is to work on unidentified model, in which you specify priors
on

c**p
发帖数: 2

来自主题: Economics版 - econometric question

FIML is asymp effiecient if there is the normality assumption--reach CR bound.
3SLS is an extention of IV--2SLS. ---It should definitely consistent.---The
idea of using 3SLS is the stacked simultaneous equations.
Compared with the OLS equation-by-equation, when stacking for the equations,
more information are used now-- that is why we call it Full Information.
FIML & 3SLS are asymp equivaleint if there is no error covariance
resitriction.

b********9
发帖数: 749

来自主题: Economics版 - 世界引用次数最高的20篇经济学论文(转载)

欧洲学者COUPE统计了世界引用次数最高的20 篇经济学论文（http://student.ulb.ac.be/~tcoupe/updaterevealedperformances.pdf），样本包括所有发表在1975年到2000的167728篇经济学论文，引用次数是指从论文发表到2001年底的被引用次数。该文发表于知名匿名审稿学术杂志Journal of the European Economic Association（欧洲经济学会的旗舰刊物）。
排名引用次数杂志年份作者1 作者2
1 2638 Econometrica 1980 WHITE, H
（Halbert White （1980）, ‘A Heteroskedasticity-Consistent Covariance Matrix
Estimator and a Direct Test for Heteroskedasticity’, Econometrica 48, p. 817-
838 关于异方差的经典）
2 2521 Econometrica 1979 KAHNEMAN, D TVERSKY, A
（D

s*********o
发帖数: 14

来自主题: Economics版 - 求助time-series models

kx
发帖数: 16384

来自主题: Economics版 - 世界引用次数最高的20篇经济学论文zz

欧洲学者COUPE统计了世界引用次数最高的20篇经济学论文（http://student.ulb.ac.be/
~tcoupe/updaterevealedperformances.pdf），样本包括所有发表在1975年到2000的
1677
28篇经济学论文，引用次数是指从论文发表到2001年底的被引用次数。该文发表于知名匿
名审稿学术杂志Journal of the European Economic Association（欧洲经济学会的旗舰
刊物）。
排名引用次数杂志年份作者1 作者2
1 2638 Econometrica 1980 WHITE, H
（Halbert White (1980), ‘A Heteroskedasticity-Consistent Covariance Matrix
Es
timator and a Direct Test for Heteroskedasticity’, Econometrica 48, p. 817-
83
8 关于异方差的经典）
2 2521 Econometrica 1979 KAHNEMAN, D TVERSKY

G*********e
发帖数: 614

来自主题: Economics版 - [合集] 上财统计学系教师在国际一流期刊发表论文情况

☆─────────────────────────────────────☆
jjxc (jjxc) 于 (Tue Jul 8 02:33:36 2008) 提到:
序号
作者姓名
论文名称
发表刊物
权威
1
周勇*
Yong Zhou
Statistical Inference for Semi-parametric Varying-coefficient Partially
Linear Models with Error-prone Linear Covariates.
Annals of Statistics, accepted.
2
周勇*
Yong Zhou
Estimating equations inference with missing data
Journal of the American Statistical Association, accepted
3
周勇
Yong Zhou
Marginal hazard models with varying-coefficients for multivariate failure
time da

l********s
发帖数: 430

来自主题: Economics版 - How to calculate the odds Ratio using propensity score in stata

use logistic regression including propensity score as a covariate.

the
ratio?

a**n
发帖数: 3801

来自主题: Economics版 - 问个econometrics问题（包子贴）

很奇怪为啥说Y是biased？你的X里面不包括constant term？
俺觉得你要看的是GLS
WLS只是一种特殊情况，按你的描述来看，好象是有serial correlation
那一般意义上的WLS不适用
GLS的关键是怎么知道error的variance-covariance matrix
如果你能够consistently estimate var(u_t)
那么fGLS就应该是consistent的
俺觉得用Y(t-1)不合理。。这个咋能估计var(u_t)呢。。

不是

g*******a
发帖数: 43

来自主题: Economics版 - 上财经院多位教师近期在国际著名学术期刊上发表论文(ZT)

近期，经济学院又有多位教师在国际著名学术期刊上发表（或被接受发表）署名上海财
经大学的论文。自2005年以来，经济学院教师已经在国际知名期刊上发表了超过90篇论
文，其中77篇署名上海财经大学。
我院存量教师孙燕副教授与Wenyang Zhang和Jianqing Fan的合作论文“A
semiparametric model for cluster data”被国际数理统计顶尖期刊《Annals of
Statistic》正式录用，将发表在该期刊2009年10月刊上。这已是孙燕副教授近年来在
该期刊发表的第2篇论文，2007年她曾与合作者在该期刊发表过1篇题为 “Estimation
of the Covariance Matrix of Random Effects in Longitudinal Studies”的论文。
经济学院今年刚引进的海归教师李哲博士接到《Journal of Monetary Economics》的
确认函，她与Miquel Faig的合作论文“The welfare costs of expected and
unexpected inflation”

on
发帖数: 199

来自主题: Economics版 - SAS: does it have this feature? (转载)

【以下文字转载自 Statistics 讨论区】
发信人: on (一瞬), 信区: Statistics
标题: SAS: does it have this feature?
发信站: BBS 未名空间站 (Fri Feb 26 00:08:22 2010, 美东)
Want to estimate a model by Generalized Least Square, i.e., assuming the
error term has heteroscedsticity. The covariance matrix of the error is
known. I thought this is fairly standard method, but couldn't find anything
from SAS users' guide.
Does anyone know SAS has a procedure to do this? thanks.

s*****w
发帖数: 2065

来自主题: Economics版 - 请教大牛，inverse covariance matrix在金融或者计量经济方面有什么作用

I think it has already been studied in the asset pricing literature.
http://en.wikipedia.org/wiki/Cram%C3%A9r%E2%80%93Rao_bound

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

topics

未名新帖统计// 7月16日

历史上的今天