o****9 发帖数: 479 | 1 NSF在刚公布的更新中提到了Big data的false discoveries问题。
“Big data sets are important tools for modern science. Mining for
correlations between millions of pieces of information can reveal vital
relationships or predict future outcomes, such as risk factors for a disease
or structures of new chemical compounds.
These mining operations are not without risk, however. Researchers can have
a tough time telling when they have unearthed a nugget of truth, or what
amounts to fool's gold: A correlation that seems to have predictive value,
but ...”
http://www.nsf.gov/news/news_summ.jsp?cntn_id=135894&WT.mc_id=U | h*****w 发帖数: 8561 | 2 文章不错,谢谢分享,那篇SCIENCE 文章正好可以组里面讨论. | s**********e 发帖数: 33562 | | s******8 发帖数: 2131 | 4 现在有什么好的方法减少false positives?还是case by case? False negatives是不
是更难 |
|