Immigration board - Handing off a CS/EE TCAD journal review opportunity
t********e
Posts: 34
1
Manuscript ID TCAD-2015-0250 entitled "An Accurate GPU Performance Model for
Effective Control Flow Divergence Optimization" with Prof. Liang as contact
author has been submitted to the Transactions on Computer-Aided Design of
Integrated Circuits and Systems.
The abstract appears at the end of this letter, along with the names of the
authors.
.....
MANUSCRIPT DETAILS
TITLE: An Accurate GPU Performance Model for Effective Control Flow
Divergence Optimization
AUTHORS: Liang, Yun; Satria, Muhammad; Rupnow, Kyle; Chen, Deming
ABSTRACT: Graphic processing units (GPUs) are composed of a group of single-
instruction multiple data (SIMD) streaming multiprocessors (SMs).
GPUs are able to efficiently execute highly data parallel tasks through SIMD
execution on the SMs. However, if those threads take diverging control
paths, all divergent paths are executed serially. In the worst case, every
thread takes a different control path and the highly parallel architecture
is used serially by each thread. This control flow divergence problem is
well known in GPU development; code transformation, memory access
redirection, and data layout reorganization are commonly used to reduce the
impact of divergence. These techniques attempt to eliminate divergence by
grouping together threads or data to ensure identical behavior.
However, prior efforts using these techniques do not model the performance
impact of any particular divergence or consider that complete elimination of
divergence may not be possible. Thus, we perform analysis of the
performance impact of divergence and potential thread regrouping algorithms
that eliminate divergence or minimize the impact of remaining divergence.
Finally, we develop a divergence optimization framework that analyzes and
transforms the kernel at compile-time and regroups the threads at run-time.
Our proposed metrics achieve performance estimation accuracy within 6.2% of
measured performance. Using these metrics, we develop thread regrouping
algorithms, which consider the impact of divergence, and speed up kernel
execution up to 4.7X on an NVIDIA GTX480.
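
To make the divergence problem in the abstract concrete, here is a toy sketch in Python. It is not the manuscript's performance model or its regrouping algorithm. It assumes a simple cost rule (a warp pays the cost of every distinct control path taken by its threads, one after another) and a naive regrouping that sorts threads by path. The names `warp_cost`, `kernel_cost`, `regroup`, and the per-path costs are all hypothetical, chosen for illustration only.

```python
WARP_SIZE = 32  # threads that execute in lockstep on NVIDIA GPUs


def warp_cost(paths, path_cost):
    """Toy cost of one warp: every distinct path in the warp runs serially."""
    return sum(path_cost[p] for p in set(paths))


def kernel_cost(thread_paths, path_cost, warp_size=WARP_SIZE):
    """Sum the toy warp costs over consecutive groups of warp_size threads.

    Assumption: warps are charged one after another, which ignores the
    parallelism across SMs that a real GPU provides.
    """
    total = 0
    for start in range(0, len(thread_paths), warp_size):
        total += warp_cost(thread_paths[start:start + warp_size], path_cost)
    return total


def regroup(thread_paths):
    """Return thread ids reordered so threads on the same path are adjacent."""
    return sorted(range(len(thread_paths)), key=lambda t: thread_paths[t])


if __name__ == "__main__":
    # 64 threads alternating between two paths, so every warp diverges.
    paths = ["A" if t % 2 == 0 else "B" for t in range(64)]
    cost = {"A": 10, "B": 30}  # hypothetical per-path costs

    before = kernel_cost(paths, cost)
    order = regroup(paths)
    after = kernel_cost([paths[t] for t in order], cost)
    print(before, after)
```

In this toy setup, both warps originally contain both paths, so each one pays for path A and path B. After sorting, each warp holds a single path and pays only for that one. A real regrouping scheme would also have to account for the memory-access and data-layout effects the abstract mentions, which this sketch leaves out.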