Prefetch Distance Control Strategy Based on Cache Behavior in Threaded Prefetching

Huang Yan, Zhang Qi-kun, Duan Zhao-lei, Gu Zhi-min

Citation: Huang Yan, Zhang Qi-kun, Duan Zhao-lei, Gu Zhi-min. Prefetch Distance Control Strategy Based on Cache Behavior in Threaded Prefetching[J]. Journal of Electronics & Information Technology, 2015, 37(7): 1633-1638. doi: 10.11999/JEIT141429

doi: 10.11999/JEIT141429
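Funding: Supported by the National Natural Science Foundation of China (61370062), the Zhengzhou Science and Technology Research Program (20130725), and the Doctoral Fund Project (2013BSJJ050).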

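  • Abstract: Most existing thread-based data prefetching methods for pointer-intensive applications control the prefetch distance poorly. To address this, the paper proposes a prefetch distance control strategy based on cache behavior characteristics. The strategy uses the data cache behavior observed while a pointer-intensive application executes to build a prefetch distance control model, which avoids shared-cache pollution and reduces contention for system resources. It also skips the prefetching of some data that has no loop-carried dependence, which balances the work between the helper thread and the main thread and improves the timeliness of thread-based prefetching. Experimental results show that controlling the prefetch distance with this strategy further improves the performance of threaded prefetching.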
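The abstract does not give the control model itself, so the following is only a minimal sketch of the general idea of bounding a helper thread's lead over the main thread. It is not the authors' model. All names and parameters (`prefetch_distance_bounds`, `helper_action`, the latency and cache-size inputs) are hypothetical. The sketch assumes the lower bound comes from memory latency and the upper bound from the share of the shared cache available for prefetched lines.

```python
import math


def prefetch_distance_bounds(miss_latency_cycles, cycles_per_iteration,
                             shared_cache_lines, lines_per_iteration):
    """Return (lower, upper) bounds on the helper thread's lead, in iterations.

    Hypothetical model, not taken from the paper.
    """
    # Lower bound: lead far enough that a line arrives before the main thread uses it.
    lower = math.ceil(miss_latency_cycles / cycles_per_iteration)
    # Upper bound: never hold more prefetched lines than the cache share can keep,
    # otherwise useful lines are evicted (shared-cache pollution).
    upper = shared_cache_lines // lines_per_iteration
    return lower, upper


def helper_action(helper_iter, main_iter, lower, upper):
    """Decide what the helper thread does next (hypothetical policy)."""
    lead = helper_iter - main_iter
    if lead > upper:
        return "wait"      # too far ahead: pause to avoid polluting the cache
    if lead < lower:
        return "skip"      # too close: skip some prefetches to regain the lead
    return "prefetch"      # within the window: issue prefetches normally


if __name__ == "__main__":
    lower, upper = prefetch_distance_bounds(300, 40, 4096, 8)
    print(lower, upper)
    for helper_iter in (3, 100, 600):
        print(helper_iter, helper_action(helper_iter, 0, lower, upper))
```

A caller would need to check that `lower <= upper` first. When it is not, no lead satisfies both constraints, and a real system would have to reduce the per-iteration footprint or give up on timeliness.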
Publication history
  • Received: 2014-11-13
  • Revised: 2015-03-17
  • Published: 2015-07-19
