- [1] Agarwal, Amit, Kaushik Roy, and T. N. Vijaykumar. "Exploring high bandwidth pipelined cache architecture for scaled technology." In 2003 Design, Automation and Test in Europe Conference and Exhibition, pp. 778-783. IEEE, 2003.
- [2] Kang, Wang, Yangqi Huang, Chentian Zheng, Weifeng Lv, Na Lei, Youguang Zhang, Xichao Zhang, Yan Zhou, and Weisheng Zhao. "Voltage controlled magnetic skyrmion motion for racetrack memory." Scientific Reports 6 (2016): 23164.
- [3] [Bhattacharjee, Abhishek. "Appendix L: Advanced Concepts on Address Translation." (2018).](https://sci-hub.tw/https://www.cs.rutgers.edu/~abhib/abhib-appendix-l.pdf)
- [4] Zhang, Xiao, Sandhya Dwarkadas, and Kai Shen. "Towards practical page coloring-based multicore cache management." In Proceedings of the 4th ACM European Conference on Computer Systems, pp. 89-102. 2009.
- [5] Roy, Amitabha, Ivo Mihailovic, and Willy Zwaenepoel. "X-Stream: Edge-centric graph processing using streaming partitions." In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, pp. 472-488. 2013.
- [6] Kepner, Jeremy, Peter Aaltonen, David Bader, Aydin Buluç, Franz Franchetti, John Gilbert, Dylan Hutchison et al. "Mathematical foundations of the GraphBLAS." In 2016 IEEE High Performance Extreme Computing Conference (HPEC), pp. 1-9. IEEE, 2016.
- [7] Liu, Xiaoxiao, Mengjie Mao, Xiuyuan Bi, Hai Li, and Yiran Chen. "An efficient STT-RAM-based register file in GPU architectures." In The 20th Asia and South Pacific Design Automation Conference, pp. 490-495. IEEE, 2015.
- [8] Yavits, Leonid, Roman Kaplan, and Ran Ginosar. "Enabling Full Associativity with Memristive Address Decoder." IEEE Micro 38, no. 5 (2018): 32-40.
- [9] Kocay, William, and Donald L. Kreher. Graphs, algorithms, and optimization. CRC Press, 2016.
- [10] Handy, Jim. "Understanding the Intel/Micron 3D XPoint memory." Proc. SDC (2015).
- [11] Zhu, Maohua, Youwei Zhuo, Chao Wang, Wenguang Chen, and Yuan Xie. "Performance evaluation and optimization of HBM-enabled GPU for data-intensive applications." IEEE Transactions on Very Large Scale Integration (VLSI) Systems 26, no. 5 (2018): 831-840.
- [12] Ainsworth, Sam, and Timothy M. Jones. "Graph prefetching using data structure knowledge." In Proceedings of the 2016 International Conference on Supercomputing, pp. 1-11. 2016.
- [13] Lv, Huiwei, Guangming Tan, Mingyu Chen, and Ninghui Sun. "Understanding parallelism in graph traversal on multi-core clusters." Computer Science-Research and Development 28, no. 2-3 (2013): 193-201.
- [14] Nai, Lifeng, Ramyad Hadidi, Jaewoong Sim, Hyojong Kim, Pranith Kumar, and Hyesoon Kim. "GraphPIM: Enabling instruction-level PIM offloading in graph computing frameworks." In 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA), pp. 457-468. IEEE, 2017.