[HPDC] Jaehoon Jung, Daeyoung Park, Gangwon Jo, Jungho Park, and Jaejin Lee. SnuRHAC: A Runtime for Heterogeneous Accelerator Clusters with CUDA Unified Memory. HPDC '21: Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing, pp. 107—120, Online, June 2021. DOI: 10.1145/3431379.3460647
[PPoPP] Jaehoon Jung, Daeyoung Park, Youngdong Do, Jungho Park, and Jaejin Lee. Overlapping Host-to-Device Copy and Computation using Hidden Unified Memory. PPoPP '20: Proceedings of the 25th Symposium on Principles and Practice of Parallel Programming, pp. 321—335, San Diego, California, USA, February 2020. DOI: 10.1145/3332466.3374531
[LCPC] Gangwon Jo, Jaehoon Jung, Jiyoung Park, and Jaejin Lee. Memory-Access-Pattern Analysis Techniques for OpenCL Kernels. LCPC '17: Proceedings of the 30th International Workshop on Languages and Compilers for Parallel Computing, pp. 109—126, College Station, Texas, USA, October 2017. DOI: 10.1007/978-3-030-35225-7_9
[PPoPP-Poster] Gangwon Jo, Jaehoon Jung, Jiyoung Park, and Jaejin Lee. MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications. PPoPP '17: Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 443—444, Austin, Texas, US, February 2017. DOI: 10.1145/3018743.3019034
[PLDI] Junghyun Kim, Gangwon Jo, Jaehoon Jung, Jungwon Kim, and Jaejin Lee. A Distributed OpenCL Framework using Redundant Computation and Data Replication. PLDI '16: Proceedings of the 37th Annual ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 553—569, Santa Barbara, California, USA, June 2016. DOI: 10.1145/2908080.2908094
[SC] Junghyun Kim, Thanh Tuan Dao, Jaehoon Jung, Jinyoung Joo, and Jaejin Lee. Bridging OpenCL and CUDA: A Comparative Analysis and Translation. SC '15: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, No. 82, Austin, Texas, USA, November 2015. DOI: 10.1145/2807591.2807621
Domestic Papers
정재훈, 조강원, 이재진. CUDA 애플리케이션 바이너리의 GPU 코드를 수정하여 실행하는 기법. 한국정보과학회 2019년 한국소프트웨어종합학술대회 논문집, 1001-1003, 2019년 12월. RISS: 106573136