Hong, Ding-Yong Homepage

Associate Research Fellow/Professor | Hong, Ding-Yong

Publications

Journal Articles 1. Ding-Yong Hong, Tzu-Hsien Tsai, Ning Wang, Pangfeng Liu, Jan-Jan Wu, "GPU Memory Usage Optimization for Backward Propagation in Deep Network Training," Journal of Parallel and Distributed Computing (JPDC), volume 199, pages 105053, May 2025. ::: 2. Hong-Xuan Wei, Pangfeng Liu, Ding-Yong Hong, Jan-Jan Wu, An-Tai Chen, "CNN Models Acceleration Using Filter Pruning and Sparse Tensor Core," International Journal on Networking and Computing, volume 12, number 2, pages 270-294, July 2022. 3. Horng-Ruey Huang, Ding-Yong Hong, Jan-Jan Wu, Kung-Fu Chen, Pangfeng Liu, and Wei-Chung Hsu, "Accelerating Video Captioning on Heterogeneous System Architectures," ACM Transactions on Architecture and Code Optimization (TACO), volume 19, number 3, pages 1-25, May 2022. 4. Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu, "Optimizing Data Permutations in Structured Loads/Stores Translation and SIMD Register Mapping for a Cross-ISA Dynamic Binary Translator," Journal of Systems Architecture (JSA), volume 98, pages 173-190, September 2019. 5. Ding-Yong Hong, Shih-Kai Lin, Sheng-Yu Fu, Jan-Jan Wu, Wei-Chung Hsu, "Enhancing Transactional Memory Execution via Dynamic Binary Translation," ACM Applied Computing Review (ACR), volume 19, number 1, pages 48-58, April 2019. ::: 6. Yu-Ping Liu, Ding-Yong Hong, Jan-Jan Wu, Sheng-Yu Fu, Wei-Chung Hsu, "Exploiting SIMD Asymmetry in ARM-to-x86 Dynamic Binary Translation," ACM Transactions on Architecture and Code Optimization (TACO), volume 16, number 1, pages 2:1-2:24, February 2019. 7. Ding-Yong Hong, Jan-Jan Wu, Yu-Ping Liu, Sheng-Yu Fu, Wei-Chung Hsu, "Processor-Tracing Guided Region Formation in Dynamic Binary Translation," ACM Transactions on Architecture and Code Optimization (TACO), volume 15, number 4, pages 52:1-52:25, November 2018. ::: ::: 8. Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu, "Efﬁcient and Retargetable SIMD Translation in a Dynamic Binary Translator," Software: Practice and Experience (SPE), volume 48, number 6, pages 1312-1330, June 2018. 9. Ding-Yong Hong, Yu-Ping Liu, Sheng-Yu Fu, Jan-Jan Wu, Wei-Chung Hsu, "Improving SIMD Parallelism via Dynamic Binary Translation," ACM Transactions on Embedded Computing Systems (TECS), volume 17, number 3, pages 61:1 - 61:27, February 2018. 10. Ding-Yong Hong, Chun-Chen Hsu, Cheng-Yi Chou, Wei-Chung Hsu, Pangfeng Liu, Jan-Jan Wu, "Optimizing Control Transfer and Memory Virtualization in Full System Emulators," ACM Transactions on Architecture and Code Optimization (TACO), volume 12, number 4, pages 1-24, December 2015. ::: 11. Chun-Chen Hsu, Ding-Yong Hong, Wei-Chung Hsu, Pangfeng Liu and Jan-Jan Wu, "A Dynamic Binary Translation System in a Client/Server Environment," Journal of Systems Architecture (JSA), volume 61, number 7, pages 307 - 319, August 2015. ::: 12. Ding-Yong Hong, Jan-Jan Wu, Pen-Chung Yew, Wei-Chung Hsu, Chun-Chen Hsu, Pangfeng Liu, Chien-Min Wang and Yeh-Ching Chung, "Efficient and Retargetable Dynamic Binary Translation on Multicores," IEEE Transactions on Parallel and Distributed Systems (TPDS), volume 25, number 3, pages 622 - 632, March 2014. ::: Conference Papers 1. Ze-Wei Liou and Ding-Yong Hong, "Optimizing Compute Core Assignment for Dynamic Batch Inference in AI Inference Accelerator," ACM Symposium on Applied Computing (SAC), March 2025. ::: 2. Bing-Jou Wu, Ding-Yong Hong, Pangfeng Liu, Jan-Jan Wu, "Execution Time Optimization for Pipeline Deep Network Training on Multiple GPUs," Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), March 2025. 3. Scott Cheng, Mahmut Kandemir, Ding-Yong Hong, "Speculative Monte-Carlo Tree Search," Annual Conference on Neural Information Processing Systems (NeurIPS), December 2024. 4. Ping-Han Tu, Yu-Che Cheng, Ding-Yong Hong, Pangfeng Liu, Jan-Jan Wu, "Approximation Algorithms and Simulated Annealing Heuristics for Row-And-Column Pruning of Deep Neural Networks," IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA), November 2024. ::: 5. Chi-Yu Chiu, Ding-Yong Hong, Pangfeng Liu and Jan-Jan Wu, "Effective Compression of Language Models by Combining Pruning and Knowledge Distillation," IEEE International Conference on Computers, Software, and Applications (COMPSAC), IEEE, IEEE, Osaka, Japan, July 2024. ::: 6. Chao-Yu Lee, Ding-Yong Hong, Pangfeng Liu and Jan-Jan Wu, "Function Clustering to Optimize Resource Utilization on Container Platform," IEEE International Conference on Parallel and Distributed Systems, December 2023. 7. Cheng-Hung Wu, Ding-Yong Hong, Pangfeng Liu and Jan-Jan Wu, "Exploiting Fine-Grained Structured Pruning for Efficient Inference on CNN Model," IEEE International Conference on Parallel and Distributed Systems, December 2023. 8. Chien-Hung Lin, Ding-Yong Hong, Pangfeng Liu, Jan-Jan Wu, "Accelerate Inference of CNN Models on CPU via Column Combining Based on Simulated Annealing," the International Symposium on Computing and Networking, Japan, November 2023. 9. Kung-Fu Chen and Ding-Yong Hong, "Rewriting Deep Learning Models for Maximizing Edge TPU Utilization," IEEE International Conference on Parallel and Distributed Systems (ICPADS), December 2022. 10. Yi You, Pangfeng Liu, Ding-Yong Hong, Jan-Jan Wu and Wei-Chung Hsu, "Accelerating Convolutional Neural Networks via Inter-operator Scheduling," IEEE International Conference on Parallel and Distributed Systems (ICPADS), Best Paper Runner-up, December 2022. ::: 11. Yu-Jen Chang, Ding-Yong Hong, Pangfeng Liu, and Jan-Jan Wu, "Efficient Inference on Convolutional Neural Networks by Image Difficulty Prediction," IEEE International Conference on Big Data, the Machine learning with Big Data track., December 2022. 12. Kuan-Wei Lu, Pangfeng Liu, Ding-Yong Hong, Jan-Jan Wu, "Efficient Dual Batch Size Deep Learning for Distributed Parameter Server Systems," IEEE Computers, Software, and Applications Conference (COMPSAC 2022, acceptance rate 22%), June 2022. 13. Chang-Han Chiang, Pangfeng Liu, Da-Wei Wang, Ding-Yong Hong, and Jan-Jan Wu, "Optimal Branch Location Finding for Cost effective Inference on Branchynet," IEEE International Conference on Big Data (top conference), the Machine Learning with Big Data track, December 2021. 14. An-Tai Chen, Pangfeng Liu, Ding-Yong Hong, and Jan-Jan Wu, "Accelerate CNN Models via Filter Pruning and Sparse Tensor Core," International Symposium on Computing and Networking, November 2021. 15. Horng-Ruey Huang, Ding-Yong Hong, Jan-Jan Wu, Pangfeng Liu, Wei-Chung Hsu, "Efficient Video Captioning on Heterogeneous System Architectures," 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS, top conference), Portland, Oregon USA (online), May 2021. 16. Chih-Min Lin, Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu, "Exploiting Vector Processing in Dynamic Binary Translation," the International Conference on Parallel Processing (ICPP), Kyoto, Japan, August 2019. 17. Ding-Yong Hong, Jan-Jan Wu, Yu-Ping Liu, Sheng-Yu Fu, Wei-Chung Hsu, "Processor-Tracing Guided Region Formation in Dynamic Binary Translation," High Performance and Embedded Architecture and Compilation (HiPEAC), Valencia, Spain, January 2019. ::: ::: 18. Shih-Kai Lin, Ding-Yong Hong, Sheng-Yu Fu, Jan-Jan Wu, Wei-Chung Hsu, "Dynamic Tuning of Applications using Restricted Transactional Memory," ACM Research in Adaptive and Convergent Systems, ACM Digital Library, Hawaii, USA, October 2018. 19. Sheng-Yu Fu, Chih-Min Lin, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu, "Exploiting SIMD Capability in an ARMv7-to-ARMv8 Dynamic Binary Translator," International Conference on Compilers, Architectures and Synthesis for Embedded Systems (CASES), pages 1-3, Turin, Italy, September 2018. 20. Chih-Min Lin, Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu, "Exploiting SIMD Optimization in an ARMv7 Dynamic Binary Translator," Design Automation Conference, San Francisco, USA, June 2018. 21. Yu-Ping Liu, Ding-Yong Hong, Jan-Jan Wu, Sheng-Yu Fu, Wei-Chung Hsu, "Exploiting Asymmetric SIMD Register Configurations in ARM-to-x86 Dynamic Binary Translation," The 26th International Conference on Parallel Architectures and Compilation Techniques (PACT), Portland, Oregon, USA, September 2017, This is the first Taiwan paper accepted by PACT. 22. Sheng-Yu Fu, Ding-Yong Hong, Yu-Ping Liu, Jan-Jan Wu, Wei-Chung Hsu, "Dynamic Translation of Structured Loads/Stores and Register Mapping for Architectures with SIMD Extensions," ACM SIGPLAN/SIGBED Conference on Languages, Compilers, Tools and Theory for Embedded Systems, Barcelona, Spain, June 2017, acceptance rate 25% 23. Ding-Yong Hong, Sheng-Yu Fu, Yu-Ping Liu, Jan-Jan Wu, and Wei-Chung Hsu, "Exploiting Longer SIMD Lanes in Dynamic Binary Translation," IEEE International Conference on Parallel and Distributed Systems (ICPADS), December 2016, Best Paper (out of 412 submissions) ::: 24. Chun-Chen Hsu, Ding-Yong Hong, Cheng-Yi Chou, Jan-Jan Wu, Wei-Chung Hsu, and Pangfeng Liu, "Optimizing Control Transfer and Memory Virtualization in Full System Emulators," Eruopean Network of Excellence on High-Performance and Embedded Architecture and Compilation (HiPEAC), January 2016. 25. Sheng-Yu Fu, Ding-Yong Hong, Jan-Jan Wu, Pangfeng Liu and Wei-Chung Hsu, "SIMD Code Translation in an Enhanced HQEMU," IEEE International Conference on Parallel and Distributed Systems (ICPADS), December 2015. ::: 26. Yi-Hong Lyu, Ding-Yong Hong, Tai-Yi Wu, Jan-Jan Wu, Wei-Chung Hsu, Pangfeng Liu and Pen-Chung Yew, "DBILL: An Efficient and Retargetable Dynamic Binary Instrumentation Framework using LLVM Backend," ACM International Conference on Virtual Execution Environments (VEE), March 2014. 27. Chun-Chen Hsu, Pangfeng Liu, Jan-Jan Wu, Pen-Chung Yew, Ding-Yong Hong, Wei-Chung Hsu and Chien-Min Wang, "Improving Dynamic Binary Optimization Through Early-Exit Guided Code Region Formation," ACM International Conference on Virtual Execution Environments (VEE), March 2013. 28. Chun-Chen Hsu, Pangfeng Liu, Jan-Jan Wu, Pen-Chung Yew, Ding-Yong Hong, Wei-Chung Hsu and Chien-Min Wang, "Improving Region Selection Through Early-Exit Detection," Asia-Pacific Programming Languages and Compilers Workshop (APPLC), Beijing, China, June 2012. 29. Ding-Yong Hong, Chun-Chen Hsu, Pen-Chung Yew, Jan-Jan Wu, Wei-Chung Hsu, Pangfeng Liu, Chien-Min Wang and Yeh-Ching Chung, "HQEMU: A Multi-Threaded and Retargetable Dynamic Binary Translator on Multicores," Proceedings of the Tenth International Symposium on Code Generation and Optimization (CGO), March 2012. ::: 30. Chun-Chen Hsu, Pangfeng Liu, Chien-Min Wang, Jan-Jan Wu, Ding-Yong Hong, Pen-Chung Yew and Wei-Chung Hsu, "LnQ: Building High Performance Dynamic Binary Translators with Existing Compiler Backends," International Conference on Parallel Processing (ICPP), September 2011. 31. Ding-Yong Hong, Fang-Ping Pai, Shih-Hsiang Lo and Yeh-Ching Chung, "A Scalable HLA RTI System Based on Multiple-FedServ Architecture," International Conference on Computer Modelling and Simulation (UKSim), March 2010. 32. Shih-Hsiang Lo, Cheng-An Chiu, Fang-Ping Pai, Ding-Yong Hong and Yeh-Ching Chung, "MGRID: A Modifiable- Grid Region Matching Approach for DDM in the HLA RTI," Spring Simulation Multiconference (SpringSim), March 2009. 33. Seetharami Seelam, I-Hsin Chung, Ding-Yong Hong, Hui-Fang Wen and Hao Yu, "Early Experiences in Application Level I/O Tracing on Blue Gene Systems," IEEE International Parallel and Distributed Processing Symposium (IPDPS), April 2008. 34. Ding-Yong Hong, Ching-Wen You and Yeh-Ching Chung, "An Efficient MPI-IO for Noncontiguous Data Access over InfiniBand," International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN), December 2005. Technical Reports 1. Ding-Yong Hong, "Efficient and Retargetable Dynamic Binary Translation," Ph.D. dissertation, 2013. :::