모바일 메뉴 닫기
 

연구

Research & Laboratory

제목
세미나 [05/03] On-Device AI Systems: From DNN Model Compressions to Machine Learning Accelerators
작성일
2019.04.29
작성자
전기전자공학부
게시글 내용

< BK21 플러스 BEST 정보기술 사업단 세미나 개최 안내 >


개최일시 : 2019년 5월 3일 (금) 16:00 ~ 17:00

개최장소 : 제 4공학관 D403호

세미나 제목 : On-Device AI Systems: From DNN Model Compressions to Machine Learning Accelerators

내용 :

AI is one of the hottest keywords recently in computer societies. Its great successes in computer vision and speech recognition have encouraged many researchers and engineers to apply AI to almost every area of science and engineering. Though key AI algorithms were originally developed in decades ago, it was big data availability and hardware acceleration that really made AI so successful today.

With On-Device AI systems, we aim to deliver transparent AI user experience on user devices without connecting to cloud servers. To enable AI functionalities on mobile and consumer devices, a system-level holistic optimization from algorithm to chip is crucial to overcome AI application’s compute, memory and power requirements. We have to develop small but accurate DNN models, hardware accelerators to run the models, and runtime and compiler to manage the accelerators efficiently.

First, I will brief discuss background knowledge to understand DNN accelerators. I would like to cover important academic achievements. And, I like to show the landscape of NPU industry, introducing various NPU chips and explaining their pros and cons. NPU technology makes progress so fast and the gap between academia and industry is small. Many commercial NPUs such as Huawei’s Kirin NPU were initially made by university researchers. Then, I will introduce our efforts for DNN Model compression and DNN compiler and runtime acceleration. I would like to show model compression techniques such as pruning, quantization, and matrix factorization. And, I like to discuss how to efficiently exploit computing resources on a mobile SoC via a runtime technique. Finally, I will conclude the talk by discussing future works. Throughout the talk, I would like to provide the audience chances to grasp key ideas on on-device AI systems.



강연자 성함&직함 / 소속 : 김대현 상무 / Samsung Research

초청자 : 전기전자공학과 교수 노원우