Compressing Neural Networks on Limited Computing Resources
Network compression is a crucial technique for applying deep learning models to edge or mobile devices. However, the cost of achieving higher benchmark performance through compression is continuously increasing, making network compression a significant burden—especially for small industri...
Saved in:
| Main Authors: | Seunghyun Lee, Dongjun Lee, Minju Hyun, Heeje Kim, Byung Cheol Song |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10988545/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Pruning convolution neural networks using filter clustering based on normalized cross-correlation similarity
by: Niaz Ashraf Khan, et al.
Published: (2025-04-01) -
A unified DNN weight compression framework using reweighted optimization methods
by: Mengchen Fan, et al.
Published: (2025-09-01) -
A survey of model compression techniques: past, present, and future
by: Defu Liu, et al.
Published: (2025-03-01) -
<italic>WhiteDwarf</italic>: A Holistic Co-Design Approach to Ultra-Compact Neural Inference Acceleration
by: Angel Lopez Garcia-Arias, et al.
Published: (2025-01-01) -
Neural Image Compression and Explanation
by: Xiang Li, et al.
Published: (2020-01-01)