Article in conference proceedings

Pruning and Early-Exit Co-Optimization for CNN Acceleration on FPGAs

G. Korol, M. Jordan, M. Rutzig, J. Castrillon, and A. Beck.
Proceedings of the 2023 Design, Automation and Test in Europe Conference (DATE), 6 pp. IEEE, (April 2023)
DOI: 10.23919/DATE56975.2023.10137244

Abstract

The challenge of processing heavy-load ML tasks, particularly CNN-based ones at resource-constrained IoT devices, has encouraged the use of edge servers. The edge offers performance levels higher than the end devices and better latency and security levels than the Cloud. On top of that, the rising complexity of ML applications, the ever-increasing number of connected devices, and the current demands for energy efficiency require optimizing such CNN models. Pruning and early-exit are notable optimizations that have been successfully used to alleviate the computational cost of inference. However, these optimizations have not yet been exploited simultaneously: while pruning is usually applied at design time, which involves retraining the CNN before deployment, early-exit is inherently dynamic. In this work, we propose AdaPEx, a framework that exploits the intrinsic reconfigurable FPGA capabilities so both can be cooperatively employed. AdaPEx first explores the trade-off between pruning and early-exit at design-time, creating a design space never exploited in the state-of-the-art. Then, AdaPEx applies FPGA reconfiguration as a means to enable the combined use of pruning and early-exit dynamically. At runtime, this allows matching the inference processing to the current edge conditions and a user-configurable accuracy threshold. In a smart IoT application, AdaPEx processes up to 1.32x more inferences and improves EDP by up to 2.55x over the state-of-the-art FPGA-based FINN accelerator.

BibTeX key: korol_date23
Entry type: inproceedings
Book title: Proceedings of the 2023 Design, Automation and Test in Europe Conference (DATE)
Year: 2023
Month: apr
Pages: 6pp
Publisher: IEEE
Series: DATE'23
Location: Antwerp, Belgium
DOI: 10.23919/DATE56975.2023.10137244
URL: https://ieeexplore.ieee.org/document/10137244

Cite this publication

@inproceedings{korol_date23,
  abstract = {The challenge of processing heavy-load ML tasks, particularly CNN-based ones at resource-constrained IoT devices, has encouraged the use of edge servers. The edge offers performance levels higher than the end devices and better latency and security levels than the Cloud. On top of that, the rising complexity of ML applications, the ever-increasing number of connected devices, and the current demands for energy efficiency require optimizing such CNN models. Pruning and early-exit are notable optimizations that have been successfully used to alleviate the computational cost of inference. However, these optimizations have not yet been exploited simultaneously: while pruning is usually applied at design time, which involves retraining the CNN before deployment, early-exit is inherently dynamic. In this work, we propose AdaPEx, a framework that exploits the intrinsic reconfigurable FPGA capabilities so both can be cooperatively employed. AdaPEx first explores the trade-off between pruning and early-exit at design-time, creating a design space never exploited in the state-of-the-art. Then, AdaPEx applies FPGA reconfiguration as a means to enable the combined use of pruning and early-exit dynamically. At runtime, this allows matching the inference processing to the current edge conditions and a user-configurable accuracy threshold. In a smart IoT application, AdaPEx processes up to 1.32x more inferences and improves EDP by up to 2.55x over the state-of-the-art FPGA-based FINN accelerator.},
  added-at = {2025-01-02T10:40:15.000+0100},
  author = {Korol, Guilherme and Jordan, Michael Guilherme and Rutzig, Mateus Beck and Castrillon, Jeronimo and Beck, Antonio Carlos Schneider},
  biburl = {https://puma.scadsai.uni-leipzig.de/bibtex/294bf4edfa76ed148aa979f59d5e3ccf6/joca354e},
  booktitle = {Proceedings of the 2023 Design, Automation and Test in Europe Conference (DATE)},
  doi = {10.23919/DATE56975.2023.10137244},
  editor = {IEEE},
  interhash = {bfec57578ee4bc56726670db02833b83},
  intrahash = {94bf4edfa76ed148aa979f59d5e3ccf6},
  keywords = {FPGA accelerator nn},
  location = {Antwerp, Belgium},
  month = apr,
  pages = {6pp},
  publisher = {IEEE},
  series = {DATE'23},
  timestamp = {2025-01-02T10:40:15.000+0100},
  title = {Pruning and Early-Exit Co-Optimization for CNN Acceleration on FPGAs},
  url = {https://ieeexplore.ieee.org/document/10137244},
  year = 2023
}
