Case study of WebAssembly Runtimes for AI Applications on the Edge

Téléchargements

Téléchargements par mois depuis la dernière année

Khelifa, S. e., Bagaa, M., Messaoud, A. O. et Ksentini, A. (2024). Case study of WebAssembly Runtimes for AI Applications on the Edge. Dans 2024 Global Information Infrastructure and Networking Symposium (GIIS) DOI 10.1109/GIIS59465.2024.10449907.

[thumbnail of BAGAA_M_145_POST.pdf]
Prévisualisation
PDF
Télécharger (761kB) | Prévisualisation

Résumé

In the realm of Artificial Intelligence (AI), the need for immediate response times has given rise to the Cloud Edge Computing Continuum (CECC). This new paradigm, aided by emerging technologies, addresses latency and network delays while promoting portability, security, and efficiency, thereby enhancing Quality of Service (QoS). A noteworthy technology in this context is WebAssembly (Wasm), originally conceived to amplify web performance. It has transitioned to the CECC, primarily due to key enablers like the WebAssembly System Interface (Wasi) and the Wasm runtime. Besides offering heightened security through its sandboxing mechanism, WebAssembly's compact code paves the way for rapid cold start times and seamless migration in AI applications. However, with WebAssembly's nascent integration into the CECC, several questions arise. Prominent among them is the efficiency of deploying AI tasks in Wasm binary format, particularly the performance of Wasm runtimes in AI-centric tasks and potential factors affecting such executions. Addressing these queries, our study examines various deep-learning models on standalone WebAssembly runtimes. Our findings indicate that, for smaller networks with optimized parameters, standalone runtimes approach native performance, presenting just a 1.1x overhead on average. Contrarily, networks with an extensive parameter set exhibited pronounced overheads. We also identified multiple factors, associated both with run-times and neural networks, offering insights for future research endeavors.

Type de document: Document issu d'une conférence ou d'un atelier
Mots-clés libres: Runtime Quality of service Security Time factors Artificial intelligence Task analysis Edge computing
Date de dépôt: 03 juill. 2026 15:30
Dernière modification: 03 juill. 2026 15:30
Version du document déposé: Post-print (version corrigée et acceptée)
URI: https://depot-e.uqtr.ca/id/eprint/12960

Actions (administrateurs uniquement)

Éditer la notice Éditer la notice