Constrained Edge AI Deployment: Fine-Tuning vs Distillation for LLM Compression
Jan 1, 2025
Jacob Sander
David Moe
Achraf Cohen
Brent Venable
Venkat Dasari
Brian Jalaian
Abstract
This paper investigates LLM compression techniques for edge deployment, comparing fine-tuning and knowledge distillation approaches under the resource constraints typical of edge hardware.
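Since the full text is not included here, the following is only a generic sketch of the knowledge-distillation objective the abstract refers to: a student model is trained to match a teacher's temperature-softened output distribution. The function names, temperature value, and the KL-based loss form follow the standard distillation formulation, not necessarily the exact setup used in this paper.

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax; higher temperature gives a softer
    # distribution, exposing the teacher's "dark knowledge" over classes.
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # KL divergence from the teacher's softened distribution to the
    # student's, scaled by T^2 so gradients stay comparable across T
    # (the standard Hinton-style distillation term).
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl
```

In practice this term is combined with the ordinary cross-entropy loss on ground-truth labels, and the student (the compressed edge model) is much smaller than the teacher.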
Type
Publication
arXiv preprint arXiv:2505.18166