
Quantized Model Deployment

INT8 and FP16 Compression for Mobile Acceleration

Language: English
Format: Paperback
Author: Clara Whiskers
Libristo code: 52388434
Publisher: Independently published, May 2026

What if the only thing standing between your neural network and real-time mobile performance is the precision you refuse to give up?
Your model ran flawlessly in PyTorch: 400 MB of FP32 weights, a 350-watt GPU, and all the thermal headroom in the world. Then you deployed it to a phone. It stuttered. It heated up. The OS killed it before it produced a single inference. The market no longer asks whether AI can run on mobile. It asks why your AI is slower and less accurate than the cloud version. The answer is not your architecture. It is your precision.
This book is the field manual for engineers who refuse to accept the old compromise of smaller models and weaker accuracy. Inside, you will learn:
• Why INT8 and FP16 are not arbitrary format choices, but hardware-mandated keys to dedicated acceleration paths on Snapdragon, Apple Neural Engine, and MediaTek APU
• How naïve post-training quantization can crater accuracy by double-digit percentages, and the calibration, range estimation, and outlier handling techniques that prevent it
• The exact deployment architecture for TensorFlow Lite, Core ML, ONNX Runtime Mobile, and NNAPI, including operator fusion and numerical equivalence testing
• Why quantization is the only optimization that simultaneously improves latency, accuracy, and power consumption, and how to combine it with pruning and knowledge distillation for wearables and IoT
Stop accepting the compromise between speed and accuracy. Build models that run cooler, faster, and sharper on the devices already in your users' pockets. The precision you can no longer afford is the precision you can finally reclaim.
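
To make the description's point about range estimation and outlier handling concrete, here is a minimal sketch in plain Python. It is not taken from the book. The helper names (`quantize_int8`, `percentile_abs`), the synthetic calibration data, and the 99.9th-percentile clipping threshold are all assumptions chosen for illustration. It compares symmetric INT8 quantization with a naive min-max range against a percentile-clipped range when the calibration set contains a single large outlier.

```python
import random

def quantize_int8(values, max_abs):
    """Symmetric INT8 quantization: map [-max_abs, max_abs] onto [-127, 127]."""
    scale = max_abs / 127.0
    # Values outside the range saturate at +/-127.
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize(q, scale):
    """Map INT8 codes back to approximate real values."""
    return [x * scale for x in q]

def percentile_abs(values, pct):
    """Nearest-rank percentile of the absolute values (hypothetical helper)."""
    ordered = sorted(abs(v) for v in values)
    idx = min(len(ordered) - 1, int(pct / 100.0 * len(ordered)))
    return ordered[idx]

def mean_abs_error(a, b):
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

random.seed(0)
# Synthetic calibration data (an assumption): small activations plus one outlier.
activations = [random.gauss(0.0, 1.0) for _ in range(10_000)] + [80.0]

# Naive min-max range: the single outlier stretches the quantization step.
naive_max = max(abs(v) for v in activations)
q_naive, s_naive = quantize_int8(activations, naive_max)
err_naive = mean_abs_error(activations, dequantize(q_naive, s_naive))

# Percentile calibration: a tighter range that lets the outlier saturate.
clip_max = percentile_abs(activations, 99.9)
q_clip, s_clip = quantize_int8(activations, clip_max)
err_clip = mean_abs_error(activations, dequantize(q_clip, s_clip))

print(f"min-max range:    scale={s_naive:.4f}  mean abs error={err_naive:.4f}")
print(f"99.9th pct range: scale={s_clip:.4f}  mean abs error={err_clip:.4f}")
```

On this synthetic data the clipped range gives a much finer step for typical values, at the cost of a large error on the one saturated outlier. Whether that trade-off helps a real model depends on how much the outlier values matter to its output.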


About the book

Full title: Quantized Model Deployment
Language: English
Binding: Book - Paperback
Publication date: 2026
Number of pages: 234
EAN: 9798196245466
Libristo code: 52388434
Weight: 381 g
Dimensions: 170 x 244 x 13 mm