INCREMENTAL PRECISION NETWORKS USING RESIDUAL INFERENCE AND FINE-GRAIN QUANTIZATION
Register | USPTO Patent |
---|---|
Application Number | 18060414 |
Status | Pending |
Filing Date | 2022-11-30 |
First Publication Date | 2023-03-23 |
Publication Date | 2023-03-23 |
Owner | Intel Corporation (USA) |
Inventor | |
Abstract
One embodiment provides for a computer-readable medium storing instructions that cause one or more processors to perform operations comprising determining a per-layer scale factor to apply to tensor data associated with layers of a neural network model and converting the tensor data to converted tensor data. The tensor data may be converted from a floating point datatype to a second datatype that is an 8-bit datatype. The instructions further cause the one or more processors to generate an output tensor based on the converted tensor data and the per-layer scale factor.
IPC Classes
- G06N 3/08 - Learning methods
- G06T 15/00 - 3D [Three Dimensional] image rendering
- G06N 5/04 - Inference or reasoning models
- G06N 3/04 - Architecture, e.g. interconnection topology
- G06F 9/46 - Multiprogramming arrangements
- G06N 3/063 - Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
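
Illustrative sketch
The operations named in the abstract (determine a per-layer scale factor, convert floating-point tensor data to an 8-bit datatype, generate an output from the converted data and the scale factor) can be illustrated with a minimal Python sketch. It is not the claimed method: it assumes a signed 8-bit target with a symmetric range and a max-absolute-value scale rule, neither of which is stated in this record, and the names `per_layer_scale`, `quantize`, and `quantized_dot` are hypothetical.

```python
from typing import List

INT8_MAX = 127  # assumption: signed 8-bit target with a symmetric range


def per_layer_scale(values: List[float]) -> float:
    """Hypothetical scale rule: map the largest magnitude in the layer to INT8_MAX."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Fall back to 1.0 for an all-zero (or empty) tensor to avoid dividing by zero.
    return max_abs / INT8_MAX if max_abs > 0 else 1.0


def quantize(values: List[float], scale: float) -> List[int]:
    """Convert floating-point values to integers clamped to [-127, 127]."""
    return [max(-INT8_MAX, min(INT8_MAX, round(v / scale))) for v in values]


def quantized_dot(x: List[float], w: List[float]) -> float:
    """Dot product accumulated in integers, then rescaled by both scale factors."""
    sx, sw = per_layer_scale(x), per_layer_scale(w)
    qx, qw = quantize(x, sx), quantize(w, sw)
    acc = sum(a * b for a, b in zip(qx, qw))  # integer accumulation
    return acc * sx * sw  # output recovered using the scale factors
```

For example, `quantized_dot([0.5, -1.0, 0.25], [1.0, 2.0, -4.0])` returns a value close to the exact floating-point result of -2.5; the small difference is the rounding error introduced by the 8-bit conversion.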