The document presents a new technique for adapting trained neural networks to fixed-point arithmetic, aimed at improving computational efficiency in resource-constrained embedded systems while maintaining accuracy. It details how to synthesize fixed-point codes through linear programming methods to ensure performance aligns closely with original floating-point networks, providing specific examples and constraints managed during the tuning process. Experimental results validate the method, demonstrating that the synthesized fixed-point neural networks can meet user-defined accuracy thresholds comparable to their floating-point counterparts.
Related topics: