reduce-ml-cost-with-quantization-pruning
Public
This project demonstrates how model design choices affect both energy consumption and economic cost. It analyzes weight importance within a neural network, estimates the total FLOPs required for inference, and explores how quantization and pruning affect resource efficiency.
cost-optimization, efficiency, machine-learning, neural-network, neural-networks, pruning, quantization, quantization-efficient-network
Created: 2025-07-04T23:13:20
Updated: 2025-07-06T02:18:04
Stars: 1
Stars Increase: 0
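
The three ideas named in the description (inference FLOPs, pruning, quantization) can be illustrated with a minimal sketch. This is not the repository's code: the function names `dense_flops`, `magnitude_prune`, and `quantize_int8` are hypothetical, and the sketch assumes fully connected layers, magnitude as the importance score, and symmetric 8-bit quantization.

```python
def dense_flops(layer_sizes):
    """Estimate FLOPs for one forward pass through fully connected layers.

    Assumes 2 FLOPs (one multiply, one add) per weight and ignores
    biases and activation functions.
    """
    return sum(2 * n_in * n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))


def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude.

    Magnitude is used here as a simple stand-in for weight importance.
    """
    k = int(len(weights) * sparsity)
    # Indices ordered from least to most important (by absolute value).
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:k]:
        pruned[i] = 0.0
    return pruned


def quantize_int8(weights):
    """Symmetric per-tensor quantization to integers in [-127, 127].

    Returns the integer codes and the scale needed to dequantize them
    (approximate weight = code * scale).
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


if __name__ == "__main__":
    sizes = [784, 128, 10]  # hypothetical small classifier
    print("FLOPs per inference:", dense_flops(sizes))

    weights = [0.5, -0.1, 0.05, -0.9]
    print("Pruned at 50% sparsity:", magnitude_prune(weights, 0.5))

    codes, scale = quantize_int8(weights)
    print("int8 codes:", codes, "scale:", scale)
```

Pruning lowers the FLOP count only when the runtime actually skips zeroed weights, and int8 storage takes 1 byte per weight instead of 4 for float32, so realized savings depend on the hardware and kernels used.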