reduce-ml-cost-with-quantization-pruning
PublicThis project demonstrates the impact of model design choices on both energy consumption and economic cost. It analyzes the weight importance within a neural network, estimates the total FLOPs required for inference, and explores how quantization and pruning affect resource efficiency.