Skip to content

Hyper-Parameter Tuning

Advantage Disadvantage
Manual Time-Consuming
Grid Search Computationally-expensive
Random Search Non-deterministic
Evolutionary Randomization, Natural Selection, Mutation
Bayesian Probabilistic model of relationship b/w cost function and hyper-parameters, using information gathered from trials
Gradient-Based Treat hyper parameter tuning like parameter fitting
Early-Stopping Focus resources on settings that look promising
eg: Successive Halving

Speed Up

  • Parallelizing
  • Caching
  • Random sampling: Won’t work with caching

image-20240317160544276

Clustering

Elbow Method

Plot cost function as function of no of clusters

image-20240711155441455

Last Updated: 2024-05-14 ; Contributors: AhmedThahir

Comments