Loss Function

Cross Entropy, KL Divergence

MLE(Maximum Likelihood Estimation)

Optimization

Gradient Descent

Bayesian Opt