Stochastic Non-convex Optimization

This is the repository for the CS-433 Machine Learning course project.

The main purpose of this project was to explore and implement some of the most widely used optimization algorithms and compare them with Stochastic Cubic Regularization (SCRN) and Stochastic Cubic Regularization with Momentum (SCRN_Momentum) on a non-convex problem.

Optimizers implemented in this repository (the update rule behind SCRN is sketched after this list):

  • SGD
  • Adam
  • Stochastic Cubic Regularization (SCRN)
  • Stochastic Cubic Regularization with Momentum (SCRN_Momentum)
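
For context, SCRN-style methods choose each step by (approximately) minimizing a cubic-regularized second-order model of the loss, built from stochastic gradient and Hessian estimates. A standard form of this subproblem (following Tripuraneni et al., 2018; given here as background, not taken from this repository's code) is:

$$ s_t = \arg\min_{s} \; g_t^\top s + \tfrac{1}{2} s^\top H_t s + \tfrac{\rho}{6} \lVert s \rVert^3, \qquad x_{t+1} = x_t + s_t $$

where $g_t$ and $H_t$ are stochastic estimates of the gradient and Hessian at $x_t$, and $\rho$ controls the cubic penalty. The exact momentum scheme used by SCRN_Momentum is defined in optimizers.py.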

Files in this repository:

  • optimizers.py: contains the optimizers implemented in this repository (a usage sketch follows this list).
  • models.py: contains the final model, an encoder with 3 convolutional layers and 3 linear layers.
  • utils.py
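
A minimal sketch of how these files might fit together, assuming optimizers.py exposes an SCRN class with a torch.optim-style interface and models.py exposes the encoder as Encoder (both names and signatures are assumptions, not verified against this repository):

import torch
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset

from models import Encoder      # assumed class name
from optimizers import SCRN     # assumed class name

# Tiny synthetic MNIST-shaped dataset so the sketch is self-contained.
data = TensorDataset(torch.randn(100, 1, 28, 28), torch.randint(0, 10, (100,)))
loader = DataLoader(data, batch_size=10)

model = Encoder()                              # assumed default constructor
optimizer = SCRN(model.parameters(), lr=1e-3)  # assumed torch.optim-style signature

for x, y in loader:
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x), y)
    # Cubic-regularized methods typically need Hessian-vector products,
    # so the backward graph is kept alive here (an assumption about this API).
    loss.backward(create_graph=True)
    optimizer.step()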

How to run the code

  1. Create a conda environment and activate it:
conda create --name <env> --file requirements.txt
conda activate <env>
  2. To train a model, main.py accepts the following arguments:
  • dataset: dataset to use: MNIST, CIFAR10 or CIFAR100.
  • conv_numbers: number of convolutional layers, default 3.
  • linear_numbers: number of linear layers, default 3.
  • hidden: size of the linear layers, default 128.
  • epochs: number of epochs to train for, default 2.
  • batch_size: batch size, default 100.
  • lr: learning rate, default 0.001.
  • optimizer: optimizer to use: SGD, Adam, Sophia, SCRN or SCRN_Momentum; can be a single value or a comma-separated list.
  • activation: activation function, default relu.
  • scheduler: set a scheduler for the learning rate.
  • verbose: whether to print detailed training progress.
  • save: save the model, default False.
  • save_path: path to save the model, default ./models/.
  • model_selection: run a grid search over learning rates for the requested optimizers.

Examples

Run one or more optimizers, each with its own learning rate

  • Run a neural network with 3 convolutional layers and 3 fully connected layers for 50 epochs on the MNIST dataset, using the Adam, SGD, SCRN and SCRN_Momentum optimizers with learning rates 0.001, 0.1, 0.001 and 0.001 respectively, with verbose output, and save the model.
python main.py --dataset MNIST --linear_numbers 3 --conv_numbers 3 --epochs 50 --lr 0.001,0.1,0.001,0.001 --optimizer Adam,SGD,SCRN,SCRN_Momentum --verbose --save
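
To run the grid search instead, an invocation like the following should work; the exact form model_selection expects is not documented above, so treat this as an assumed example:
python main.py --dataset CIFAR10 --epochs 10 --optimizer SCRN,SCRN_Momentum --model_selection --verbose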

Datasets

Principal references
