Proposed idea: Training a neural network with particle swarm optimization (PSO) instead of traditional backpropagation and gradient descent.
Current issues with backprop:
- Convergence: Starting from a single initialization point for the weights and biases often leads to convergence at a suboptimal local minimum. Another major issue is the vanishing gradient problem, i.e., regions of the loss surface where the slope is (near) zero and learning stalls.
- Derivatives: Backprop requires a differentiable non-linear activation function so that gradients can be computed, and the additional backward pass makes each training step roughly twice as expensive as a forward pass alone.
This is where PSO comes in: a population-based, gradient-free optimization method (sketched after this list) which:
- Tends to reach a good convergence point when tuned well, since the swarm explores from many initialization points instead of one.
- Does not require a differentiable "fitness" function.
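For reference, here is a minimal sketch of the canonical PSO velocity/position update, not necessarily the exact variant used in this repo; the inertia weight `w` and acceleration coefficients `c1`, `c2` are illustrative defaults. When training a network, each particle's position is a candidate weight vector and its fitness is the training loss.

```python
import numpy as np

def pso_step(positions, velocities, personal_best, global_best,
             w=0.7, c1=1.5, c2=1.5):
    """One PSO iteration. positions/velocities/personal_best are (n_particles, n_dims),
    global_best is (n_dims,). w is the inertia weight, c1/c2 the acceleration coefficients."""
    r1 = np.random.rand(*positions.shape)
    r2 = np.random.rand(*positions.shape)
    # Inertia + pull toward each particle's own best + pull toward the swarm's best.
    velocities = (w * velocities
                  + c1 * r1 * (personal_best - positions)
                  + c2 * r2 * (global_best - positions))
    positions = positions + velocities
    return positions, velocities
```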
That is currently what this repo is all about: completely eliminating backprop from the equation and relying solely on PSO.
Results: A network is certainly trainable this way. We over-fit a basic XOR network (5-3-3-3-1) and tracked the losses: using 32 particles, it took roughly 2k iterations for the training loss to drop to 0.06 (~94% training accuracy). Heavily over-fit, but trainable nonetheless.
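For concreteness, below is a minimal sketch (not the repo's actual code) of how the 5-3-3-3-1 network can be encoded for PSO: all weights and biases are flattened into one vector per particle, and the PSO fitness is simply the training loss, so no derivatives are ever needed. The sigmoid activation and MSE loss are illustrative assumptions; `fitness` plugs directly into a PSO loop like the one sketched above.

```python
import numpy as np

LAYERS = [5, 3, 3, 3, 1]   # the 5-3-3-3-1 topology from the experiment above
N_DIMS = sum(i * o + o for i, o in zip(LAYERS[:-1], LAYERS[1:]))   # 46 parameters per particle

def forward(theta, X):
    """Run the network using weights sliced out of the flat particle vector theta."""
    a, i = X, 0
    for n_in, n_out in zip(LAYERS[:-1], LAYERS[1:]):
        W = theta[i:i + n_in * n_out].reshape(n_in, n_out)
        i += n_in * n_out
        b = theta[i:i + n_out]
        i += n_out
        a = 1.0 / (1.0 + np.exp(-(a @ W + b)))   # sigmoid; differentiability is never used
    return a

def fitness(theta, X, y):
    """PSO fitness = plain training loss (MSE here); no gradients involved."""
    return np.mean((forward(theta, X) - y) ** 2)
```

Each of the 32 particles is then a 46-dimensional weight vector, and the swarm's best position is taken as the trained network.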
A best-of-both-worlds method: Hybrid PSO with backprop, written in TensorFlow
Training with both PSO and backprop gives a better rate of convergence. Current results are far better than the purely PSO-based approach, but the method itself is a major memory hog.
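The sketch below is only an assumption about what such a hybrid scheme can look like, not this repo's actual TensorFlow code: every particle is a full flattened copy of the network's weights (which is where the memory cost comes from), PSO moves the swarm globally, and a few SGD/backprop steps locally refine the swarm's best particle on each iteration. The 5-3-3-3-1 topology and 32 particles follow the experiment above; the toy parity data and all other hyperparameters are illustrative.

```python
import numpy as np
import tensorflow as tf

# 5-3-3-3-1 network from the experiment above, built with Keras.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(5,)),
    tf.keras.layers.Dense(3, activation="sigmoid"),
    tf.keras.layers.Dense(3, activation="sigmoid"),
    tf.keras.layers.Dense(3, activation="sigmoid"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
loss_fn = tf.keras.losses.MeanSquaredError()
opt = tf.keras.optimizers.SGD(learning_rate=0.5)

def flatten(ws):
    return np.concatenate([w.ravel() for w in ws])

def unflatten(theta):
    out, i = [], 0
    for w in model.get_weights():
        out.append(theta[i:i + w.size].reshape(w.shape))
        i += w.size
    return out

def loss_of(theta, X, y):
    model.set_weights(unflatten(theta))
    return float(loss_fn(y, model(X, training=False)))

def backprop_refine(theta, X, y, steps=5):
    """Load a particle into the model, take a few gradient steps, return the refined vector."""
    model.set_weights(unflatten(theta))
    for _ in range(steps):
        with tf.GradientTape() as tape:
            loss = loss_fn(y, model(X, training=True))
        grads = tape.gradient(loss, model.trainable_variables)
        opt.apply_gradients(zip(grads, model.trainable_variables))
    return flatten(model.get_weights())

# Toy parity data standing in for the XOR task (illustrative only).
X = np.random.randint(0, 2, size=(64, 5)).astype("float32")
y = (X.sum(axis=1) % 2).reshape(-1, 1).astype("float32")

n_dims = flatten(model.get_weights()).size
swarm = np.random.randn(32, n_dims).astype("float32") * 0.5   # 32 particles, each a full weight copy
vel = np.zeros_like(swarm)
pbest = swarm.copy()
pbest_f = np.array([loss_of(p, X, y) for p in swarm])
gbest = pbest[pbest_f.argmin()].copy()

for it in range(200):
    gbest = backprop_refine(gbest, X, y)          # local refinement via backprop
    for i in range(len(swarm)):                   # global exploration via PSO
        f = loss_of(swarm[i], X, y)
        if f < pbest_f[i]:
            pbest_f[i], pbest[i] = f, swarm[i].copy()
    if loss_of(gbest, X, y) > pbest_f.min():
        gbest = pbest[pbest_f.argmin()].copy()
    r1, r2 = np.random.rand(*swarm.shape), np.random.rand(*swarm.shape)
    vel = 0.7 * vel + 1.5 * r1 * (pbest - swarm) + 1.5 * r2 * (gbest - swarm)
    swarm = swarm + vel
```

Since every particle stores all of the network's weights, memory grows linearly with both swarm size and parameter count, which is consistent with the memory-hog behaviour noted above.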