A neural network project based on GPT-2
-
First, an investigation into how neural networks work, through the lens of backpropagation and gradient descent. This can be found in the 'backpropagation' folder, where we implement a bare-bones version of PyTorch, using scalar Values instead of Tensors and a backward pass that applies the chain rule to compute the gradients of every neuron. We then work through backpropagation by hand to deepen our understanding of the most important algorithm in machine learning, and in neural networks specifically.
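To make the idea concrete, here is a minimal sketch of what such a Value-based autograd engine can look like. This is an illustration, not the code in the 'backpropagation' folder; the class layout and method names are assumptions for the example.

```python
import math

class Value:
    """A scalar with an autograd graph, in the spirit of a bare-bones PyTorch."""
    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None
        self._prev = set(_children)

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad           # d(out)/d(self) = 1
            other.grad += out.grad          # d(out)/d(other) = 1
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def tanh(self):
        t = math.tanh(self.data)
        out = Value(t, (self,))
        def _backward():
            self.grad += (1 - t * t) * out.grad   # d/dx tanh(x) = 1 - tanh(x)^2
        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then apply the chain rule in reverse.
        topo, visited = [], set()
        def build(v):
            if v not in visited:
                visited.add(v)
                for child in v._prev:
                    build(child)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

# One neuron: y = tanh(w*x + b); gradients land in w.grad, x.grad, b.grad.
x, w, b = Value(2.0), Value(-0.5), Value(0.1)
y = (w * x + b).tanh()
y.backward()
```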

---
Second, the creation of a generative model called makemore. As the name suggests, makemore makes more of whatever data it is given. In this case, we take a dataset of names and try to generate new names from it in several different ways. These include:
- A bigram model (a minimal counts-based sketch appears after this list)
- An MLP
(a three-character context fed through a tanh hidden layer, with the next character sampled from the output distribution via multinomial sampling; see the sketch after this list)
- A WaveNet-style structure
(stacking multiple hidden layers that fuse the context gradually, increasing the model's capacity and giving us more hyperparameters to tune; a shape-level sketch follows the list)
- An RNN, a GRU, and finally the most popular architecture today (circa 2024), a Transformer, whose self-attention core is sketched after this list. Through these, we will train a neural network to create brand-new names that are readable and plausibly structured.
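As referenced above, the bigram model can be sketched in a few lines: count how often each character follows each other character, then sample new names from those counts. The tiny word list and the '.' boundary token below are illustrative assumptions; the real project trains on a much larger names dataset.

```python
import torch

# Toy corpus; the real project uses a large file of names.
words = ["emma", "olivia", "ava", "isabella", "sophia"]
chars = sorted(set("".join(words)))
stoi = {c: i + 1 for i, c in enumerate(chars)}
stoi["."] = 0                       # '.' marks both start and end of a name
itos = {i: c for c, i in stoi.items()}

# Count how often each character follows each other character.
N = torch.zeros((len(stoi), len(stoi)), dtype=torch.int32)
for w in words:
    seq = ["."] + list(w) + ["."]
    for a, b in zip(seq, seq[1:]):
        N[stoi[a], stoi[b]] += 1

# Normalize counts into rows of probabilities, then sample a new name.
P = (N + 1).float()                 # +1 smoothing so no probability is zero
P /= P.sum(dim=1, keepdim=True)

g = torch.Generator().manual_seed(2147483647)
ix, out = 0, []
while True:
    ix = torch.multinomial(P[ix], num_samples=1, generator=g).item()
    if ix == 0:
        break
    out.append(itos[ix])
print("".join(out))
```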
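The MLP variant can be sketched at the level of shapes: each character in a three-character context is embedded, the embeddings are concatenated and passed through a tanh hidden layer, and the output logits score the next character. The layer sizes and the dummy batch here are assumptions for illustration, not the trained model.

```python
import torch
import torch.nn.functional as F

vocab_size, block_size = 27, 3      # 26 letters + '.', 3-character context
n_embd, n_hidden = 10, 200          # illustrative sizes

g = torch.Generator().manual_seed(42)
C  = torch.randn((vocab_size, n_embd), generator=g)            # embedding table
W1 = torch.randn((block_size * n_embd, n_hidden), generator=g)
b1 = torch.randn(n_hidden, generator=g)
W2 = torch.randn((n_hidden, vocab_size), generator=g)
b2 = torch.randn(vocab_size, generator=g)

# One forward pass on a dummy batch of 3-character contexts.
X = torch.randint(0, vocab_size, (32, block_size), generator=g)
Y = torch.randint(0, vocab_size, (32,), generator=g)
emb = C[X]                                   # (32, 3, n_embd)
h = torch.tanh(emb.view(32, -1) @ W1 + b1)   # (32, n_hidden)
logits = h @ W2 + b2                         # (32, vocab_size)
loss = F.cross_entropy(logits, Y)
print(loss.item())

# Sampling picks the next character via torch.multinomial on softmax(logits).
```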
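For the WaveNet-style model, the key structural idea is to fuse the context gradually rather than flattening it all at once. A shape-only sketch, with made-up layer widths and freshly sampled (untrained) weights purely to show how the tensors combine:

```python
import torch

B, T, n_embd = 32, 8, 24            # batch, context length, embedding size
emb = torch.randn(B, T, n_embd)     # stand-in for embedded context characters

# Instead of feeding all 8 context characters into one layer, fuse them in
# pairs, layer by layer: 8 -> 4 -> 2 -> 1 positions as the network deepens.
x = emb
for _ in range(3):
    B_, T_, C_ = x.shape
    x = x.view(B_, T_ // 2, 2 * C_)                   # concatenate adjacent pairs
    x = torch.tanh(x @ torch.randn(2 * C_, n_embd))   # untrained linear + tanh
print(x.shape)                                        # torch.Size([32, 1, 24])
```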
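And for the Transformer, the core building block is a causal self-attention head. The following is a standard single-head sketch with arbitrarily chosen dimensions, not the exact module from this repo:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(1337)
B, T, C = 4, 8, 32              # batch, sequence length, embedding size
head_size = 16
x = torch.randn(B, T, C)        # stand-in for token embeddings

key   = torch.nn.Linear(C, head_size, bias=False)
query = torch.nn.Linear(C, head_size, bias=False)
value = torch.nn.Linear(C, head_size, bias=False)

k, q, v = key(x), query(x), value(x)
# Scaled dot-product scores between every pair of positions.
wei = q @ k.transpose(-2, -1) * head_size**-0.5       # (B, T, T)
# Causal mask: each position may attend only to itself and the past.
tril = torch.tril(torch.ones(T, T))
wei = wei.masked_fill(tril == 0, float("-inf"))
wei = F.softmax(wei, dim=-1)
out = wei @ v                                         # (B, T, head_size)
print(out.shape)                                      # torch.Size([4, 8, 16])
```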

---
Finally, we'll create our own version of GPT. We will build a GPT-style tokenizer (based on byte-pair encoding) to convert raw text into the tokens the model trains on, then combine the pieces above into a full Transformer language model, letting us build a small ChatGPT of our own from scratch.
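A byte-pair-encoding tokenizer of the kind GPT uses can be sketched in a few lines: start from raw bytes, repeatedly find the most frequent adjacent pair of tokens, and merge it into a new token. The function names and the toy string are illustrative, not the repo's API.

```python
from collections import Counter

def get_pairs(ids):
    """Count adjacent token pairs in a sequence of token ids."""
    return Counter(zip(ids, ids[1:]))

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` with the single token `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

text = "aaabdaaabac"                 # toy string; real training uses far more text
ids = list(text.encode("utf-8"))     # start from raw bytes, ids 0..255
merges = {}
for step in range(3):                # a handful of merges for illustration
    pair = get_pairs(ids).most_common(1)[0][0]   # most frequent adjacent pair
    new_id = 256 + step                          # new token beyond the byte range
    ids = merge(ids, pair, new_id)
    merges[pair] = new_id

print(ids)      # compressed token sequence
print(merges)   # the learned merge rules that grow the vocabulary
```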