-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increase training data on VAE #22
Comments
Now we have a script that can use a single image and create numerous different versions of it, and this way we can have a relatively large size training data set, which is critical to training of VAE. Now a training dataset containing 500 images takes 6 Gibs and more, and it takes a lot of RAM on local machine. Also, for a VAE to have better performance, we might considering adding more layers, which also is a computation consuming task. |
Jul 28 updates: Because of the complexity of GAN models, it works well with input size smaller than 64 x 64, but it provides high resolution and sharpness, which means if we decide to use smaller pieces of image, GAN would work better than the VAE. I think right now I would keep working on developing the VAE, try to optimize the code so it would not crash. By the way, I think there might be another way of generating topologies. Sometimes video games makers would need to produce a lot of territories, and they usually do not create those mountains and lands by hand, they have a method called fractal generation. I will look this up when I have spare time. |
Hi everyone, I have just finished configuring the new VAE model. I have tried two different ways, one is to feed it with 20 identical TG plots and the result is that it can perfectly produce copies of the original data which is not what we wanted for a VAE. The other way I used is to feed it with splits of the original TG data. I split the TG plot into 20 pieces with same size and give it to the VAE, then it can produce several fusion plots. We can still observe some of the features we want from the latent spcae: The next step would be do some hyper-parameter configuring. In the mean time, maybe we should feed the model with more diversed input data. |
Based on our discussion Andrew, we think there could be two potential way to overcome the disruption caused by the white background. First method is to crop the original data again and keep only the central part which means we give up the data on the edge but in exchange get multiple full size data. Method two is to using more new data as input. Andrew also mentions another data source that we can use for the model, it could provide us much more data so we can improve the quality of this model by adding much more training data. |
Update on Aug 11: Using method 1, I have generated 144 smaller images. Each one of them has 128 pixels x 128 pixels. Using these smaller images and the updated codes, I now have the new VAE model. It can now generate new and similar images to the input. However there are still two problems with this model. One is that the model still suffers from high noise, which people can observe some white noise from generated data. The other problem is that if we generate two new images and put them together, we can still see a obvious line between two images. This is because the two images are generated stochastically, and not continuous. This could be a problem for later stages, but for now, I will focus on solving the first problem. My initial thoughts are adding the complicity of the model which may use a lot more computation resources and take much more time. I will update here for the further improvement of the model. Below is some of the images generated. I used 001m topology of both upstream and downstream TG data as input. |
I have just uploaded the VAE model to my fork. It uses upstream and downstream 001m topology as input data. I also included 5 demos that produced by the model. I will keep working on this model, and make another update once it has a major improvement. |
Update: We can still observe a few problems about the model. First of all, because there are a lot of blank space in the original images, the model somehow learned this pattern and also tend to produce this same blank spaces. |
VAE turns to be a quite good model when we want to generate similar yet new images. However, as our data has several patterns that is hard to capture, such as the geographical features, the delicate change in depth and the specific shape of our glacier, it would be much helpful if we can use multiple input training data instead of only one.
There are two potential options. One is that we slice the original TG into many slices like a cake, and use the slices as input data. But the pieces of images are connected and related to each other, and the smaller pieces would create difficulties when we want to put generated data back to a larger picture.
Another option would be using data augmentation. It is a algorithm that can generate different versions of original data, it keeps everything except it would rotate, or reflect the data, sometimes we can manually add some noises. This could be a better option because it preserves the whole picture and we would not lose too many features. I am now working a way to generate maybe 1000 new training data based on the original TG data and then put them into a VAE.
I think I can finish the script by Friday and I will upload a new folder to the fork and then keep updating it and documenting it.
The text was updated successfully, but these errors were encountered: