Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AI AI #44

Open
barnoid opened this issue Oct 31, 2016 · 8 comments
Open

AI AI #44

barnoid opened this issue Oct 31, 2016 · 8 comments

Comments

@barnoid
Copy link

barnoid commented Oct 31, 2016

I have an idea to feed 5000 frames of the movie AI to an image captioning neural net and see what comes out. I think 5000 should give at least 50000 words. I may put in some randomish paragraph breaks and simulate chapters somehow. This might take longer than a month to run though.

@superMDguy
Copy link

Wow that sounds cool. Good idea.

@barnoid
Copy link
Author

barnoid commented Nov 1, 2016

I've ripped the DVD and extracted 5036 frames (0.6 frames per second). I've started a run through the neural net (I'm using https://github.com/karpathy/neuraltalk2), it looks like it will take three or four hours. The output is looking quite unvaried so I may have to intervene a lot to make it not extremely boring.

@hugovk
Copy link
Member

hugovk commented Nov 1, 2016

I saw this demo https://twitter.com/kcimc/status/668094003791929344 of https://github.com/karpathy/neuraltalk2 when it was released last November and thought it'd be great for nanogenmo!

@pointyointment
Copy link

Perhaps segmenting each frame according to motion (by comparing with adjacent frames) prior to running the neural net would help with getting more interesting output.

@superMDguy
Copy link

This would be a lot of work, but I wonder if you could modify the neural net, or maybe just write some extra code to compare two descriptions and focus on the differences between frames, so you could talk more about movement.

@barnoid
Copy link
Author

barnoid commented Nov 30, 2016

It is done.

https://github.com/barnoid/AIAI/blob/master/aiai.pdf

Write up here: https://github.com/barnoid/AIAI

@hugovk
Copy link
Member

hugovk commented Nov 30, 2016

A preview:

A regular view of a nighttime city landscape. A picture of
a stately couple in the open distance, and a view of a room
with a window and a window. A room with a bed and a
table. A view of a building with a large clock on the side
of it; a close up of a banana on a table. A stop sign with a
sticker on it; a red and white sign with a sky background.
A large red stop sign sitting in the center of the street.

A red and white sign with a sky background. Clean cars is
seen while strange the brown and green jelly next to them.

@superMDguy
Copy link

superMDguy commented Dec 16, 2016

I recently saw this, which is a neural network trained specifically to describe what's happening in videos. I haven't tried it out myself, but it seems pretty interesting. I don't know if it's generalized enough to work in something as broad as a movie.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants