Skip to content

a light word2vec model which has about 46,000 person names

License

Notifications You must be signed in to change notification settings

inaniwa3/name2vec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

name2vec

a light word2vec model which has about 46,000 person names

How to use

console.gif

Process

source/jawiki-20170201-pages-articles-multistream.xml.bz2
(https://dumps.wikimedia.org/jawiki/20170201/)
  |
  | WikiExtractor.py
  | (https://github.com/attardi/wikiextractor)
  v
corpus/AA/wiki_*
  |
  | extractnames.py
  v
names.txt
  |
  | train.py
  v
names.bin

About

a light word2vec model which has about 46,000 person names

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages