A curated list of resources related to NLP (Natural Language Processing) for Korean + NLP resources in Korean.
Tools specialized for Korean is listed ahead of language-agnostic tools.
Feel free to contribute!
Maintainer: Jaemin Cho
- Tools
- Dataset
- Papers
- Lectures
- Blog Posts / Slides
- Researchers / Institutes
- Journals / Conferences / Events
- Online Forums
- How to contribute
- Hannanum (한나눔) (Java, C) [link]
- KoNLPy (Python) [link]
- Kkma (꼬꼬마) (Java) [link] [paper]
- KoNLPy (Python) [link]
- Komoran (Java) [link]
- KoNLPy (Python) [link]
- Mecab-ko (C++) [link]
- KoNLPy (Python) [link]
- Twitter (Scala, Java) [link]
- KoNLPy (Python) [link]
- .NET, Node.js, Python, Ruby, Elasitc Search bindings
- KTS [paper]
- Arirang (Lucence, Java) [link]
- 깜짝새 [link]
- dparser (REST API) [link]
- Rouzeta [link] [slide] [video]
- seunjeon (Scala, Java) [link]
- RHINO (라이노) [link]
- annie [link]
- KoNLP (R) [link]
- KoNLPy (Python) [link] [paper]
- KoalaNLP (Scala) [link]
- NLTK (Python) [link] [paper]
- gensim (Python) [link]
- FastText (C) [link]
- FastText.py (Python) [link]
- Hangulpy (Python) [link]
- 자동 조사/접미사 첨부, 자모 분해 및 결합
- Hangulize (Python) [link]
- 외래어 한글 변환
- kroman [link]
- Hangul Romanization
- Ruby, Python, NodeJS, Objective-C, Swift
- hangul (Perl) [link]
- Hangul Romanization
- textrankr (Python) [link] [demo]
- TextRank 기반 한국어 문서 요약
- 한국어 Word2Vec [link] [paper]
- 나쁜 단어 사전 [link]
- crowdsourced dic about badword in korean
- Sejong Corpus [link]
- KAIST Corpus [link]
- Yonsei Univ. Corpus
- Korea Univ. Corpus
- Wikipedia Dump [link] [Extractor]
- NamuWiki Dump [link] [Extractor]
- Naver News Archive [link]
- Chosun Archive [link]
- Naver sentiment movie corpus [link]
- sci-news-sum-kr-50 [link]
- Stanford CS224N: Natural Language Processing [link] [YouTube]
- Stanford CS224d: Deep Learning for Natural Language Processing [link] [YouTube]
- NLTK with Python 3 for NLP (by Sentdex) [YouTube]
- LDA Topic Models [link]
- dsindex's blog [link]
- 엑사젠, "혼자 힘으로 한국어 챗봇 개발하기" [link]
- Beomsu Kim, "word2vec 관련 이론 정리" [link]
- CPUU, "Google 자연어 처리 오픈소스 SyntaxNet 공개" (Korean tranlsation of Google blog) [link]
- theeluwin, "python-crfsuite를 사용해서 한국어 자동 띄어쓰기를 학습해보자" [link]
- Lucy Park, "한국어와 NLTK, Gensim의 만남" (PyCon APAC 2015) [link]
- Jeongkyu Shin, "Building AI Chat bot using Python 3 & TensorFlow" (PyCon APAC 2016) [link]
- Changki Lee, "RNN & NLP Application" (Kangwon Univ. Machine Learning course) [link]
- Kyunghoon Kim, "뉴스를 재미있게 만드는 방법; 뉴스잼" (PyCon APAC 2016) [link]
- Hongjoo Lee, "Python 으로 19대 국회 뽀개기" (PyCon APAC 2016) [link]
- 進藤裕之 (translated by Hongbae Kim), "딥러닝을 이용한 자연어처리의 연구동향" [link]
- Hongbae Kim, "머신러닝의 자연어 처리기술(I)" [link]
- Changki Lee, "자연어처리를 위한 기계학습 소개" [link]
- 국어 정보 처리 시스템 경진 대회 [link]
- 언어공학연구회 [link]
- Reddit Machine Learning Top posts [link]
- AI Korea (Facebook Group) [link]
- Tensorflow KR (Facebook Group) [link]
- Bot Group (Facebook Group) [link]
- 바벨피쉬 (Facebook Group) [link]
-
Fork this Repository
-
Edit
-
Create Pull Request! [Help]