Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

王相淳完成 DL102 入学任务啦 #82

Open
lonelyKSA opened this issue Oct 5, 2017 · 0 comments
Open

王相淳完成 DL102 入学任务啦 #82

lonelyKSA opened this issue Oct 5, 2017 · 0 comments
Assignees

Comments

@lonelyKSA
Copy link

基本任务

代码链接
任务效果

由于对二元词组没有概念,所以参照举例说明的二元词组,统计了两个相连的两个字词组,并进行排序。

  1. 思路大体是:读入文件,构成二元词组,统计,输出。
  2. 在做的过程中遇到了中文编码和解码的问题,让我意识到了还是用python3比较方面。
  3. 看了其他同学的作业,有同学在统计时直接调用库,比较简单;还有同学直接用正则来筛掉标点符号,对更广泛的二元词组进行统计,这个也是我在作业时尝试但是没能够实现的。简而言之,先进的工具才是生产力啊.
  4. 进阶作业继续跟进中。。。
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants