Commit 1a64b777 by Yuan

Add new file

parent 884456c3
Summary:
The code still has considerable room for optimization.
Detailed Comments:
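1. When reading the dataset, the many empty data sets need to be filtered out.
2. Store the stopwords in a separate file; do not hard-code them in the source.
3. The data preprocessing is incomplete; for example, useless symbols still need to be removed.
4. heappush needs to be imported, or called as heapq.heappush.
5. The inverted index part still needs to be finished.
6. Overall the code is still rather messy and needs to be tidied up.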
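
To make comments 1 to 5 concrete, here is a minimal sketch of one possible layout. It is not the submitted code: the function names, the `docs` dictionary of `{doc_id: text}`, the one-stopword-per-line file format, and the term-frequency scoring are all assumptions chosen for illustration. It also assumes whitespace tokenization; for Chinese text a word segmenter would have to replace `split()`.

```python
import heapq  # comment 4: heappush lives in the heapq module
import re
from collections import defaultdict


def load_stopwords(path):
    """Read one stopword per line from a text file (comment 2)."""
    with open(path, encoding="utf-8") as f:
        return {line.strip() for line in f if line.strip()}


def preprocess(text, stopwords):
    """Lowercase, replace non-word symbols with spaces, drop stopwords (comment 3)."""
    cleaned = re.sub(r"[^\w\s]", " ", text.lower())
    return [tok for tok in cleaned.split() if tok not in stopwords]


def build_inverted_index(docs, stopwords):
    """Map each term to {doc_id: term frequency} (comment 5)."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        if not text or not text.strip():
            continue  # comment 1: skip empty entries
        for tok in preprocess(text, stopwords):
            index[tok][doc_id] = index[tok].get(doc_id, 0) + 1
    return index


def top_k(index, query, stopwords, k=3):
    """Rank documents by summed term frequency, keeping the k best in a min-heap."""
    scores = defaultdict(int)
    for tok in preprocess(query, stopwords):
        for doc_id, tf in index.get(tok, {}).items():
            scores[doc_id] += tf
    heap = []
    for doc_id, score in scores.items():
        heapq.heappush(heap, (score, doc_id))
        if len(heap) > k:
            heapq.heappop(heap)  # discard the current lowest score
    return [doc_id for score, doc_id in sorted(heap, reverse=True)]
```

Splitting the work into small functions like these also addresses comment 6, since each step (loading, cleaning, indexing, ranking) can be read and tested on its own.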
Overall Score: 78
--------------------------------------------------------------------------------------------------------------------------
Thanks for your efforts.
-Your instructor