Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
P
project_1
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
20200519027
project_1
Commits
1a64b777
Commit
1a64b777
authored
Jul 23, 2020
by
Yuan
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Add new file
parent
884456c3
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
23 additions
and
0 deletions
+23
-0
review
+23
-0
No files found.
review
0 → 100644
View file @
1a64b777
Summary:
代码还有许多 优化的地方
Detailed Comments:
1、在读取数据集这一步,需要把很多空的数据集过滤掉;
2、停用词用一个文件存起来,不要放在代码中;
3、数据预处理部分,完成的不够,比如一些没有用的符号需要去掉等;
4、heappush需要导入,或者是heapq.heappush;
5、倒排索引的这部分还需要继续完成;
6、整体代码还是比较凌乱,需要整理一下;
Overall Score: 78
--------------------------------------------------------------------------------------------------------------------------
Thanks for your efforts.
-Your instructor
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment